CrawlJobs Logo

Technical Program Manager, Core Infrastructure

United States, Menlo Park 199000.00 - 272000.00 USD / Year · Job Posted January 23, 2026
Apply Position
Job Link Share

Job Description

Meta’s Core Infrastructure team seeks a Technical Program Manager (TPM) to lead complex, large-scale projects focused on advancing language model scaling. In this key position, you will collaborate across engineering, hardware, data center, research, and product teams to design, build, and scale foundational hardware, software systems, and tools that support Meta’s AI innovation. You will be responsible for driving the end-to-end integration of new AI hardware and core infra stack, from initial design validation of our software stack through production deployment. This includes developing and refining repeatable frameworks for efficient onboarding, ensuring robust and predictable execution, and proactively resolving technical and organizational challenges to maintain project momentum. You will use your problem-solving, technical acumen, and business insight to streamline onboarding of new AI hardware platforms into Meta’s suite of core infrastructure services. You will communicate transparently across all levels, motivate multidisciplinary teams, and champion best practices to deliver impactful outcomes that advance Meta’s infrastructure.

Job Responsibility

  • Establish and lead effective program teams to ensure alignment and achieve common objectives
  • Work closely with engineering, data center, hardware and business stakeholders to define program requirements, prioritize initiatives, and establish scope, including shaping the roadmap and long-term strategy for partner teams
  • Create and implement communication strategies to proactively share program status, challenges, and risks with stakeholders
  • Drive successful outcomes by actively managing cross-functional dependencies, mitigating risks, and adjusting scope, timeline, and resources as needed
  • Collaborate with cross-functional teams to lead the end-to-end lifecycle of programs, including technical analysis, design, development, testing, implementation, and post-launch support
  • Establish and track key metrics, quality benchmarks, and performance indicators to drive accountability and ensure effective cross-functional execution of program deliverables
  • Anticipate and evaluate complex, long-term infrastructure challenges in close partnership with engineering leaders and key stakeholders
  • Drive product strategy to support and align with key company initiatives
  • Lead process improvements across internal and external teams, streamlining workflows and reducing manual effort through automation

Requirements

  • Bachelor of Science in Electrical Engineering, Computer Science, Mechanical Engineering, or a related technical field, or equivalent experience
  • 12+ years of experience in software engineering, hardware engineering, systems engineering, or technical product/program management
  • Knowledge of software and hardware development for large scale hardware readiness, including end-to-end product development processes
  • Excel at clearly communicating complex technical investments in a simple and understandable manner
  • Experience delivering complex technology programs and products from inception through to successful delivery
  • Knowledge of understanding user needs, gathering requirements, and defining project scope
  • Experience working under your own initiative, across multiple teams, demonstrating critical thinking and providing thought leadership in ambiguous spaces
  • Experience defining and optimizing engineering processes at scale
  • Excel at building cross-functional relationships, thrive amid complex challenges, excel at clearly communicating complex technical investments in a simple and understandable manner
  • Experience in analytical thinking and problem-solving for large-scale systems
  • Experience building work relationships across multi-disciplinary teams and with partners in different time zones
  • Experience defining strategic direction and identifying new opportunities for impact across products, platforms, and programs
  • Experience communicating at the executive level and influencing leadership and technical management teams to drive the development of systems, solutions, and products
  • Knowledge of Large Language Model and machine learning, and scaling distributed systems
  • Demonstrated experience of identifying new opportunities for the larger organization and influencing the appropriate stakeholders
  • Proven commitment to scale infrastructure for large scale AI distributed compute systems

Nice to have

  • Knowledge of software and hardware development for large scale system readiness
  • Excel at clearly communicating complex technical investments in a simple and understandable manner

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Technical Program Manager, Core Infrastructure

8 matching positions

Technical Program Manager, Core Infra

Meta’s Core Infrastructure team seeks a Technical Program Manager (TPM) to lead ...
Location
Location
United States , Menlo Park
Salary
Salary:
168000.00 - 234000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor of Science in Electrical Engineering, Computer Science, Mechanical Engineering, or a related technical field, or equivalent experience
  • 10+ years of experience in software engineering, hardware engineering, systems engineering, or technical product/program management for large scale programs
  • Demonstrated knowledge of software and hardware development for large scale data center readiness, including end-to-end product development processes
  • Excel at clearly communicating complex technical investments in a simple and understandable manner
  • Experience delivering complex technology programs and products from inception through to successful delivery
  • Knowledge of understanding user needs, gathering requirements, and defining project scope
  • Experience working under your own initiative, across multiple teams, demonstrating critical thinking and providing thought leadership in ambiguous spaces
  • Experience defining and optimizing engineering processes and public cloud vendor technical milestones at scale
  • Excel at building cross-functional relationships, thrive amid complex challenges, excel at clearly communicating complex technical investments in a simple and understandable manner
  • Demonstrated experience of identifying new opportunities for the larger organization and influencing the appropriate stakeholders
Job Responsibility
Job Responsibility
  • Establish and lead effective program teams to ensure alignment and achieve common objectives
  • Partner closely with engineering, cloud engineering, data center, hardware and business stakeholders to define program requirements, prioritize initiatives, and establish scope, including shaping the roadmap and long-term strategy for partner teams
  • Develop and implement communication strategies to proactively share program status, challenges, and risks with stakeholders
  • Drive successful outcomes by actively managing cross-functional dependencies, data center & public cloud vendor tech milestones, mitigating risks, and adjusting scope, timeline, and resources as needed
  • Collaborate with cross-functional teams to lead the end-to-end lifecycle of programs, including technical analysis, design, development, testing, implementation, and post-launch support
  • Establish and track key metrics, quality benchmarks, and performance indicators to drive accountability and ensure effective cross-functional execution of program deliverables
  • Anticipate and evaluate complex, long-term infrastructure challenges in close partnership with engineering leaders and key stakeholders
  • Drive product strategy to support and align with key company initiatives such region and public cloud turn-ups
  • Lead process improvements across internal and external teams, streamlining workflows and reducing manual effort through automation
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Infrastructure Hardware Technical Program Manager- Hardware Systems

Meta is seeking a Technical Program Manager (TPM) with experience in server and ...
Location
Location
United States , Menlo Park
Salary
Salary:
168000.00 - 234000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.S. in Computer Science or a related technical discipline, or equivalent experience
  • 10+ years of experience in software engineering, systems engineering, hardware engineering, or technical product/program management experience
  • Experience delivering tech programs or products from inception to delivery
  • Knowledge of user needs, gathering requirements, and defining scope
  • Experience operating under your own initiative across multiple teams, demonstrated critical thinking, and thought leadership
  • Communication experience and experience working with technical management teams to develop systems, solutions, and products
  • Organizational, coordination and multi-tasking experience
  • Analytical and problem-solving experience with large-scale systems
  • Experience establishing work relationships across multidisciplinary teams and multiple partners in different time zones
Job Responsibility
Job Responsibility
  • Lead technical program management of next-generation hardware platform(s) for Meta Infrastructure in a matrix organization covering a range of areas (Data Center, Network, Hardware Systems, Infrastructure Engineering, Software Engineering, Capacity Management) and across multiple physical locations
  • Own overall program success, spanning the end-to-end development of the hardware product. spanning internal and external development work through successful ingestion into Meta’s infrastructure and support of production workloads at scale
  • Develop and manage programs including defining scope, requirements, development model, schedules, and deliverables with engineering teams, partners, and stakeholders
  • Influence broader roadmaps through product interception and market fit, competitive analysis, and feasibility studies
  • Provide hands-on program management during analysis, design, development, testing, implementation, and post implementation phases
  • Partner with Engineering counterparts across a range of specialties as well as other teams to define product roadmaps
  • Drive overall communication to leadership, stakeholders and core working teams in regular cadence
  • Drive internal process improvements across multiple teams and functions
  • Analyze infrastructure needs and produce hardware designs and prototypes to meet those needs
  • Manage and drive strategic vendor engagement and deliveries
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Senior Technical Program Manager - Datacenter Infrastructure

The Datacenter leasing Senior Technical Program Manager will be part of a team r...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Civil, Electrical, Mechanical, Telecom Engineering, or related technical field AND 4+ years’ experience in engineering, operations, commissioning or technical program management
  • 3+ years’ experience managing cross functional and/or cross-team projects
  • 3+ years of experience in data center design, infrastructure, and critical environments
  • Broad infrastructure knowledge across mechanical, electrical, and controls systems with a focus on Datacenter integration and performance
  • Familiarity with key industry standards and best practices, including ASHRAE, Uptime Institute, ANSI, and NFPA
  • Familiarity with high-density power and cooling solutions, sustainability initiatives, and emerging technologies for AI workloads
  • Ability to meet Microsoft, customer and/or government security screening requirements
Job Responsibility
Job Responsibility
  • Act as a Subject Matter Expert (SME) and provide global program support
  • Drive technical solutions for leased datacenters in partnership with Microsoft’s and Lessor’s core engineering teams
  • Evaluate lessor’s design proposal against technical requirements and mitigate non-compliance through technical and commercial solutions
  • Assesses lessor’s compliance through review of technical documents, site assessments, and stakeholder engagement
  • Partner with internal and external stakeholders during construction, RFS, and operations handover to unblock any technical issues risking the on-time delivery of Datacenter to customers
  • Drive cost impact analysis on non-compliance and specification changes. Escalate and provide visibility and feedback to leadership on cost drivers
  • Partner with Microsoft Engineering, Integration, Security, Operations, and Energy teams on resolution management
  • Drive partner accountability on contractual milestones and commercial commitments
  • Own overall schedule tracking, risk identification, blockers, and mitigation for the assigned projects
  • creating clear visibility for leadership
  • Fulltime
Read More
Arrow Right
New

Technical Program Manager, Enterprise

As a Technical Program Manager, you will partner with our Frontier Agent Enginee...
Location
Location
United States , New York, NY; San Francisco, CA
Salary
Salary:
211200.00 - 264000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience as a Technical Program Manager or in a technical leadership role managing complex, large-scale software engineering or machine learning development projects
  • 2+ years of dedicated experience managing programs focused directly on core engineering infrastructure, platform services, or distributed systems
  • Strong foundational understanding of the Generative AI lifecycle, including LLM utilization for structured downstream tasks, model fine-tuning, and performance evaluation
  • Proven track record of presenting to and influencing executive-level stakeholders, with the ability to translate complex technical challenges into clear business impacts
  • Advanced proficiency with iterative development methodologies and modern project management tooling (Jira, Linear, etc.)
  • An insatiable appetite for learning and deeply engaging with modern ML/GenAI practices and infrastructure
Job Responsibility
Job Responsibility
  • End-to-End Program Ownership: Own the strategic planning, scheduling, and high-velocity execution of multiple enterprise-grade programs, ensuring on-time delivery against aggressive product goals
  • Cross-Functional Architecture Integration: Manage complex dependencies and technical communication across core teams (e.g., Platform, Forward Deployed Engineering, Product) to seamlessly deliver frontier agents to our enterprise customers
  • Technical Translation & Executive Influence: Synthesize deep technical complexities into concise, actionable insights for both engineers and C-suite stakeholders
  • Risk & Dependency Mitigation: Proactively identify, track, and architect mitigations for technical risks unique to enterprise AI deployment
  • Process Evolution: Modernize and scale agile execution frameworks (e.g., Jira, Linear) to support rapid, iterative machine learning and software development lifecycles
  • Metrics-Driven Accountability: Define, track, and report on key program health metrics, delivery forecasts, and engineering bottlenecks directly to executive leadership
What we offer
What we offer
  • Equity
  • comprehensive health, dental and vision coverage
  • retirement benefits
  • learning and development stipend
  • generous PTO
  • commuter stipend (potentially)
  • Fulltime
Read More
Arrow Right

Technical Program Manager- AI Cluster Validation

We are seeking a Technical Program Manager to lead execution of AI cluster engin...
Location
Location
United States , Austin
Salary
Salary:
162640.00 - 243960.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience leading complex hardware or AI infrastructure programs with ownership across bring-up, validation, and deployment phases
  • Strong technical understanding of GPU-based AI systems, rack architectures, and datacenter infrastructure
  • Proven ability to manage ambiguity, drive debug execution, and lead cross-functional teams without direct authority
  • Strong written and verbal communication skills, including executive-level status reporting
  • Proficiency with program management and execution tools (Jira, Confluence, dashboards, Excel/PowerPoint)
  • Bachelor's or master's degree in systems, EE, CS, or related engineering discipline
  • PMP, Scrum Master, or equivalent program management training
Job Responsibility
Job Responsibility
  • Define, plan, and drive program plans for AI infrastructure systems validation and readiness, including server integration, rack bring-up, and cluster-scale deployment readiness
  • Create and maintain core PM artifacts: schedules, dependency maps, resource forecasts, risk/issue logs, and program dashboards/status reports
  • Identify and drive mitigation plans for issues/risks, including cross-team escalations and corrective actions across multiple engineering areas
  • Drive regular execution reviews with engineering teams and provide concise, data-driven updates to senior leadership
  • Own program execution for GPU-based AI platforms, spanning system bring-up, qualification, scale readiness, and deployment validation across server, rack, and cluster levels
  • Drive alignment across GPU, CPU, firmware, BIOS/BMC, and system teams to ensure readiness for scale testing and customer workloads
  • Track platform issues, and debug dependencies
  • ensure risks are clearly documented, owned, and mitigated
  • Own program planning and execution for multi-node and multi-rack scale testing, including test strategy, scheduling, coverage tracking, and readiness gates
  • Lead end-to-end delivery of rack-level AI solutions, including compute trays, switch trays, cabling, power, cooling, and management infrastructure
  • Fulltime
Read More
Arrow Right

Staff Technical Program Manager - FinOps

The FinOps function is responsible for financial accountability, visibility, and...
Location
Location
United States , New York
Salary
Salary:
175200.00 - 284400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6–10+ years of relevant experience working at the intersection of engineering, infrastructure, data, or finance in a cloud-native or SaaS environment
  • Proven experience partnering closely with engineering teams to influence decisions involving cloud infrastructure, data platforms, AI/ML workloads, or SaaS spend
  • Working understanding of modern cloud-native architectures, including core components such as compute, storage, networking, data pipelines, and managed services—enough to engage credibly with engineers on design, tradeoffs, and cost drivers
  • Strong foundation in cost analysis, forecasting, budgeting, and variance management, with the ability to translate data into clear, actionable insights
  • Comfort working directly with data, including writing SQL (or effectively using AI-assisted tools to do so) to explore datasets, validate assumptions, and answer ad hoc questions
  • Experience building clear, high-quality dashboards and BI artifacts that are not only accurate, but intuitive and delightful for engineers and leaders to use
  • Demonstrated success driving adoption and behavior change—embedding cost awareness into day-to-day engineering workflows, not just producing reports
  • Experience owning and delivering cross-functional programs end-to-end, often without direct authority or a dedicated team
  • Familiarity with FinOps principles and practices (e.g., shared ownership, showback/chargeback, unit economics, optimization strategies)
  • Strong communication skills, with the ability to tailor complex technical and financial concepts for engineering, finance, and executive audiences
Job Responsibility
Job Responsibility
  • Monitors and analyzes engineering spend across cloud, AI/ML, data platforms, and SaaS, identifying trends, anomalies, and optimization opportunities
  • Builds and maintains forecasts for engineering spend, partnering with Finance and engineering leaders to understand drivers, assumptions, and risks
  • Partners with engineering, product, and TPMs to incorporate cost considerations into roadmaps, architectural decisions, and execution plans
  • Leads cost optimization initiatives, such as rightsizing, commitment strategies, and workload efficiency improvements, in collaboration with engineering owners
  • Creates and maintains dashboards and reporting that make spend understandable and actionable for both engineers and executives
  • Implements FinOps practices and processes, including showback/chargeback models, unit economics, and cost ownership frameworks
  • Partners on tooling and automation, working with data and engineering teams to improve cost visibility, forecasting accuracy, and operational efficiency
  • Drives alignment and behavior change, helping teams balance cost, performance, reliability, and velocity through data-driven decision making
What we offer
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • Fulltime
Read More
Arrow Right

Senior Technical Program Manager

The CO+I AI Delivery team is focused on delivering various platform services to ...
Location
Location
United States , Multiple Locations
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree AND 4+ years experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience
  • 2+ years of experience managing cross-functional and/or cross-team projects
  • Bachelor's Degree AND 8+ years experience engineering, product/technical program management, data analysis, or product development OR equivalent experience
  • 6+ years of experience managing cross-functional and/or cross-team projects
  • 1+ year(s) of experience reading and/or writing code (e.g., sample documentation, product demos)
Job Responsibility
Job Responsibility
  • Owns end-to-end cloud data center supply strategy and execution, ensuring capacity availability, scalability, and delivery readiness aligned to cloud and AI demand (CPU/GPU)
  • Define strategy and success by setting program goals, OKRs/KPIs, and prioritized deliverables for multi-product cloud supply planning programs
  • Own end-to-end execution of cloud data center supply planning programs aligned to cloud and AI workload demand (CPU/GPU)
  • Drive execution rigor and alignment by running structured programs and execution reviews, establishing clear operating rhythms, and enforcing accountability
  • Partner across engineering and the ecosystem to align technical requirements, designs, and standards with supply planning and delivery timelines
  • Measure and improve performance by monitoring telemetry, KPIs, and AI-driven insights to assess program health and outcomes
  • Build durable planning capabilities by owning core supply planning artifacts (execution plans, roadmaps, status reporting, leadership updates)
  • Fulltime
Read More
Arrow Right

Technical Program Manager

The Azure Maia team is at the forefront of software+hardware AI innovation! Azur...
Location
Location
United States , Redmond
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree AND 4+ years experience in product/service/project/program management or software development OR equivalent experience
  • 1+ years of experience managing cross-functional and/or cross-team projects with emphasis on large scale cloud Hardware & infrastructure
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Job Responsibility
Job Responsibility
  • Drive technical projects from requirements through launch, including managing complex project schedules, removing roadblocks and keeping processes running smoothly
  • Partner closely with cross-functional teams to define, prioritize, and implement features, infrastructure, processes, and workflows
  • Partner with Azure service teams to drive integration of Azure Maia hardware into core Azure cloud services to make it accessible at scale
  • Communicate with precision and clarity to drive alignment in both small and broad teams
  • Effectively collaborate, build relationships, and influence cross-functional team members to achieve product perfection
  • Serve as an escalation point for troubleshooting complex integration issue and implement fixes
  • Create documents and internal tools to facilitate delivery
  • Proactively develop systems and processes to ensure performance and reliability
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right