Meta’s Core Infrastructure team seeks a Technical Program Manager (TPM) to lead complex, large-scale projects focused on advancing language model scaling. In this key position, you will collaborate across engineering, hardware, data center, research, and product teams to design, build, and scale the foundational open source software (OSS) systems and tools that support Meta’s AI innovation. This role sits at the intersection of building public cloud and system software integration, requiring technical expertise across OSS systems, inference, and foundations compute systems. You will be responsible for driving the end-to-end integration of new hardware and the core infrastructure stack, from initial design validation of our software stack through production deployment across both data centers and public cloud. This includes developing and refining repeatable frameworks for efficient onboarding, ensuring robust and predictable execution, and proactively resolving technical and organizational challenges to maintain project momentum. You will use your problem-solving skills, technical acumen, and business insight to streamline the onboarding of new hardware platforms into Meta’s suite of core infrastructure services. You will communicate transparently across all levels, motivate multidisciplinary teams, and champion best practices to deliver impactful outcomes that advance Meta’s infrastructure.
Job Responsibilities:
Establish and lead effective program teams to ensure alignment and achieve common objectives
Partner closely with engineering, cloud engineering, data center, hardware and business stakeholders to define program requirements, prioritize initiatives, and establish scope, including shaping the roadmap and long-term strategy for partner teams
Develop and implement communication strategies to proactively share program status, challenges, and risks with stakeholders
Drive successful outcomes by actively managing cross-functional dependencies and data center and public cloud vendor technical milestones, mitigating risks, and adjusting scope, timeline, and resources as needed
Collaborate with cross-functional teams to lead the end-to-end lifecycle of programs, including technical analysis, design, development, testing, implementation, and post-launch support
Establish and track key metrics, quality benchmarks, and performance indicators to drive accountability and ensure effective cross-functional execution of program deliverables
Anticipate and evaluate complex, long-term infrastructure challenges in close partnership with engineering leaders and key stakeholders
Drive product strategy to support and align with key company initiatives such as region and public cloud turn-ups
Lead process improvements across internal and external teams, streamlining workflows and reducing manual effort through automation
Requirements:
Bachelor of Science in Electrical Engineering, Computer Science, Mechanical Engineering, or a related technical field, or equivalent experience
10+ years of experience in software engineering, hardware engineering, systems engineering, or technical product/program management for large-scale programs
Demonstrated knowledge of software and hardware development for large-scale data center readiness, including end-to-end product development processes
Excel at clearly communicating complex technical investments in a simple and understandable manner
Experience delivering complex technology programs and products from inception through to successful delivery
Experience understanding user needs, gathering requirements, and defining project scope
Experience working under your own initiative, across multiple teams, demonstrating critical thinking and providing thought leadership in ambiguous spaces
Experience defining and optimizing engineering processes and public cloud vendor technical milestones at scale
Excel at building cross-functional relationships and thrive amid complex challenges
Demonstrated experience identifying new opportunities for the larger organization and influencing the appropriate stakeholders
Nice to have:
Advanced knowledge of open source software, such as Kubernetes, for developing large-scale inference systems