This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft is laying the foundation for the next generation of cloud and AI platforms—systems that must operate reliably, safely, and at global scale. This role is central to ensuring that engineering execution matches product intent across complex, interdependent infrastructure systems. As the Principal Technical Program Manager - AI Frameworks you will drive how work happens at scale. You will bring order, clarity, and operational discipline, ensuring teams can deliver reliably and sustainably. You will orchestrate execution across multiple services, manage critical-path dependencies, surface risks before they become issues, and create the transparency needed for leaders to make high-quality decisions. You will act as an essential partner to the head of product management and engineering technical leads. This role is ideal for someone who thrives in ambiguity, can navigate across organizational boundaries, and excels in turning complex engineering realities into actionable clarity. You will bring rigor, insight, and a systems-oriented mindset to an environment where infrastructure reliability, developer velocity, and operational excellence are paramount. Much of this role is ambiguous; success involves transforming ambiguity into actionable clarity. The successful candidate thrives in uncertain environments, is committed to delivering high-quality results, embraces ongoing learning and adaptability, works collaboratively with cross-functional teams, and demonstrates strong ownership and accountability for their contributions.
Job Responsibility:
Execution orchestration: Drive multi-team execution across numerous infrastructure components, services, and systems. Maintain the single source of truth for execution health, delivery status, and alignment to product intent. Ensure schedule integrity and build mechanisms to detect slippage or divergence early
Dependency management: Identify, track, and manage dependencies across org boundaries, plus writing white papers
Build and maintain critical-path plans that reveal coupling, blockers, and downstream risks
Operations: Establish and run cross-team rhythms of business (RoB), including reviews, readiness checkpoints, and launch orchestration
Risk identification and mitigation: Surface execution risks, capacity constraints, misalignment, and timeline threats before they impact delivery. Drive mitigation strategies without altering product priorities or increasing scope. Reveal implications and tradeoffs with clarity and objectivity
Governance and compliance: Ensure teams meet security, privacy, and compliance milestones. Track audit status and ensure risks are surfaced and addressed proactively
Resource utilization: Develop and maintain visibility into team utilization, workload distribution, and bottlenecks. Highlight when teams are overloaded, underutilized, or misaligned with business priorities
Accountability: Ensure engineering execution supports reliability, availability, and operational health requirements. Track execution debt and ensure teams have mechanisms to resolve or mitigate it
Requirements:
Bachelor's Degree AND 8+ years experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience
6+ years of experience managing cross-functional and/or cross-team projects
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Nice to have:
10+ years of infrastructure technical project and program management experience in a large tech company working with cloud infrastructure, AI/ML systems, or hyperscale distributed services
Familiarity with security and privacy compliance reviews, operational readiness, and service reliability practices
Ability to model long-range execution plans, capacity forecasts, and critical-path scenarios
Exceptional written and verbal communication skills, with the ability to influence across organizations
Systems-thinking mindset with a strong bias for transparency, accountability, and operational excellence
High tolerance for ambiguity
Ability to work with executive stakeholders across business units