This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are part of the Core AI Platform team at Microsoft, that builds the platforms, services, and operating mechanisms that power Microsoft’s rapidly growing AI model ecosystem. Our mission enables every developer to achieve more by using AI tools and our platform capabilities to infuse AI in their applications and services. This role is in the CoreAI Infrastructure team which powers the Microsoft AI Foundry Services. We manage the GPU fleet to run Microsoft’s AI services on a planet scale. We are in the eye of the storm to accelerate the transition and scaling of Generative AI models to work with the latest AI aware application. Our team focuses on fleet efficiency, reliability and the agility to bring the latest AI innovations from research into production.
Job Responsibility:
Define product strategy, roadmap, and success metrics for platform capabilities that increase overall fleet efficiency, while delivering cost-effective, high-performance solutions for customers running AI workloads
Define and track efficiency metrics, manage dependencies and lead experimentation to validate hypotheses and drive optimizations
Partner with engineering, data science, finance, and partner teams
manage dependencies and unblock execution in ambiguous environments
Build crisp narratives and dashboards that support decision-making and keep stakeholders aligned on progress, tradeoffs, and outcomes
Translate customer needs and platform constraints into clear requirements and iterative delivery plans
Identify high-impact opportunities to increase capacity efficiency
Provide clarity in ambiguous environments, influence stakeholders, and foster a culture of innovation and accountability
Requirements:
Bachelor's Degree AND 4+ years experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience
2+ years of experience managing cross-functional and/or cross-team projects
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Nice to have:
5+ years of experience designing and shipping complex products for developers, ML professionals, or similar audiences
3+ years of experience with distributed platform ecosystems (e.g., multi‑tenant systems, real‑time processing, batch computing)
Proven experience in technical program management, preferably supporting AI infrastructure or cloud services
Strong understanding of GPUs, virtual machines, operating systems, and cloud infrastructure fundamentals
Experience navigating ambiguity and driving clarity across complex, cross‑functional initiatives
Experience building solutions using Azure, AWS, or Google Cloud
Experience writing Python, including for machine learning workloads
Experience with machine learning platforms
Experience driving complex, multi‑stakeholder processes and cross‑team programs
Ability to build effective relationships, influence, and collaborate at all organizational levels
Strong verbal and written communication skills for a global audience
Strong business acumen with an analytical, detail‑oriented approach
Experience leading cross‑functional teams and managing complex dependencies
Ability to thrive in fast‑paced environments with a bias for action