
Principal Group Software Engineering Manager

United States, Redmond · Employment contract · 165600.00 - 296400.00 USD / Year · Job Posted May 27, 2026

Job Description

M365 Copilot inference is a high-impact engineering team advancing applied AI and large-scale machine learning across Microsoft. We design and operate the platform powering Microsoft 365 Copilot experiences. Our team operates at massive GPU scale across multiple regions and SKUs in global datacenters. We build the core LLM API, routing, capacity, and control plane services that turn that fleet into Copilot experiences.

We are hiring a Principal Group Software Engineering Manager to own GPU fleet health, capacity intake and planning, and automated model deployment for Copilot. This is one of the most strategic leadership roles in Copilot: every feature, experiment, and model launch flows through the systems this leader owns. You will lead existing teams, grow the org, and build the control plane that turns capacity management from a manual, ticket-driven process into an automated, self-driven platform.

You will own the end-to-end GPU fleet health and capacity platform, establishing a single source of truth with strong observability across hardware, hosts, and workloads to drive utilization and reliability. You will design and scale capacity intake, planning, and deployment, reducing models' time-to-production and meeting SLAs for priority workloads through automation and data-driven operations. You will build a unified control plane that connects intake, planning, deployment, and fleet operations, enabling global optimization across cost, latency, compliance, and flexible model scaling (0→1 platform ownership).

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Job Responsibility

  • Build and lead a high-performing organization of engineering managers and senior engineers across capacity buildouts/automation, capacity planning, and the control plane.
  • Set the strategy and roadmap for Copilot capacity management and the control plane.
  • Drive execution across existing teams today, with a clear plan to grow the org as control plane scope expands.
  • Partner deeply with Copilot, AI Core, and Azure to align demand, supply, and COGS for Copilot workloads.
  • Own live-site, reliability, and operational excellence for the capacity surface area.
  • Establish metrics and SLAs for intake latency, fleet utilization, automation coverage, and time-to-deploy; use them to guide investment decisions.
  • Coach and grow managers and senior ICs; raise the engineering bar; recruit experienced platform leaders into the team.
  • Represent capacity in executive reviews and cross-org leadership forums; communicate trade-offs between cost, speed, and reliability with clarity.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice to have

  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • 6+ years people management experience.
  • Experience as a manager of managers leading distributed-systems or platform engineering teams at scale.
  • Demonstrated success building and operating large-scale distributed systems, control planes, orchestration platforms, or cloud infrastructure.
  • Track record of taking a platform from concept to broad production adoption — design, staffing, execution, and live-site ownership.
  • Systems thinking: able to identify and remove bottlenecks across intake, planning, scheduling, deployment, and operations.
  • Experience driving multi-org programs and influencing partner teams without direct authority.
  • Ability to translate ambiguous business needs into clear engineering strategy, priorities, and execution plans.
  • Hiring, coaching, and people-development track record across multiple levels.
  • Experience with large capacity fleets, AI/ML infrastructure, or large-scale inference or training systems.
  • Experience with capacity planning, fleet management, or supply/demand optimization at hyperscale.
  • Familiarity with Azure, M365, and AI workloads; understanding of inference and training cost models (COGS, utilization, throughput per GPU).
  • Background building automation, control planes, or orchestration platforms from 0→1.


Similar Jobs for

Principal Group Software Engineering Manager

8 matching positions

Principal Group Software Engineering Manager

Would you be excited about building a global-scale, Kubernetes-based service pla...
Location: United States, Redmond
Salary: 165600.00 - 296400.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 6+ years people management experience
  • 2+ years of people management experience as a manager of managers
Job Responsibility
  • Lead and grow multiple engineering teams responsible for delivering the COSMIC Application Platform, while cultivating a team culture that embraces Microsoft’s values of Respect, Accountability, and Integrity
  • Own end-to-end delivery of platform capabilities—from planning through execution and operational readiness—ensuring the platform scales reliably and meets high bars for safety, availability, and compliance
  • Partner with product management and architects to translate platform priorities into shipped capabilities, balancing near-term delivery with long-term platform evolution
  • Drive execution across teams and dependencies, aligning engineering work across the COSMIC organization and partners to deliver cohesive, high-quality platform experiences at scale
  • Establish and track clear success metrics for platform adoption, reliability, and developer experience, using data to guide prioritization and continuous improvement
  • Build a strong leadership bench by mentoring Engineering Managers and senior technical leaders, fostering a culture of accountability, craftsmanship, and continuous learning
  • Champion operational excellence by ensuring platform capabilities reduce toil, improve incident outcomes, and make best practices the default for services running on COSMIC
  • Represent the App Platform in cross-org forums, communicating progress, risks, and tradeoffs to stakeholders, and ensuring alignment with broader Substrate and E+D objectives
  • Fulltime

Principal Group Software Engineering Manager - Azure Storage

Are you passionate about distributed systems, massive scalability, and durabilit...
Location: Australia, Sydney
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • 4 years people management experience minimum
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
  • Oversees partnership with appropriate stakeholders to determine user requirements within and across teams for multiple solutions or product lines
  • Oversees teams and provides technical leadership for identification of dependencies and the development of design documents for a product, application, service, or platform
  • Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Oversees teams to drive multiple groups' project plans, release plans, and work items in coordination with appropriate stakeholders across products
  • Leads the resolution of complex site incidents, oversees Designated Responsible Individuals (DRIs), and directs the work of other engineers across product lines
  • Keeps informed of and communicates new standards to ensure that product development scales to customer requirements, applies best practices for meeting scaling needs and performance expectations, and holds accountability for products that do not meet expectations
  • Fulltime

Principal Software Engineering Manager - Data Science & Engineering

The MSRC Data Science team is responsible in building data pipelines, data minin...
Location: United States, Redmond
Salary: 139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Leads team on the disciplined use of, and improving artificial intelligence (AI) tools and practices across the software development lifecycle (SDLC)
  • Guides team on proactively taking responsibility for the content of their AI-generated requirements, design documents, code, and other assets, and assisting other members of the team to do the same
  • Leads team on incorporating Responsible AI practices into the SDLC to ensure appropriate controls over AI-generated assets
  • Coaches team on applying SDLC and engineering health measures (e.g., Accelerate, SPACE framework, Engineering System Success Playbook [ESSP]) to guide improvements to processes and practices, especially those involving AI
  • Leads team on experimenting with AI tools and practices to improve their own capabilities, and providing recommendations on how to adopt them to others
  • Reviews debugging tools, tests, logs, telemetry, and other methods, and acts as an expert for others to proactively verify assumptions while developing code before issues occur across products in production
  • Guides team to perform machine learning/data extraction, transformation, and loading (ETL) pipelines (e.g., data collection, cleaning) based on data prepared
  • Guides the architecture of scalable pipelines and datasets
  • Influences the direction of the team
  • Begins to anticipate potential data pipeline issues and provides solutions
  • Fulltime

Principal Group Engineering Manager

GitHub is growing its Software Engineering team and we’re seeking experienced pr...
Location: India, Hyderabad
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • 9+ years' experience in software engineering, computer science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python
  • OR associate’s degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 7+ years' experience in software engineering, computer science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python
  • OR bachelor's degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 5+ years' experience in software engineering, computer science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python
  • OR master's degree in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field AND 3+ years' experience in software engineering, computer science, or related technical discipline with proven experience maintaining and delivering production software coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, Ruby, Rust, or Python
  • OR doctorate in Computer Science, Electrical Engineering, Electronics Engineering, Math, Physics, Computer Engineering, Computer Science, or related field
  • OR equivalent experience
  • 5+ year(s) direct people management or leadership experience
Job Responsibility
  • Engage with project managers and technical leads to determine and refine requirements for features and scenarios, ensuring alignment with customer needs and project goals
  • Lead cross team execution and delivery – lead discussions and create proposals for technical solutions, testing design hypotheses, and refining code plans to ensure robust architecture and high-quality outcomes
  • Drive architecture, quality and operational excellence for products, services, or features
  • Mentor team members in best practices for producing maintainable and extensible code
  • Identify and manage dependencies, risk and strategic tradeoffs
  • Collect, analyze, and integrate data to inform engineering decisions, driving product refinement and ensuring solutions meet performance expectations
  • Identify potential risks in projects and develop mitigation strategies to ensure successful project delivery and minimize disruptions
  • Partner across functions to align engineering with customer and business needs
  • Hire, coach, and develop top engineering talent, fostering a culture of innovation and continuous learning
  • Fulltime

Principal Group Engineering Manager - FastTrack

Microsoft FastTrack delivers migration and deployment services to some of the wo...
Location: United States, Redmond
Salary: 163000.00 - 296400.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Job Responsibility
  • Lead Technology-Driven Transformation of FastTrack Delivery
  • Drive Practitioner Productivity at Global Scale
  • Own Service Platform Outcomes
  • Partner Across Product, Engineering, and Delivery Organizations
  • Lead Organizational Change
  • Fulltime

Principal Group Engineering Manager

Microsoft’s Azure Data engineering team is leading the transformation of analyti...
Location: India, Bangalore
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python and building large scale software systems
  • OR equivalent experience
  • 5+ years of experience leading software engineering teams of 15 or more engineers
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR Master's Degree in Computer Science or related technical field AND 13+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • 10+ years of experience leading software engineering teams of 15 or more engineers
  • 5+ years of software development experience building scalable, distributed services using C# or other high level languages
  • 5+ years of experience with multi-threaded/parallel programming
Job Responsibility
  • Collaborate with Product managers on product specifications and requirements
  • Collaborate with the US team on technical aspects and support the local team to ramp up, contribute and support large scale business critical Azure services
  • Guide design and development of high quality software incrementally
  • Solid execution: Plan, schedule and deliver quality software incrementally
  • Maintain and operate online services
  • Review changes to the product codebase and provide constructive feedback that aligns with industry best practices to mentor and grow junior and senior engineers
  • Participate respectfully in design, architecture, and execution reviews and other team discussions
  • Listen to others' perspectives and feedback and take action on valid feedback
  • Provide constructive feedback to others
  • Partner with other teams in the organization to leverage the work and ideas of others to deliver efficiently
  • Fulltime

Principal Group Engineering Manager

Microsoft Specialized Clouds combines the power of edge platforms, devices, and ...
Location: India, Bangalore
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • 15+ years of professional software engineering experience, including designing, building, and operating distributed, cloud-scale services
  • 5+ years of engineering leadership experience, including managing managers and leading multi-team engineering organizations (M2+)
  • Deep experience with network device platforms — specifically Arista (EOS, eAPI, CloudVision) and/or Cisco (NX-OS, DCNM/NDFC) — including device programming, configuration management, and automation
  • Strong background in device programming and network automation — building systems that programmatically configure, validate, and manage network device state at scale
  • Experience with Azure Resource Provider (RP) engineering — ARM resource modeling, deployment pipelines, control-plane architecture, and resource lifecycle management
  • Solid understanding of L2/L3 networking fundamentals: spine-leaf architecture, VXLAN, overlay/underlay networking, BGP, and data center network design
  • Proven ability to set technical direction and architectural strategy for complex platforms spanning multiple components and partner teams
  • Demonstrated success owning end-to-end delivery of customer-critical services, including design, development, release, and live-site operations
  • Strong experience driving operational excellence, including reliability, incident management, automation, and cost optimization for production services
  • Proven track record of leading organizational transformation — such as quality resets, reliability turnarounds, code yellow resolution, or engineering culture change across an engineering org
Job Responsibility
  • Lead engineering teams through the design, architecture, development, testing, and operations of the Network Fabric platform — the cloud-managed networking layer for Azure Operator Nexus and Azure Local
  • Drive execution excellence across the full software lifecycle: semester planning, feature delivery, release management, and live-site operations
  • Own engineering commitments across multiple workstreams including network device programming, Azure Resource Provider development, fabric orchestration, and network configuration management
  • Ensure services meet Microsoft standards for quality, reliability, security, and operational readiness
  • Establish and enforce engineering best practices — including test-driven development, automated validation, secure development lifecycle (SDL/SFI), and continuous integration
  • Continue and accelerate the ongoing engineering transformation: driving quality resets, improving release predictability, and reducing customer-impacting incidents
  • Own the resolution of code yellow and equivalent quality escalations, driving root cause analysis and systemic remediation across the engineering organization
  • Champion a culture of engineering fundamentals — ensuring that quality, security, and operational maturity are embedded into every sprint, not treated as afterthoughts
  • Drive measurable reduction in support costs through automation, improved test coverage, and process optimization
  • Provide technical leadership across device programming (Arista EOS, Cisco NX-OS), network fabric orchestration, and Azure Resource Provider engineering
  • Fulltime

Principal Group Engineering Manager

Microsoft is a company where passionate innovators come to collaborate, envision...
Location: United States, Redmond
Salary: 163000.00 - 296400.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 2+ years of people management experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility
  • Lead the design and implementation of a multi-tenant, enterprise-class Storage and Analytics platform
  • Build solutions, create tools, and automate issue detection and diagnosis to enable customers or support to self-resolve product issues
  • Identify emerging trends or recurring escalation scenarios and drive engineering opportunities to mitigate and/or eliminate the category of failure
  • Drive product improvements by filing impactful bugs, design change requests, and fixes shipped to production, preventing customer impact
  • Able to work well in challenging situations while exhibiting flexibility and the ability to tolerate and manage through ambiguity and uncertainty
  • Beyond extensive technical and product focus, this role requires the ability to frame and communicate issues and recommendations clearly and concisely, show exceptional attention to detail, and demonstrate the ability to build broad relationships with the right influencers, leveraging those relationships to impact key business results
  • Fulltime