This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft’s Cloud Operations and Innovation (CO+I) organization builds and operates the global datacenter infrastructure that powers Microsoft’s cloud. Within CO+I, the Engineering organization (CO+IE) delivers the software platforms, telemetry pipelines, and automation that enable scalable, reliable, and cost‑efficient datacenter planning and operations. These systems form a critical competitive advantage for Microsoft, translating physical infrastructure signals into intelligence that protects availability, improves sustainability, and enables continuous scale. As a Principal Software Engineer, you operate as a hands‑on architect and AI expert at the intersection of cloud platforms, critical physical infrastructure, and intelligent operations. You design and incubate next‑generation AI Ops solutions for Critical Environments (CE)—powering real‑time situational awareness, proactive risk detection, and autonomous decision support across Microsoft’s global datacenter fleet. We are looking for a Principal Software Engineer to help deliver automation capabilities that power the long-range execution planning efforts, drive workflow improvements and build solutions to assist in the delivery of large scale data centers through efficient management of cost and schedule.
Job Responsibility:
AI‑Driven Situational Awareness for Critical Environments
Telemetry Intelligence
Anomaly Detection & Prediction
Event Correlation & Blast Radius Analysis
AI‑Assisted Triage and Decision Support
From Human‑in‑the‑Loop to Autonomous Operations
Technical Leadership and Organizational Impact
Write high quality, maintainable, reusable code following SOLID principles
Collaborate with and demonstrate features developed to stakeholders in an Agile environment
Resolve complex system integration challenges working with other members of the team and external teams
Share learnings and code assets developed with the CO+I engineering team
Leverage subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
Act as a Designated Responsible Individual (DRI) and guide other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
Proactively seek new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Experience or exposure in data engineering and backend work
Experience working with MODBUS, BACNET, DATACENTER CRITICAL ENVIRONMENT TELEMETRY, AZURE IOT, AI OPS, LLM, Agentic Apps, KUSTO, MACHINE LEARNING, MQTT, OPC-UA OR Equivalent experience