This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Azure Chaos Studio is one of the core products our team builds, an engineering platform that enables controlled experimentation and fault injection across Azure and Microsoft’s cloud infrastructure. With Azure Chaos Studio, engineers can intentionally introduce real‑world failure conditions such as network latency, resource throttling, and service outages. These controlled experiments help teams validate system resilience, uncover hidden issues before they impact customers, and strengthen the reliability and availability of cloud services at scale. By driving a disciplined approach to chaos engineering, our team empowers product teams across Microsoft to build more robust and dependable systems.
Job Responsibility:
Collaborate with cross team stakeholders to capture user requirements, incorporate insights into future designs
Contribute to architectural discussions and develop proposals independently
Deliver high quality system design that meets service reliability, performance, scalability requirements
Leads by example within the team by producing extensible and maintainable code
Use debugging tools and methodologies proactively and reactively to ensure product robustness
Maintain operational integrity of live services on a rotational, on-call basis. Act as a DRI for system monitoring and issue resolution
Develop internal tools to streamline product creation and evaluate current methods' applicability
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 7+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Nice to have:
Master's Degree in Computer Science or related technical field AND 7+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Python OR Bachelor's Degree in Computer Science or related technical field AND 7+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Python OR equivalent experience
Advanced knowledge of SQL/NoSQL technologies and principles of high availability database design
Troubleshooting skills with a track record of working on service-oriented products
Awareness of agile software development methods and best practices, including delivering software features in phases
3+ years of software development experience in building low latency high throughput highly reliable services
Experience in developing large-scale distributed systems utilizing cloud technologies such as Azure or AWS
Proficient in distributed system management with solid architectural and technical system design skills