This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Are you a customer-obsessed, AI-curious problem-solver who thrives in an inclusive, collaborative global team? Join Engineering Operations (EngOps) – the organization driving operational excellence across the Microsoft Cloud to strengthen quality, reliability, security, and customer trust. As part of EngOps, you’ll design solutions that prevent issues before they happen, embed AI-powered automation, and turn signals into actions that deliver measurable customer impact. Our culture of empowerment, inclusion, and growth mindset defines how we work. Azure Reliability is driving transformation to AI-powered operations by building scalable ML infrastructure that enables autonomous, reliable, and secure cloud systems. We are looking for candidates that can combine deep technical expertise in MLOps with a proven ability to deliver measurable business impact through continuous learning, policy-driven governance, and responsible AI practices. Success in this role means advancing operational autonomy, quality, and security, while fostering collaboration and accountability across teams. Every day, customers stake their business and reputation on our cloud. You can help #EngOps keep them secure, resilient, and ready. This role will require a minimum of three days in office.
Job Responsibility:
Collaborate with stakeholders to identify user requirements, incorporating feedback and actionable metrics into future designs
Contribute to product architecture, helps create proposals, owns solutions, and ensures security and compliance
Assist in testing plans and automation features
Implement code for products, services, or features with a focus on extensibility and maintainability
Review code for quality, reliability, and adherence to coding standards
Execute project plans, break down work items, conduct experimentation, and support safe deployment of features
Manage live service operations, act as a Designated Responsible Individual (DRI) on call, integrate telemetry data, and contribute to data analysis for system health
Improve developer tools, contribute to automation in production, ensure compliance with security and privacy, and stay current with industry developments
Maintain communication with key partners across the Microsoft ecosystem to achieve desirable user experiences and meet the dynamic needs of partners and customers through product development.
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
Or equivalent experience.
Nice to have:
Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python