This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Azure Core is responsible for building and operating the foundational infrastructure that powers Microsoft Azure. This includes the platform services that enable compute, networking, storage, and distributed systems to operate reliably and at scale across Microsoft’s global cloud footprint. These services are central to Azure’s control plane and are critical to ensuring platform availability, performance, and customer trust. As a Senior Software Engineer within this organization, you will work on developing and operating highly reliable, low-latency platform services that manage persistent, distributed system state. These services are deeply integrated into Azure’s infrastructure and serve as essential components for internal and external workloads that demand strict service letter agreements (SLAs) for durability, availability, and scalability. This is a high-impact role focused on building foundational platform services for the Azure cloud. You’ll be part of a team that values deep technical expertise, strong execution, and a commitment to delivering secure, scalable, and maintainable systems at global scale.
Job Responsibility:
Provide technical leadership to define testing scope, create quality plans, and align on test requirements across products
Lead cross-team coordination to build and manage test plans, including collaboration with vendors and alignment with industry best practices
Use data to build impactful visualizations that identify gaps and guide testing strategy for complex products
Partner with teams to implement automated testing and define manual test plans
lead issue resolution and mitigation efforts
Drive stakeholder communication, vendor coordination, and influence product decisions through quality-focused best practices
Design and build cloud-native services with a strong emphasis on resilience, fault tolerance, and observability
Drive service reliability through operational ownership, including production support and incident response
Contribute to architectural design, performance optimization, and long-term service evolution
Mentor engineers and promote engineering excellence across the team
Oversee the deployment, monitoring, and maintenance of services in production environments
Ensure high availability and performance of all systems
Analyze system performance and implement improvements to enhance scalability and reliability
Respond to and resolve production incidents, ensuring minimal downtime and impact on users
Requirements:
Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
3+ years of experience working with distributed systems
3+ years of experience managing lifecycle of services from development to production
2+ years of experience working with Azure Cosmos DB
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Nice to have:
Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python