This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Our team builds and operates core Azure Storage infrastructure that powers Blob Storage and ADLS at hyperscale. We focus on distributed systems, performance, scalability, and reliability of the data plane, enabling customers to run mission‑critical and AI/ML workloads with predictable latency, high throughput, and strong durability guarantees. The team works end‑to‑end—from design and implementation to live‑site operations—on systems that run at massive scale across Azure storage stamps.
Job Responsibility:
Reasoning about complex distributed failure modes
Making performance vs. reliability trade‑offs at scale
Debugging production issues in high‑traffic, multi‑tenant environments
Delivering changes safely while operating a 24×7 global service
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C#
OR equivalent experience
Ability to meet Microsoft customer and/or government security screening requirements are required for this role
These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Nice to have:
Bachelor's Degree in Computer Science OR related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C#
OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C#
OR equivalent experience
3+ years of experience in designing and developing reliable, high-performance distributed systems with a strong focus on scalability and resilience
Demonstrated experience with distributed systems and large-scale storage, including solving complex systems problems and working on storage internals such as availability, remote replication, durability schemes, and high-performance I/O
Practical knowledge in one or more technical domains: database internals, indexing and search techniques, caching and data placement strategies, and metadata/namespace management at scale
Ability to influence technical direction and contribute to architecture decisions across teams, fostering collaboration and innovation