Join the Strategic Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft’s expanding cloud infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. As part of the SPARC group, you will help with pathfinding and architecture for future compute platforms, storage, and related technologies that create advantages for Azure and Microsoft. You will collaborate across the Azure organization to evaluate next-generation datacenter technologies and influence Azure product roadmaps for both Microsoft and third-party silicon and systems.
Job Responsibilities:
Drive pathfinding initiatives to identify and quantify optimization opportunities across distributed and disaggregated storage and memory architectures for computing and AI inferencing systems
Conduct in-depth architectural analysis of next-generation storage technologies, leveraging a strong understanding of workloads across key Azure segments
Collaborate cross-functionally to influence technology direction and contribute to long-term strategic planning for evolving datacenter architectures
Lead architectural exploration of emerging technologies through robust proof-of-concepts (PoCs) and end-to-end prototyping, aligned with product segments and real-world usage scenarios
Partner with hardware design and software enablement teams to mature innovations from concept validation to production-ready solutions
Engage with Azure operators and customers to identify current challenges and anticipate emerging needs across the platform
Evaluate and identify promising technologies aligned with Azure business priorities, engage ecosystem partners, and de-risk productization through capable PoCs
Work across organizational boundaries with roadmap planners, product architects, and hardware and software engineering teams to successfully integrate innovative solutions into Azure datacenters
Requirements:
Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 9+ years technical engineering experience OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 11+ years technical engineering experience OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements
10+ years of experience with system performance evaluation using industry standard benchmarks and/or common cloud workloads
10+ years of experience with significant hardware/software co-design projects involving CPU and/or systems architecture and influencing technical direction
Prior experience developing and driving cloud technologies for improved TCO/$ and TCO/$/performance
Intellectual curiosity and passion for learning and deploying new technologies
Verbal and written communication skills and ability to engage technical & non-technical peers
Experience contributing to complex projects with respect and integrity, including those with multiple workstreams spanning different business and technical disciplines
Understanding of compute and storage systems in the cloud, including storage and network technologies, and experience with software-defined storage and distributed file systems
Deep understanding of AI inference systems and associated software, and emerging approaches to orchestrate tiered memory and storage capabilities for distributed serving and KV caching for agentic systems
Deep expertise in AI scale-up and scale-out networking/interconnect architectures, along with a good understanding of memory/storage technologies such as HBM, LPDDR, and HBF
Skilled in partnering and influencing architects, hardware engineers, and software leads
Experience with gathering and analyzing system telemetry and low-level performance counters to identify and root-cause performance bottlenecks
Problem-solving skills, analytical capabilities, and attention to detail
Ability to work through ambiguity, collaborate effectively within a team, and take ownership of responsibilities