This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Support for Mission Critical is a team within Microsoft that provides solution-specific expertise designed to drive peak health and optimum performance of a customer’s most important solutions. As a key technical resource for the customer, you will be primarily focused on delivering proactive services such as education workshops, delivering assessments, and providing tailored guidance. Troubleshooting skills are essential as this role will include working with Microsoft Support to expedite incident resolution.
Job Responsibility:
Own delivery of Support Mission Critical Services for customers’ most critical workloads, driving a seamless, high‑quality experience across CSU, CSS, Engineering, and partner teams
Lead Proactive Resiliency for assigned workloads, improving reliability, security, supportability, and observability at scale
Drive customer onboarding and readiness, including Consolidated Assessment Week and execution of workload‑specific improvement and success plans
Own Accelerated Incident Resolution, ensuring visibility into critical incidents and that RCAs and learnings translate into durable resilience improvements
Partner with customers and internal teams to enable strong monitoring and observability, increasing customer self‑sufficiency and operational maturity
Provide cross‑team leadership with CSAMs and internal stakeholders, clarifying ownership, managing escalations, and maintaining alignment throughout the contract lifecycle
Requirements:
Bachelors Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 4+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
Proven experience in Cloud Solutions Architecture or Mission Critical Support for enterprise customers
Deep knowledge of Azure infrastructure services (Compute, Storage, Networking), Container services (such as Azure Kubernetes Service) and Platform-as-a-Service (PaaS) offerings
Strong troubleshooting skills across distributed systems and mission-critical workloads
Familiarity with performance optimization, high availability, and disaster recovery strategies
Demonstrated ability to manage high-severity incidents and provide rapid mitigation strategies
Experience working with financial services customers or other highly regulated industries
Excellent verbal and written communication skills for executive-level updates and technical deep dives
Ability to collaborate across engineering, product groups, and global support teams
Nice to have:
Expertise in virtualization, VM performance tuning, and cache optimization
Knowledge of observability tools, telemetry, and proactive monitoring solutions
Site reliability / operational troubleshooting experience for large infrastructure as a service or infrastructure environments