Site Reliability Engineer II Job at Microsoft Corporation (Hyderabad)

Job Description

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture. Within Azure Data, the business intelligence team builds and maintains Power BI which is the undisputed leader in BI tool in the industry today and serves as the bridge between data professionals & Office users. Power BI’s core value proposition is to enable end users to get answers & insights from their data through data visualization. Within this organization, the Analysis Services Engine (AS) team builds and evolves the analytical database engine that powers Power BI Semantic Models (Datasets) as well as SQL Server Analysis Services (SSAS) and Azure Analysis Services (AAS). These platforms deliver fast, enterprise-grade analytics at global scale and serve as a critical foundation for customers turning data into decisions. As a Site Reliability Engineer II, you will help ensure the reliability, scalability, availability, and performance of Analysis Services. You will apply a software engineering approach to operations by building automation, improving observability, strengthening incident response, and enhancing system resilience. This role is ideal for an engineer who is passionate about operating large-scale services with high quality and efficiency while enabling fast and safe delivery. We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Job Responsibility

Work with all aspects of a high throughput and multi-tenant service
Collaborate effectively within the team and with partner teams across Microsoft
Be part of the on-call rotation for maintaining service health
Design, implement, and refine chosen solutions in close partnership with Product Management and partner teams
Champion operational excellence via established metrics, process governance, and policy controls for regular assessment and improvement
Document and define existing data engineering processes, data and technology, while evaluating them for optimization
System Reliability & Uptime – Ensuring high availability of services
Incident Management – Detecting, responding to, and mitigating system failures
Performance Monitoring – Tracking system health and resolving bottlenecks
Automation & Tooling – Reducing manual work through scripts and automation
Capacity Planning – Scaling infrastructure efficiently to handle demand
Postmortems & Continuous Improvement – Analyzing failures to prevent recurrence

Requirements

Master's Degree in Computer Science, Information Technology, or related field AND 2+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice to have

Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience
2+ years technical experience working with large-scale cloud or distributed systems

Microsoft Corporation - All Job Offers

Select Country

Site Reliability Engineer II

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?