This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft is leading the AI strategy with an ambitious mission to democratize AI, make it an essential ingredient for delivering breakthrough customer experiences and to ensure the benefits of AI reach every person and organization on the planet, safely and responsibly. We are looking for a Principal Software Engineer at Microsoft, Hyderabad, who is passionate about designing, architecting and running highly reliable and available platform to support model inferencing at the scale of billions of requests per day. You will also have an opportunity to work on high throughput/ low latency scenarios and drive performance optimization capabilities. You will be joining the Azure Open AI Inference team that builds and runs the model-serving platform for large OpenAI generative models. This team is at the forefront of delivering this innovation with massive scale powering every Azure OpenAI customer and scenario in the industry, both 3P and 1P customer like Bing and Office and solve exciting problems on the intersection of AI and Cloud.
Job Responsibility:
Design and build: Lead the architectural design of complex software systems, considering performance, scalability, maintainability, and cost-effectiveness
Strong Quality Focus: Create a team culture of quality first mentality that directly contributes to delivering high quality features and changes as part of the stack
One Microsoft: Collaborate cross organization to deliver key features in Azure Open AI Platform
Contribute to the overall technical direction alongside other leads and managers across geo locations
Leadership: Own and show strong technical and non-technical leadership in driving the product (End-to-End) E2E
Take ownership of and drive key cross-team development projects, ensuring the results align with business goals and timeline requirements while managing risks
Production Service Support: Act as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
Building AI powered intelligent systems to augment the engineers will be an opportunity as well
Requirements:
B Tech or M Tech in computer science, engineering, mathematics or a related field, or equivalent industry experience
12+ years of software development experience
Work experience of running a real time service with high throughput and low latency requirements is a plus