This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The AI Platform organization at Microsoft builds the end-to-end Azure AI stack/PaaS and is core to Azure’s innovation and differentiation, as well as all of Microsoft’s flagship products, from Office to Teams, to Xbox. We are the team building Azure OpenAI, Azure ML, Cognitive Services, and the global Azure AI infrastructure for running the largest AI workloads on the planet. Within AI Platform, the AI Foundry team enables data scientists and developers to quickly and easily build, train, deploy, manage, and consume machine learning model. Our Azure AI Foundry Model Customization Team is at the forefront of this mission, working on groundbreaking projects for customizing OpenAI and OSS models. We collaborate closely with research institutions, industry leaders, and organizations worldwide to create innovative solutions that impact millions of users.
Job Responsibility:
Design, implement, and support scalable, reliable, high-performance services with a strong focus on SLA/SLO, customer adoption and velocity of iterations
Advocate new trends to adapt them to current problems and shares knowledge with peers
Improve artificial intelligence tools and practices across the software development lifecycle
Lead discussions for architecture of complex products ensuring test strategies for solution quality
Mentor in identifying dependencies and producing extensible code across teams
Lead debugging efforts and application of coding patterns to improve code quality
Develop automation for production deployment targeting zero-touch when possible
Embody our Culture and Values
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to C++ / C#, Java or Python OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to C++ / C#, Java or Python OR equivalent experience
Depth in Generative AI and Engineering
A strong background in machine learning, deep learning, and natural language processing
Proficiency in Python and relevant ML libraries (e.g., PyTorch, Transformers)
Experience with transformer-based models (e.g., BERT, GPT, Llama)
Familiarity with cloud platforms (e.g., Azure, AWS) and distributed computing (Kubernetes)
Excellent problem-solving skills and the ability to work independently and collaboratively
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter