Help deliver one of the best foundation models in the world at Microsoft AI. At Microsoft AI, we are on a mission to train the world’s most capable frontier AI models, pushing the boundaries of scale, performance, and product deployment. The Pre-Training team tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, which will underpin many initiatives across Microsoft AI.
Job Responsibilities
Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
Collaborate closely with teams on infrastructure, data, post-training, and multimodality
Embody our culture and values
Requirements
Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or a related technical discipline AND 4+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Demonstrated experience in large-scale AI
Passion for conversational AI and its deployment
Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Nice to have
Bachelor's Degree in Computer Science or a related technical field AND 6+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Master's Degree in Computer Science or a related technical field AND 8+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python