This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft Research (MSR) AI Frontiers lab is seeking applications for the position of Principal Research Engineer – Multimodal AI to join their team in Redmond, WA or New York City, NY. We are seeking a Principal Research Engineer – Multimodal AI to join our team and lead engineering efforts on the advancement of Generative AI and Multimodal Model (MLM) technologies. As a Principal Research Engineer, you will play a crucial role in developing, improving, and exploring the capabilities of Multimodal AI and agentic models. Your work will have a significant impact on the development of cutting-edge technologies, advancing state-of-the-art and providing practical solutions to real-world problems. Our ongoing research areas encompass but are not limited to: Reasoning method for Multimodal models; New multimodal model architectures and training methods; Action models for automating web and computer tasks; Orchestration and multi-agent systems: automated orchestration between multiple agents incorporating human feedback and oversight; Evaluation and Understanding of model and agent capabilities. The AI Frontiers lab at Microsoft Research is charted with ambitious research goals for advancing Artificial Intelligence (AI) capabilities in several key areas including modeling, algorithms, reasoning and agentic AI. Our lab offers a vibrant environment for cutting-edge multidisciplinary research, including an open publication policy and close links to top academic institutions around the world.
Job Responsibility:
Design, develop, execute, and implement technology research projects in collaboration with other researchers, engineers, and product groups
Be a part of research breakthroughs in the field and play a crucial role in developing, improving, and exploring the capabilities of Large Language Models (LLMs), reasoning and agentic AI
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
2+ year(s) experience developing with Python and Pytorch/JAX
Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Master's or Doctorate in Computer Science or relevant field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Experience with architecture and optimizations for large language models
Hands-on work in debugging and profiling Pytorch distributed code
Understanding of working of CUDA kernels
Experience with pre-training, mid-training and/or post-training pipelines for language and/or multimodal models
Foundational understanding of reinforcement learning and key challenges in the field
Experience with verl, Ray, Megatron and/or vLLM is a significant plus
Any experience in building scalable services can be highly complementary