This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft Research AI Frontiers lab is seeking applications for the position of Senior Researcher – Foundations of Generative AI to join their team in New York, NY. The mission of the AI Frontiers lab is to expand the pareto frontier of Artificial Intelligence (AI) capabilities, efficiency, and safety through innovations in foundation models and learning agent platforms. Some of our projects include work on small language/action models (e.g., Phi, Orca, Fara-7B), new architectures and optimizers (e.g., Belief State Transformer, Dion), and agentic AI systems (e.g. AutoGen, MagenticOne, OmniParser). We are seeking a Senior Researcher – Foundations of Generative AI to join our team and lead efforts in discovering and building the foundations of generative AI through representations and objectives. As a Senior Researcher – Foundations of Generative AI, you will play a crucial role in leading, developing, improving, and exploring new architectures, representations, and learning objectives that unlock new capabilities and/or scalability. Your work will have a significant impact on the development of cutting-edge technologies, advancing the state-of-the-art, and providing practical solutions to real-world problems.
Job Responsibility:
Apply research and engineering skills to develop, prototype, and evaluate cutting-edge research ideas
Work closely with other researchers and engineers to rapidly prototype and test new research ideas, driving a high-impact agenda and publishing results where appropriate
Collaborate hands-on with other researchers, engineers, and internal and external product groups to deliver high-impact solutions to real-world problems
Embody our culture and values
Requirements:
Doctorate (or currently pursuing) in Computer Science or relevant field OR equivalent experience
Doctorate in Computer Science or relevant field AND 2+ years related research experience OR equivalent experience
Research program demonstrated by public artifacts like models, tools, code in the AI space or publications at conferences: NeurIPS, ICML, ICLR, ACL, NAACL, CVPR, COLT, ECCV, ICCV, EMNLP
2+ years of academic or industry experience in developing, applying, and/or implementing algorithms for machine learning/statistics, using common ML engineering programming languages and platforms such as Python, Python numerical libraries, PyTorch, TensorFlow and/or HuggingFace
Experience publishing academic papers as a lead author or essential contributor in a top AI conference or journal
Deep understanding of frontier model architectures, especially transformers and state space models
Hands-on experience building and working with Large Language Models (LLMs) or multimodal models (VLMs, VLAs), including pre-training, fine-tuning, and inference
2+ years of industry or academic experience with building, debugging and optimizing large-scale ML training pipelines
Demonstrated software engineering excellence building and deploying prototypes, applications, or open-source (OSS) technologies
Ability to work independently and ramp-up quickly on complex projects or unfamiliar code
Ability to collaborate, communicate effectively, and work as part of a multi-disciplinary team
Keen interest in real-world applications and impact