This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Join the Microsoft 365 Excel Data and Applied Science team, where we are pioneering the next generation of generative AI workflows. As part of this dynamic group, you will help redefine the future of Excel by driving innovations such as Excel Agent and advanced formula autocompletion features. If you are passionate about transforming productivity through intuitive, real-world AI solutions, this position offers a unique opportunity to apply your expertise in a collaborative and forward-thinking environment. We are seeking a Senior Applied Scientist to drive the ongoing evolution of Microsoft 365 Copilot in Excel, focusing on optimizing large language model (LLM) and agentic workflows. In this influential role, you will leverage your deep expertise in LLMs, information retrieval, and machine learning to lead the innovation, design, and evaluation of high-impact AI solutions for millions of enterprise users worldwide. Your contributions will shape the technical strategy, inform product direction, and foster meaningful cross-functional partnerships - all with the goal of delivering transformative, AI-powered experiences that enable users to achieve more.
Job Responsibility:
Design, fine-tune, and deliver models and agentic flows for integration with Excel Agent and on-canvas experiences
Leverage state-of-the-art LLM fine-tuning and retrieval methods, with robust evaluation metrics and A/B testing to ensure data-driven progress
Gather and curate relevant benchmarks, build a comprehensive evaluation framework, and develop GPT-based evaluators (LLM-as-a-Judge)
Run controlled experiments to compare performance, efficiency, and scalability using data-driven metrics and A/B testing focusing on reproducible and impactful results
Continuously study emerging literature, share insights with leadership and peers during research reviews and deep dives adapt quickly to new findings, and integrate them into experiments and when applicable share with broader research community
Requirements:
M.Sc. or PhD in the fields of Computer Science, Information Systems, Mathematics or Data Science
Hands-on (at least 2+ years) experience in building and deploying LLM focused products
Evidence of research contributions through conference publications in Large Language Models (LLM) and Natural Language Processing (NLP) domains submitted/accepted in top venues (KDD, ICML, AAAI, ACL, ICLR, etc.)
Strong understanding of advanced research concepts in the field
Experienced in evaluating the performance of large language models (LLMs), developing benchmarks tailored to practical scenarios, and using simulation and synthetic data generation techniques
Customer obsession and passionate about making real world product impact
Excellent verbal and written communication skills, with the ability to simplify and explain complex ideas
Effective collaboration skills while working effectively within a globally distributed organization
Nice to have:
Ph.D. in Computer Science, Information Systems, Data Science, or a closely related field