This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft Cloud Operations and Innovation (CO+I) underpins Microsoft’s global cloud infrastructure, driving the innovation, planning, design, construction, and operation of one of the largest data center fleets in the world. We are seeking an exceptional Senior Applied Scientist to join the Data Center Applied AI team. In this role, you will play a pivotal role in advancing and integrating cutting‑edge, multimodal agentic AI systems into core tools and operational workflows that power Microsoft’s data centers. Your work will drive operational efficiency at scale and help advance Microsoft’s mission to empower every person and every organization on the planet to achieve more through intelligent, multimodal AI agents. As a Senior Applied Scientist, you will bridge state‑of‑the‑art AI research with production‑grade engineering, delivering agent architectures that are reliable, secure, scalable, observable, and measurable. You will collaborate closely with Business and Engineering to build and deploy agentic capabilities across workflow automation, information retrieval, retrieval‑augmented generation (RAG), tool and function calling, long‑horizon task execution, and multi‑agent orchestration. This role blends deep AI and applied science expertise with strong engineering judgment, operational rigor, and a bias for action. Success requires end‑to‑end ownership, a growth mindset, and strong customer empathy. Join us to shape the future of agentic AI for data center operations at global scale.
Job Responsibility:
Advance Applied AI Research to Improve Quality and Reliability
Apply deep expertise in Generative AI, deep learning, NLP, and multimodal models to translate cutting-edge research into high-impact, production-ready AI solutions.
Design and execute experiments that measurably improve agent planning, memory, grounding, reasoning, and long‑horizon task completion.
Perform lightweight fine‑tuning (e.g., LoRA and related techniques) on multimodal and large multimodal language models—to improve entity recognition, reasoning accuracy, and response quality.
Implement state‑of‑the‑art approaches using foundation models, advanced prompt engineering, RAG, knowledge graphs, and multi‑agent architectures, complemented by classical ML techniques where appropriate.
Build and Ship Scalable Agentic AI Systems
Architect and implement end‑to‑end agent workflows that decompose user intent into executable plans, intelligently select and invoke tools, and recover gracefully from errors and edge cases.
Rapidly prototype solutions and partner with engineering teams to drive production deployment, including debugging live systems and building AIOps workflows.
Use data and telemetry to identify AI quality gaps, generate insights, and deliver proofs of concept that apply research innovations to real‑world operational challenges.
Design robust multi‑step reasoning and tool‑use strategies, including function calling, code execution, APIs, and secure connectors, with strong safety and reliability guardrails.
Drive production excellence across latency, reliability, cost efficiency, observability, monitoring, and safe fallback behaviors.
Own Evaluation, Metrics, and Iteration Loops
Define task‑level evaluation metrics for agentic behavior, including success rate, tool‑call accuracy, step efficiency, hallucination rate, safety violations, time‑to‑completion, and user satisfaction.
Build and maintain offline and online evaluation pipelines, including: Golden datasets, scenario simulators, and regression test suites
Human‑in‑the‑loop evaluation frameworks and rubric design
A/B experimentation and telemetry‑driven iteration loops.
Collaborate Across Disciplines and Lead Technical Execution
Partner with business and engineering stakeholders to translate requirements into clear technical specifications, research plans, and delivery milestones.
Lead design reviews and influence engineering decisions across agent frameworks, model integration, and system architecture.
Mentor and support other applied scientists and engineers, raising the bar for applied AI execution and production quality
Embody our Culture and Values
Requirements:
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
Experience with Azure technology Stack
Experience with parameter‑efficient fine‑tuning techniques (e.g., LoRA) for LLMs or multimodal models.
Hands‑on experience building agentic or tool‑augmented LLM systems, including function calling, planners, and API/tool integration.
Experience with advanced RAG architectures, such as hybrid retrieval, reranking, grounding, and citation strategies.
Strong AIOps depth, including quality drift detection, alerting, rollback, and telemetry‑driven optimization.
Experience optimizing systems for latency, reliability, scalability, and cost efficiency at enterprise scale.
Prior experience working on mission‑critical or large‑scale enterprise AI systems.
Demonstrated mentorship or technical leadership within applied science or engineering teams.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.