This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Shape the way the M365 measures and drives the feedback loop for its AI offerings! On the M365 Evaluation Platform Team, you’ll have a front-row seat to how AI impacts millions of users. You’ll help steer one of Microsoft’s most important efforts forward, taking our evaluation system to the next level for our builders and partners. Our goal is to accelerate learning by making sure all the user journeys of an eval system (fine tuning a model, launching a new feature or experiment, adding metrics, onboarding a new 1P or 3P partner, understanding user feedback, creating query sets, etc.) are supported by friendly, agile, reliable, scalable and well documented tools. In this role you will build capabilities that: Enable builders to be more agile, running more evaluations and faster; Provide a continuous set of tools and evaluation capabilities throughout the development lifecycle; Automate tasks via tools or agents to help us understand our performance better.
Job Responsibility:
Partners with appropriate stakeholders to determine user requirements for a set of scenarios
Leads identification of dependencies and the development of design documents for a product, application, service, or platform
Leads by example and mentors others to produce extensible and maintainable code used across products
Leverages subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple group's project plans, release plans, and work items
Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Experience building systems to evaluate and drive quality in a product and using data to drive engineering decisions
A focus for building reliable, scalable infrastructure and making users successful
Comfortable at operating in a dynamic environment
takes initiative to bring clarity and momentum
Self-motivated and outcomes-focused, with a strong sense of ownership and accountability
Platform engineering mindset: building reusable components, reducing time‑to‑launch, improving debuggability, and delivering well‑documented tooling
Demonstrated technical leadership experience in evaluation, distributed systems or development platforms
Nice to have:
Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python