This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Microsoft AI is looking for a Member of Technical Staff, Evaluations Engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot. We’re looking for someone who will bring an abundance of positive energy, empathy, and kindness to the team every day, in addition to being highly effective. The right candidate enjoys building world-class consumer experiences and products in a fast-paced environment. You will actively contribute to the development of AI models that are powering our innovative products. You will wear multiple hats and work on engineering, research, and everything in between. Your contributions will span model architecture, data curation, training and inference infrastructures, evaluation protocols, alignment and reinforcement learning from human feedback (RLHF), and many other exciting topics at the cutting edge of AI. Microsoft AI is building foundational models to develop novel responsible and efficient artificial general intelligence. Foundational models demand significant compute capacity, as a Member of Technical Staff, Evaluations Engineer, you will design and build the evaluation infrastructure for generative AI on large-scale GPU clusters. This role involves developing sophisticated tools and techniques to ensure the reliability, performance, and health of hundreds of nodes across supercomputers with thousands of GPUs. You will collaborate closely with model scientists to implement state-of-the-art and novel evaluation methods, inference strategies, and metrics algorithms, enabling smooth and efficient execution of evaluation workloads. As a contributing member of the core group of engineers, you would also bring to the table best practices driving architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact the business goals of a wide range of users and facilitate the next wave of growth and innovation in AI.
Job Responsibility:
Develop and tune the pretraining scalable software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
Benchmark GB200 and AMD MIxxx GPU clusters
Gather data and insights to develop the pretraining compute roadmap
Care deeply about conversational AI and its deployment
Actively contribute to the development of AI models that are powering our innovative products
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values
Requirements:
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Experience with generative AI
Experience with distributed computing
Experience in leading technical projects and supporting architectural decisions with data