Member of Technical Staff, Evaluations Engineering Job at Microsoft Corporation (Mountain View)

Member of Technical Staff, Principal Engineering Manager

As Microsoft continues to push the boundaries of AI, we are on the lookout for s...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, Javascript, or Python OR equivalent experience
Demonstrated track record of building and scaling engineering organizations (hiring teams from scratch, structuring orgs, growing managers)
Experience delivering large-scale software systems in AI, machine learning, or related fields
Experience managing organizations of 30+ engineers across multiple teams and workstreams
Deep expertise in LLM evaluation, AI quality measurement, or ML infrastructure at scale
Track record of partnering with senior leadership (VP/CVP level) to set strategy and drive cross-organizational programs
Experience recruiting and developing senior engineering talent (principal engineers, engineering managers) in a competitive market
Proven ability to operate effectively in fast-paced, ambiguous environments — comfortable making decisions with incomplete information and course-correcting quickly
Strong technical judgment: ability to evaluate architectural tradeoffs, assess technical risk, and guide teams toward sound engineering decisions without needing to write the code yourself
Experience leading distributed or multi-site engineering teams.

Job Responsibility

Build and lead a multi-team engineering organization (30+ engineers across multiple teams), including hiring and developing engineering managers who lead their own teams
Set the technical and organizational strategy for Copilot AI Evaluation and response quality, aligning with MAI's broader product and engineering vision
Partner with senior Eng and Product leadership (Partner+ level) to define priorities, influence roadmaps, and drive cross-organizational initiatives
Own end-to-end delivery of evaluation platforms, novel evaluation techniques, and agentic solutions for measuring and improving Copilot quality at scale
Recruit, develop, and retain world-class engineering talent — building a culture of technical excellence, accountability, and continuous learning
Drive operational rigor: establish engineering processes, quality bars, and delivery cadences that enable predictable, high-quality execution across multiple concurrent workstreams
Navigate ambiguity and make high-judgment tradeoff decisions on technology, staffing, and investment priorities in a fast-moving AI landscape
Foster a diverse, inclusive team culture where engineers at all levels can do their best work and grow their careers
Embody our Culture and Values.

Fulltime

Member of Technical Staff - Infrastructure & Engineering

The Member of Technical Staff (MTS) - Systems is a senior individual contributor...

Location

United States , Austin

Salary:

Not provided

Aptiv plc

Expiration Date

Until further notice

Requirements

Bachelor degree in Computer Science, Electrical Engineering, or related field
8+ years of software engineering experience
5+ years of experience with embedded Linux or systems programming
Experience leading technical projects and mentoring engineers
Strong background in C/C++ programming
Expert-level proficiency in C/C++ programming
Deep understanding of Linux kernel architecture and internals
Experience with embedded systems development
Knowledge of build systems (Yocto, Buildroot, or similar)
Strong debugging and problem-solving skills

Job Responsibility

Serve as technical lead for major features and projects
Design and architect complex system components and solutions
Provide technical guidance and mentorship to junior engineers
Review code, designs, and architecture decisions
Drive technical standards and best practices within the team
Develop and maintain embedded Linux systems software
Work on user space applications, kernel modules, or toolchain components
Implement new features and enhancements based on requirements
Debug and resolve complex technical issues
Write high-quality, maintainable code following team standards

What we offer

Hybrid work model for workplace flexibility
Comprehensive health, dental, and life insurance
Short and long-term disability coverage
RRSP matching for financial security
Flexible time-off policies for work-life balance
Employee assistance program for mental well-being
Learning benefits, including a LinkedIn Learning subscription and seminars

Fulltime

Member of Technical Staff, Pretraining evaluations

As a Member of Technical Staff in the pretraining evals team, you will play a ke...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Familiarity with base model evaluations and how they differ from post-trained models
Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
Ability to convey statistical information effectively to a broad audience using visualizations and easy-to-understand numbers
Extremely strong software engineering skills
Proficiency in programming languages such as Python and ML frameworks (e.g., PyTorch, TensorFlow, JAX)
Excellent communication skills to collaborate effectively with cross-functional teams and present findings
One or more papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

Job Responsibility

Deeply understand each individual evaluation task in our base model evaluation suite, have a clear idea of what each task measures and know their strengths and limitations
Suggest and implement improvements to our base model evaluation suite, whether by adding new tasks to measure unmeasured model capabilities or removing redundant or low-signal tasks
Improve the statistical understanding of our evals and improve the signal-to-noise ratio of our evaluation suite

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Senior Member of Technical Staff - Systems

Wind River is a global leader in delivering software for mission-critical intell...

Location

United States , Austin

Salary:

Not provided

Aptiv plc

Expiration Date

Until further notice

Requirements

Bachelor degree in Computer Science, Electrical Engineering, or related field
12+ years of software engineering experience
8+ years of experience with embedded Linux or systems programming
5+ years in a senior technical leadership role
Proven track record of technical leadership and innovation
Candidates must be legally authorized to work in the United States on a permanent basis - without requirement for any type of visa sponsorship/transfer, now, or at any time in future
Must be a local resident of Greater Austin, TX, with ability to work on campus
Expert-level proficiency in C/C++ and systems programming
Deep, comprehensive understanding of Linux kernel architecture
Extensive experience with embedded systems and real-time systems

Job Responsibility

Drive technical vision and strategy for major systems initiatives
Define and influence system architecture and design decisions
Lead technical evaluation of new technologies and approaches
Provide technical guidance on complex, cross-cutting problems
Represent engineering in technical discussions with customers and partners
Design and architect large-scale, complex systems
Solve the most challenging technical problems
Establish technical standards and best practices organization-wide
Review and approve critical architecture and design decisions
Ensure technical quality and excellence across all deliverables

What we offer

Hybrid work model for workplace flexibility
Comprehensive health, dental, and life insurance
Short and long-term disability coverage
RRSP matching for financial security
Flexible time-off policies for work-life balance
Employee assistance program for mental well-being
Learning benefits, including a LinkedIn Learning subscription and seminars

Fulltime

Member of Technical Staff, Microsoft Robotics

Overview Microsoft’s Discovery and Quantum (MDQ) division develops and delivers...

Location

United States , Redmond

Salary:

165600.00 - 296400.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Job Responsibility

Own the multi-year technical architecture for the Microsoft Robotics platform, setting the direction the broader engineering organization, partners, and customers build on, and holding that architecture coherent as it scales across teams and deployments.
Make the foundational design decisions that determine how robotics software, AI models, and cloud and edge components fit together, defining the core interfaces and engineering contracts other teams depend on.
Architect systems that meet demanding real-world constraints, balancing latency, throughput, compute efficiency, reliability, and cost across cloud and on-robot edge environments.
Lead the evaluation and validation strategy that gates what ships, establishing the benchmarks, test approach, and quality bar for autonomy, safety, and task performance across simulation, lab, and field, including safe-autonomy boundaries and human-in-the-loop fallback.
Drive the hardest cross-stack technical problems personally, taking on the integration failure modes that emerge where AI models meet production robots, and resolving them through rigorous root-cause analysis of system behavior and field data.
Connect classical robotics engineering with modern AI, partnering across foundation-model, perception, manipulation, locomotion, and simulation teams to bring learned capabilities into a common, production-grade platform rather than one-off integrations.
Establish the engineering bar for the team, owning design standards and review practices, leading the consequential technical reviews, and lifting the quality, testability, and operability of the systems that matter most.
Influence direction across MDQ, Microsoft Research, Azure, hardware and silicon partners, and customers without relying on formal authority, building technical consensus and aligning many teams on a single coherent platform strategy, including representing the platform in the open-source and standards community.
Multiply the team's output through technical leadership, mentoring senior and principal engineers and raising the technical ceiling of the organization rather than only its headcount.
Track the state of the art across embodied AI, robot learning, robotics middleware, and simulation, and decide deliberately how and when new techniques are incorporated into the platform.

Fulltime

Member of Technical Staff, Microsoft Robotics (Robot Learning)

Microsoft's Discovery and Quantum (MDQ) division develops and delivers advanced ...

Location

United States , Redmond

Salary:

102100.00 - 202200.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Job Responsibility

Develop and train end-to-end robot learning models, including vision-language-action (VLA) family of models, imitation learning policies, and reinforcement learning agents for manipulation, locomotion, and navigation tasks
Build, maintain, and optimize data pipelines for robot learning, including collection infrastructure for teleoperation demonstrations, data preprocessing, augmentation, quality filtering, and dataset versioning
Train machine learning and deep learning models on GPU computing clusters, implementing distributed training, hyperparameter optimization, curriculum learning, and training infrastructure automation
Deploy trained models to physical robot platforms, conducting real-world evaluation, debugging sim-to-real transfer issues, and iterating on model performance based on deployment feedback
Implement and maintain evaluation frameworks for robot learning models, including standardized task benchmarks, success rate tracking, generalization testing across objects and environments, and regression detection
Collaborate with robotics researchers, simulation engineers, and platform engineers to improve the end-to-end model development lifecycle, from data collection through deployment and monitoring
Write production-quality code in Python (including NumPy, PyTorch, JAX) that is well-tested, maintainable, and extensible, adhering to team coding standards and best practices
Review code and technical designs, providing feedback to develop other engineers' skills and drive adherence to coding patterns, security practices, and engineering excellence standards
Stay current with state-of-the-art research in robot learning, foundation models for robotics, and physical AI, evaluating new model technologies and techniques for adoption and integration into the platform
Contribute to internal knowledge sharing through technical documentation, brown bag sessions, blog posts, and mentoring of team members

Fulltime

Member of Technical Staff, Microsoft Robotics (Robotics Simulation)

Microsoft’s Discovery and Quantum (MDQ) division develops and delivers advanced ...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Job Responsibility

Design and build automated pipelines for converting reality capture data (photogrammetry, LiDAR point clouds, depth camera scans, 360-degree imagery) into physics-ready 3D simulation assets with accurate geometry, collision meshes, material properties, and articulation definitions.
Develop and maintain toolchains for physics-ready 3D asset generation, including mesh optimization, UV unwrapping, PBR material assignment, collision hull generation, mass/inertia parameter estimation, and annotation of semantic and functional properties.
Integrate reality capture hardware and software workflows (e.g., NeRF, Gaussian splatting, structured light scanning, photogrammetry reconstruction) with the simulation platform’s asset ingestion pipeline.
Build 3D reconstruction workflows that enable rapid creation of simulation environments from real-world facility scans, supporting robotics deployment planning, testing, and validation.
Create and maintain asset toolchains supporting industry-standard formats (USD/OpenUSD, glTF, FBX, OBJ) with appropriate physics and simulation metadata for import into robotics simulation engines.
Develop synthetic data generation pipelines that leverage high-fidelity 3D assets to produce training data for perception, manipulation, and navigation models, including domain-randomized variations of materials, lighting, object placement, and camera viewpoints.
Collaborate with robotics engineers, ML researchers, and perception scientists to define asset fidelity requirements, validate simulation-to-reality visual and physical accuracy, and iterate on asset quality based on downstream model performance.
Implement quality assurance and validation workflows for 3D assets, including automated checks for mesh integrity, physics parameter consistency, rendering fidelity, and simulation stability.
Review code and technical designs to ensure adherence to team standards for 3D pipeline performance, asset management, and data integrity.
Remain current in 3D reconstruction, neural rendering, and asset generation research, proactively evaluating new techniques (e.g., generative 3D models, neural radiance fields, 3D Gaussian splatting) for integration into the platform.

What we offer

Certain roles may be eligible for benefits and other compensation.

Fulltime

Member of Technical Staff (Audio)

At Microsoft AI (MAI), we are at the forefront of technological innovation, crea...

Location

Switzerland , Zürich

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master’s degree in computer science OR equivalent technical experience

Job Responsibility

Model Training & Evaluation: Design and maintain training data “recipes” and develop evaluation frameworks
Training & Inference Optimization and Scaling: Optimize end-to-end training and inference performance
Collaboration: Work closely with other members of the AI research team
Culture & Values: Actively contribute to a positive, inclusive, and collaborative team culture

Fulltime

Select Country

Member of Technical Staff, Evaluations Engineering

Job Description

Job Responsibility

Requirements

Looking for more opportunities?