Member of Technical Staff - Post-Training Job at Microsoft Corporation (Redmond)

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...

Location

United States, Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in relevant field OR equivalent experience
Software engineering skills with fluency in Python and modern data libraries
The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Job Responsibility

Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models; run ablation studies to measure impact and optimize data effectiveness
Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance; identify gaps and propose improvements
Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
Embody our Culture and Values

Fulltime

Member of Technical Staff, Post-Training

Advance the state of the art for model post training, ship state of the art mode...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Extremely strong software engineering skills
Proficiency in Python and related ML frameworks such as JAX, PyTorch, and XLA/MLIR
Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray)
Experience using large-scale distributed training strategies
Hands-on experience training large models at scale
Hands-on experience with the post-training phase of model training, with a strong emphasis on performance optimisation

Job Responsibility

Design and write high-performance, scalable software for training models
Consistently post-train the models to reach SOTA-level performance
Coordinate with other specialist teams (Agentic, Code…) to produce models that have strong all-encompassing performance
Craft and implement techniques to improve the performance and results of our training cycles in both the SFT and RL regimes
Research, implement, and experiment with ideas on our supercompute and data infrastructure
Learn from and work with the best researchers in the field

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Member of Technical Staff, AI Safety Post-Training

As a Member of Technical Staff, AI Safety Post-Training, you will work to develo...

Location

United States, Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
Experience prompting and working with large language models
Experience writing production-quality Python code
Demonstrated interest in Responsible AI

Job Responsibility

Leverage expertise in AI safety to uncover potential risks and develop novel mitigation strategies, including alignment techniques, constitutional AI approaches, RLHF, and robustness improvements for large language models
Create and implement comprehensive evaluation frameworks and red-teaming methodologies to assess model safety across diverse scenarios, edge cases, and potential failure modes
Build automated safety testing systems, generalize safety solutions into repeatable frameworks, and write efficient code for safety model pipelines and intervention systems
Maintain a user-oriented perspective by understanding safety needs from user perspectives, validating safety approaches through user research, and serving as a trusted advisor on AI safety matters
Track advances in AI safety research, identify relevant state-of-the-art techniques, and adapt safety algorithms to drive innovation in production systems serving millions of users
Embody our culture and values

Fulltime

Member of Technical Staff - Post Training - MAI Superintelligence Team

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...

Location

United States, Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
Experience with reward modeling, RL, or other post-training techniques

Job Responsibility

Develop data collection, evaluation, and post-training methods for models
Design hypotheses and experiment plans for rapidly iterating on model performance

Fulltime

Member of Technical Staff - Post Training, Applied

This is a rare chance to sit at the intersection of frontier foundation models a...

Location

United States, San Francisco; Boston

Salary:

Not provided

Liquid AI

Expiration Date

Until further notice

Requirements

Hands-on experience with data generation and evaluation for LLM post-training
Experience training or fine-tuning models using SFT, preference alignment, and/or RL
Strong intuition for data quality and evaluation design
Familiarity with alignment or RL techniques beyond basic supervised fine-tuning

Job Responsibility

Act as the technical owner for enterprise customer post-training engagements
Translate customer requirements into concrete post-training specifications and workflows
Design and execute data generation, filtering, and quality assessment processes
Run supervised fine-tuning, preference alignment, and reinforcement learning workflows
Design task-specific evaluations, interpret results, and feed learnings back into core post-training pipelines

What we offer

Competitive base salary with equity in a unicorn-stage company
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year

Fulltime

Member of Technical Staff - Post Training, Reinforcement Learning

At Liquid, we’re not just building AI models—we’re redefining the architecture o...

Location

United States, San Francisco; Boston

Salary:

Not provided

Liquid AI

Expiration Date

Until further notice

Requirements

Strong Python and PyTorch proficiency, with hands-on experience optimizing training pipelines
Hands-on experience with reinforcement learning and the ability to translate optimization techniques from theory into practical implementations
Track record of integrating research ideas into robust, maintainable code
Experience with frameworks like DeepSpeed, FSDP, or vLLM for efficient model training and inference
Experience working with data pipelines, including curation, validation, and analysis to support post-training objectives
Contributions to open-source machine learning projects
M.S. or Ph.D. in Computer Science, Electrical Engineering, Mathematics, or a related field

Job Responsibility

Profile, optimize, and scale RL training runs to reduce iteration time
Integrate new optimization techniques as they emerge from the research community
Design and implement tools and environments that test the boundaries of model capabilities
Turn proof-of-concept ideas into robust training pipelines and best-in-class models

What we offer

The opportunity to work directly on state-of-the-art AI systems at one of the most advanced AI companies in the world
A fast-paced, collaborative environment where your work has direct impact on model performance and product capability
The satisfaction of knowing your craftsmanship helps define the next frontier in AI

Fulltime

Member of Technical Staff - Pre Training - MAI Superintelligence Team

Help deliver one of the best foundational models in the world at Microsoft AI. A...

Location

United States, Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
Demonstrated experience in large-scale AI
Passionate about conversational AI and its deployment
Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team

Job Responsibility

Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
Collaborate closely with teams on infrastructure, data, post-training, and multimodality
Embody our culture and values

Fulltime

Member of Technical Staff, AI Data - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...

Location

United Kingdom, London

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling, or data engineering work, OR equivalent experience
Expertise in large-scale data engineering, ideally applied to AI
Expertise in Spark, Kubernetes, or similar

Job Responsibility

Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
Partner with the pre-training and post-training teams to improve our data recipe through rigorous and careful experimentation
Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models
Embody our culture and values

Fulltime
