Member of Technical Staff, AI Post-Training Job at Microsoft Corporation (London)

Member of Technical Staff – Model Training

At Inflection AI, our public benefit mission is to harness the power of AI to im...

Location

United States , Palo Alto

Salary:

175000.00 - 350000.00 USD / Year

Inflection AI

Expiration Date

Until further notice

Requirements

Have hands-on experience training and fine-tuning large transformer models on multi-GPU / multi-node clusters
Are fluent in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy digging into distributed-training internals, mixed precision, and memory-efficiency tricks
Have shipped or published work in RLHF, DPO, GRPO, or RLAIF and understand their practical trade-offs
Care deeply about training tools, pipelines, and reproducibility—you automate the boring parts so you can iterate on the fun parts
Balance research curiosity with product pragmatism—you know when to run an ablation and when to ship
Communicate crisply with both technical and non-technical teammates
Have a bachelor’s degree or equivalent in a related field to the offered position requirements

Job Responsibility

Contribute to end-to-end post-training workflows—dataset curation, hyper-parameter search, evaluation, and rollout—using PyTorch, Torchtune, FSDP/DeepSpeed, and our internal orchestration stack
Prototype and compare alignment techniques (e.g., curriculum RL, multi-objective reward modeling, tool-use fine-tuning) and push the best ideas into production
Automate training at scale: build robust pipeline components, tools, scripts, and dashboards so experiments are reproducible and easy to trace
Define the metrics that matter
run A/B tests and iterate quickly to meet aggressive quality targets
Collaborate with inference, safety, and product teams to land improvements in customer-facing systems

What we offer

Diverse medical, dental and vision options
401k matching program
Unlimited paid time off
Parental leave and flexibility for all parents and caregivers
Support of country-specific visa needs for international employees living in the Bay Area
Competitive stock options

Member of Technical Staff - Post Training, Applied

This is a rare chance to sit at the intersection of frontier foundation models a...

Location

United States , San Francisco; Boston

Salary:

Not provided

Liquid AI

Expiration Date

Until further notice

Requirements

Hands-on experience with data generation and evaluation for LLM post-training
Experience training or fine-tuning models using SFT, preference alignment, and/or RL
Strong intuition for data quality and evaluation design
Familiarity with alignment or RL techniques beyond basic supervised fine-tuning

Job Responsibility

Act as the technical owner for enterprise customer post-training engagements
Translate customer requirements into concrete post-training specifications and workflows
Design and execute data generation, filtering, and quality assessment processes
Run supervised fine-tuning, preference alignment, and reinforcement learning workflows
Design task-specific evaluations, interpret results, and feed learnings back into core post-training pipelines

What we offer

Competitive base salary with equity in a unicorn-stage company
We pay 100% of medical, dental, and vision premiums for employees and dependents
401(k) matching up to 4% of base pay
Unlimited PTO plus company-wide Refill Days throughout the year

Fulltime

Member of Technical Staff, AI Post-Training

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...

Location

Switzerland , Zürich

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
Expertise in post-training of AI models
Demonstrated experience in large-scale AI
Passionate about conversational AI and its deployment
Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Proven research track record in a domain related field supported by exceptional papers

Job Responsibility

Develop data collection, evaluation, and finetuning methods for models
Design hypotheses and experiment plans for rapidly iterating on model performance
Prototype new model features and capabilities and collaborate with engineers and researchers across Microsoft AI to make them a reality
Collaborate with pretraining and product platform teams to establish good vertical integration and ship models that Copilot users love
Embody our culture and values

Fulltime

Member of Technical Staff, Post-Training

Advance the state of the art for model post training, ship state of the art mode...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Extremely strong software engineering skills
Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR
Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray)
Experience using large-scale distributed training strategies
Hands on experience on training large model at scale
Hands on experience with the post training phase of model training, with a strong emphasis on performance optimisation

Job Responsibility

Design and write high-performant and scalable software for training models
Consistently post-train the models to reach SOTA level performance
Coordinate with other specialist teams (Agentic, Code…) to produce models that have strong all encompassing performance
Craft and implement techniques to improve the performance and results of our training cycles both on the SFT and the RL regime
Research, implement, and experiment with ideas on our supercompute and data infrastructure
Learn from and work with the best researchers in the field

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Member of Technical Staff - Post Training - MAI Superintelligence Team

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...

Location

United States , Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Have experience with reward modeling, RL, or other post-training techniques

Job Responsibility

Develop data collection, evaluation, and post-training methods for models
Design hypotheses and experiment plans for rapidly iterating on model performance

Fulltime

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...

Location

United States , Redmond

Salary:

84200.00 - 199000.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree (complete or in progress) in relevant field AND 3+ months related research internship experience OR Master's Degree in relevant field OR equivalent experience
Software engineering skills with fluency in Python and modern data libraries
The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Job Responsibility

Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
run ablation studies to measure impact and optimize data effectiveness
Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
identify gaps and propose improvements
Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
Embody our Culture and Values

Fulltime

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in relevant field OR equivalent experience
Software engineering skills with fluency in Python and modern data libraries
The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Job Responsibility

Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
run ablation studies to measure impact and optimize data effectiveness
Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
identify gaps and propose improvements
Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
Embody our Culture and Values

Fulltime

Member of Technical Staff, AI Safety Post-Training

As a Member of Technical Staff, AI Safety Post-Training, you will work to develo...

Location

United States , Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Experience prompting and working with large language models
Experience writing production-quality Python code
Demonstrated interest in Responsible AI

Job Responsibility

Leverage expertise in AI safety to uncover potential risks and develop novel mitigation strategies, including alignment techniques, constitutional AI approaches, RLHF, and robustness improvements for large language models
Create and implement comprehensive evaluation frameworks and red-teaming methodologies to assess model safety across diverse scenarios, edge cases, and potential failure modes
Build automated safety testing systems, generalize safety solutions into repeatable frameworks, and write efficient code for safety model pipelines and intervention systems
Maintain a user-oriented perspective by understanding safety needs from user perspectives, validating safety approaches through user research, and serving as a trusted advisor on AI safety matters
Track advances in AI safety research, identify relevant state-of-the-art techniques, and adapt safety algorithms to drive innovation in production systems serving millions of users
Embody our culture and values

Fulltime

Select Country

Member of Technical Staff, AI Post-Training

Microsoft Corporation

Location:
United Kingdom , London

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:
January 06, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Member of Technical Staff, AI Post-Training

Member of Technical Staff – Model Training

Member of Technical Staff - Post Training, Applied

Member of Technical Staff, AI Post-Training

Member of Technical Staff, Post-Training

Member of Technical Staff - Post Training - MAI Superintelligence Team

Member of Technical Staff - Post-Training

Member of Technical Staff - Post-Training

Member of Technical Staff, AI Safety Post-Training

Our AI answers in your language

Member of Technical Staff, AI Post-Training

Microsoft Corporation

Location:United Kingdom , London

Category:IT - Software Development

Contract Type:Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:January 06, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Member of Technical Staff, AI Post-Training

Member of Technical Staff – Model Training

Member of Technical Staff - Post Training, Applied

Member of Technical Staff, AI Post-Training

Member of Technical Staff, Post-Training

Member of Technical Staff - Post Training - MAI Superintelligence Team

Member of Technical Staff - Post-Training

Member of Technical Staff - Post-Training

Member of Technical Staff, AI Safety Post-Training

Location:
United Kingdom , London

Category:
IT - Software Development

Contract Type:
Not provided

Job Posted:
January 06, 2026