CrawlJobs Logo

Member of Technical Staff, AI Post-Training

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United Kingdom , London

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms for post-training large language models (LLMs) and ship those models to millions of users using Copilot every day. The AI Post-Training team at Microsoft AI is responsible for all aspects of post-training the models that we serve in Copilot, including: data collection, building evaluations that are aligned with our product and model capability goals, prototyping new capabilities to make Copilot more powerful, developing new finetuning algorithms to supercharge our models, working with platform and engineering teams to deploy those models, and closing the loop by improving the models with feedback we receive from our users.

Job Responsibility:

  • Develop data collection, evaluation, and finetuning methods for models
  • Design hypotheses and experiment plans for rapidly iterating on model performance
  • Prototype new model features and capabilities and collaborate with engineers and researchers across Microsoft AI to make them a reality
  • Collaborate with pretraining and product platform teams to establish good vertical integration and ship models that Copilot users love
  • Embody our culture and values

Requirements:

  • Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Expertise in post-training of AI models
  • Demonstrated experience in large-scale AI
  • Passionate about conversational AI and its deployment
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
  • Proven research track record in a domain related field supported by exceptional papers

Additional Information:

Job Posted:
January 06, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of Technical Staff, AI Post-Training

Member of Technical Staff – Model Training

At Inflection AI, our public benefit mission is to harness the power of AI to im...
Location
Location
United States , Palo Alto
Salary
Salary:
175000.00 - 350000.00 USD / Year
inflection.ai Logo
Inflection AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have hands-on experience training and fine-tuning large transformer models on multi-GPU / multi-node clusters
  • Are fluent in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy digging into distributed-training internals, mixed precision, and memory-efficiency tricks
  • Have shipped or published work in RLHF, DPO, GRPO, or RLAIF and understand their practical trade-offs
  • Care deeply about training tools, pipelines, and reproducibility—you automate the boring parts so you can iterate on the fun parts
  • Balance research curiosity with product pragmatism—you know when to run an ablation and when to ship
  • Communicate crisply with both technical and non-technical teammates
  • Have a bachelor’s degree or equivalent in a related field to the offered position requirements
Job Responsibility
Job Responsibility
  • Contribute to end-to-end post-training workflows—dataset curation, hyper-parameter search, evaluation, and rollout—using PyTorch, Torchtune, FSDP/DeepSpeed, and our internal orchestration stack
  • Prototype and compare alignment techniques (e.g., curriculum RL, multi-objective reward modeling, tool-use fine-tuning) and push the best ideas into production
  • Automate training at scale: build robust pipeline components, tools, scripts, and dashboards so experiments are reproducible and easy to trace
  • Define the metrics that matter
  • run A/B tests and iterate quickly to meet aggressive quality targets
  • Collaborate with inference, safety, and product teams to land improvements in customer-facing systems
What we offer
What we offer
  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area
  • Competitive stock options
Read More
Arrow Right

Member of Technical Staff - Post Training, Applied

This is a rare chance to sit at the intersection of frontier foundation models a...
Location
Location
United States , San Francisco; Boston
Salary
Salary:
Not provided
liquid.ai Logo
Liquid AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with data generation and evaluation for LLM post-training
  • Experience training or fine-tuning models using SFT, preference alignment, and/or RL
  • Strong intuition for data quality and evaluation design
  • Familiarity with alignment or RL techniques beyond basic supervised fine-tuning
Job Responsibility
Job Responsibility
  • Act as the technical owner for enterprise customer post-training engagements
  • Translate customer requirements into concrete post-training specifications and workflows
  • Design and execute data generation, filtering, and quality assessment processes
  • Run supervised fine-tuning, preference alignment, and reinforcement learning workflows
  • Design task-specific evaluations, interpret results, and feed learnings back into core post-training pipelines
What we offer
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Post-Training

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...
Location
Location
Switzerland , Zürich
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Expertise in post-training of AI models
  • Demonstrated experience in large-scale AI
  • Passionate about conversational AI and its deployment
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
  • Proven research track record in a domain related field supported by exceptional papers
Job Responsibility
Job Responsibility
  • Develop data collection, evaluation, and finetuning methods for models
  • Design hypotheses and experiment plans for rapidly iterating on model performance
  • Prototype new model features and capabilities and collaborate with engineers and researchers across Microsoft AI to make them a reality
  • Collaborate with pretraining and product platform teams to establish good vertical integration and ship models that Copilot users love
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Post-Training

Advance the state of the art for model post training, ship state of the art mode...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extremely strong software engineering skills
  • Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR
  • Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray)
  • Experience using large-scale distributed training strategies
  • Hands on experience on training large model at scale
  • Hands on experience with the post training phase of model training, with a strong emphasis on performance optimisation
Job Responsibility
Job Responsibility
  • Design and write high-performant and scalable software for training models
  • Consistently post-train the models to reach SOTA level performance
  • Coordinate with other specialist teams (Agentic, Code…) to produce models that have strong all encompassing performance
  • Craft and implement techniques to improve the performance and results of our training cycles both on the SFT and the RL regime
  • Research, implement, and experiment with ideas on our supercompute and data infrastructure
  • Learn from and work with the best researchers in the field
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Safety Post-Training

As a Member of Technical Staff, AI Safety Post-Training, you will work to develo...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Experience prompting and working with large language models
  • Experience writing production-quality Python code
  • Demonstrated interest in Responsible AI
Job Responsibility
Job Responsibility
  • Leverage expertise in AI safety to uncover potential risks and develop novel mitigation strategies, including alignment techniques, constitutional AI approaches, RLHF, and robustness improvements for large language models
  • Create and implement comprehensive evaluation frameworks and red-teaming methodologies to assess model safety across diverse scenarios, edge cases, and potential failure modes
  • Build automated safety testing systems, generalize safety solutions into repeatable frameworks, and write efficient code for safety model pipelines and intervention systems
  • Maintain a user-oriented perspective by understanding safety needs from user perspectives, validating safety approaches through user research, and serving as a trusted advisor on AI safety matters
  • Track advances in AI safety research, identify relevant state-of-the-art techniques, and adapt safety algorithms to drive innovation in production systems serving millions of users
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Multimodal - MAI Superintelligence Team

At Microsoft AI, we are on a mission to train the world’s most capable AI fronti...
Location
Location
Switzerland , Zürich
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Expertise in multimodal Research with a strong publishing track record
  • Proven expertise in areas of interest, evidenced by an exceptional publication track record and/or significant technical leadership in high-impact projects
  • Strong analytical skills, attention to detail, and a commitment to data-driven decision-making
  • Experience and/or in-depth understandings about large-scale distributed systems
  • Ability to work collaboratively in a fast-paced, innovative environment
Job Responsibility
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Multimodal

At Microsoft AI, we are on a mission to train the world’s most capable AI fronti...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Expertise in multimodal Research with a strong publishing track record
  • Proven expertise in areas of interest, evidenced by an exceptional publication track record and/or significant technical leadership in high-impact projects
  • Strong analytical skills, attention to detail, and a commitment to data-driven decision-making
  • Experience and/or in-depth understandings about large-scale distributed systems
  • Ability to work collaboratively in a fast-paced, innovative environment
Job Responsibility
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Next Generation Agents

Agentic LLM systems are being deployed widely across enterprise companies includ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering skills
  • Proficiency in Python and have some experience with ML-related code (e.g., pytorch, numpy, etc.)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility
Job Responsibility
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next-generation of on-line learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right