CrawlJobs Logo

Research Scientist: Post-Training

generalistai.com Logo

Generalist AI

Location Icon

Location:
United States , San Mateo

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

200000.00 - 350000.00 USD / Year

Job Description:

Pretraining gives us a general model. Post-training makes it useful, controllable, safe, and performant in the real world. You will train large pretrained robot models into production-ready systems via fine-tuning, reinforcement learning, steering, human feedback, task specialization, evaluation, and on-robot validation—at scale. Regardless of your initial background, you will grow into becoming a full-stack ML roboticist capable of quickly pinpoint issues on either side of ML or controls, and all the places in between. This is where research meets reality.

Job Responsibility:

  • Designing fine-tuning and adaptation strategies for downstream robotic tasks and embodiments
  • Developing methods for improving reliability, robustness, and controllability
  • Building evaluation frameworks that measure real-world robot performance, not just offline metrics
  • Improving inference-time performance (latency, stability, memory footprint) in collaboration with ML infrastructure
  • Leveraging techniques such as imitation learning, RL, distillation, synthetic data, and curriculum learning
  • Closing the loop between model outputs and physical-world outcomes

Requirements:

  • Experience with fine-tuning large models for downstream tasks (RLHF, IL, RL, distillation, domain adaptation, etc.)
  • Worked on embodied AI, robotics, or real-world ML systems
  • Care deeply about evaluation, benchmarking, and failure analysis
  • Comfortable debugging across the ML stack — from loss curves to robot behavior
  • Enjoy rapid iteration with real-world feedback loops
  • Want to bridge the gap between foundation models and physical deployment
What we offer:

Offers Equity

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:
PREMIUM
More languages and countries
+ Unlock 31694 hidden job offers
Languages
English Čeština Deutsch Ελληνικά Español Français +15
Countries
United States United Kingdom India Canada Australia +
See plans
Plans from $2.99 / month

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Scientist: Post-Training

Research Scientist - Generative AI

As a Research Scientist in the Emergent Machine Intelligence Team at Hewlett Pac...
Location
Location
United States , Santa Barbara
Salary
Salary:
101900.00 - 234500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences.
Job Responsibility
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars.
What we offer
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development.
  • Fulltime
Read More
Arrow Right

Research Scientist - Generative AI

This role involves conducting high-quality research in generative AI, designing ...
Location
Location
United States
Salary
Salary:
101900.00 - 234500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-turning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars
What we offer
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale works with the industry’s leading AI labs to provide high quality data and...
Location
Location
United States , San Francisco; Seattle; New York
Salary
Salary:
252000.00 - 315000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning
  • Excellent written and verbal communication skills
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals
  • Previous experience in a customer facing role
Job Responsibility
Job Responsibility
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities
  • Design and experiment new approaches to preference optimization
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness
  • Publish research findings in top-tier AI conferences
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • commuter stipend
  • Fulltime
Read More
Arrow Right

Research Engineer / Scientist - Post-training

At Luma, the Post-training team is responsible for unlocking creative control in...
Location
Location
United States , Palo Alto
Salary
Salary:
187500.00 - 395000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Demonstrated ability to do independent research in Academic or Industry settings
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
  • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content
  • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation
  • Established track record of effective cross-functional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists
Job Responsibility
Job Responsibility
  • Optimize Luma's image and video generative models through targeted fine-tuning to improve visual quality, instruction adherence, and overall performance metrics
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Luma's platforms, and execute targeted fine-tuning initiatives to address performance gaps and enhance user-facing capabilities
  • Conduct comprehensive side-by-side evaluations comparing model performance against leading market competitors, systematically analyzing the impact of post-training techniques on downstream performance metrics and identifying areas for improvement
  • Develop advanced post-training capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches
  • Architect data processing pipelines for large-scale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Research and deploy cutting-edge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks
  • Research emerging post-training methodologies in generative AI, evaluate their applicability to Luma's product ecosystem, and integrate promising techniques into our Post-training recipe
  • Fulltime
Read More
Arrow Right

AI Research Lead

Perplexity is seeking an exceptional AI Research Tech Lead to drive our research...
Location
Location
United States , San Francisco
Salary
Salary:
300000.00 - 470000.00 USD / Year
perplexity.ai Logo
Perplexity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 5 years of experience working on relevant AI/ML projects with 3+ years in a technical leadership role
  • Proven track record of leading and mentoring technical and research teams
  • A Computer Science graduate degree at a premier academic institution
  • Deep expertise with large-scale LLMs and Deep Learning systems
  • Strong programming skills with versatility across multiple languages and frameworks
  • Demonstrated ability to set technical vision and drive execution
  • Experience with pre-training and post-training techniques (self-supervised learning along with SFT/DPO/GRPO/PPO)
  • Self-starter with exceptional ownership mentality and ability to work in ambiguous environments
  • Passion for solving challenging problems and pushing the boundaries of AI research
Job Responsibility
Job Responsibility
  • Define and execute the macro research direction across multiple modalities, including post-training LLMs for agent trajectories and future mid-training initiatives
  • Lead strategic research planning and roadmap development to advance Sonar model capabilities
  • Drive innovation in supervised and reinforcement learning techniques for query answering
  • Collaborate with leadership to align research priorities with product and business objectives
  • Coach and mentor a team of AI research scientists and engineers, fostering their technical and professional growth
  • Establish the long-term macro research direction across the team, including our direction across different modalities
  • Lead hiring and onboarding of new research talent
  • Create a collaborative environment that encourages knowledge sharing and innovation
  • Post-train SOTA LLMs on query answering using cutting-edge supervised and reinforcement learning techniques
  • Own and optimize the full stack data, training, and evaluation pipelines required for LLM post-training
What we offer
What we offer
  • Equity
  • Health
  • Dental
  • Vision
  • Retirement
  • Fitness
  • Commuter and dependent care accounts
  • Fulltime
Read More
Arrow Right

AI Research Scientist, Post-Training - Meta Superintelligence Labs

Meta is seeking Research Scientists to join the Post-Training team within Meta S...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Ph.D. in Computer Science, Machine Learning, or a related technical field
  • 3+ years of experience in machine learning research, with a focus on deep learning, data alignment, NLP, or related areas
  • Demonstrated ability to lead technical research projects from conception to production
  • Effective communication skills and experience collaborating with technical leadership
Job Responsibility
Job Responsibility
  • Design novel methodologies for post-training data collection, curation, and synthetic data generation
  • Define data quality frameworks and alignment strategies that guide capability development across MSL, particularly for complex reasoning and agentic behaviors
  • Drive the scientific vision for eliciting high-quality data in expert domains (finance, legal, health, STEM) and complex agentic trajectories (Deep research, computer use, UI generation)
  • Conduct research to develop and optimize post-training recipes that directly improve model quality
  • Partner with cross-functional research teams across product and model training to identify and prioritize gaps in model capabilities
  • Contribute to research workstreams that shape the long-term direction of data-centric AI at MSL, working independently while also contributing to team goals and organizational priorities
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Research Scientist - Large Language Model

This is a rare opportunity to help define the future of large-scale language mod...
Location
Location
United States , Palo Alto
Salary
Salary:
250000.00 - 450000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong foundation in machine learning and large language models
  • Deep understanding of autoregressive transformers and large-scale training dynamics
  • Experience with pre-training large models and/or post-training techniques such as instruction tuning, RLHF, preference optimization, or distillation
  • Hands-on experience with PyTorch and distributed training at scale
  • Comfortable operating across research and production environments
Job Responsibility
Job Responsibility
  • Architect and scale large autoregressive language models
  • Design improved pre-training objectives to enhance reasoning, knowledge retention, and compositional generalization
  • Develop mid-training strategies such as continued pre-training, domain adaptation, curriculum learning, and synthetic data integration
  • Advance post-training techniques, including instruction tuning, preference optimization, reinforcement learning, distillation, and inference-time compute scaling
  • Study and improve long-context modeling, planning depth, and multi-step reasoning behavior
  • Curate and construct massive, high-quality text corpora for pre-training
  • Design synthetic data pipelines for reasoning, tool use, mathematics, coding, and structured problem solving
  • Develop filtering, mixture weighting, and curriculum strategies that shape emergent capabilities
  • Formulate new tasks that improve coherence, logical consistency, factual grounding, and robustness
  • Train frontier-scale language models across large GPU clusters
  • Fulltime
Read More
Arrow Right

Ai Research Scientist, Video Generation And Post Training, Fair

Meta is seeking a Research Scientist to join the Fundamental AI Research (FAIR) ...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD or equivalent experience in Computer Science, Electrical Engineering, or a related field
  • Demonstrated expertise in video generation, computer vision, or multimodal AI
  • Experience with large-scale model training, post-training optimization techniques, and data curation
  • Publication record in relevant fields
Job Responsibility
Job Responsibility
  • Conduct fundamental and applied research in video generation, including generative models, video synthesis, and multimodal learning
  • Develop and optimize post-training paradigms for large-scale video and multimodal models, improving their performance, robustness, and generalization
  • Collaborate with teams across Meta to build perceptual foundations for real-time embodied agents and conversational AI
  • Contribute to the development and deployment of frontier models (e.g., Llama, LMMs) and push the boundaries of video and media generation
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right