CrawlJobs Logo

Research Engineer / Scientist - Post-training

United States, Palo Alto 187500.00 - 395000.00 USD / Year · Job Posted January 13, 2026
Apply Position
Job Link Share

Job Description

At Luma, the Post-training team is responsible for unlocking creative control in the world’s largest and most powerful pre-trained multimodal models. The team works closely with the Fundamental Research team and the Product teams across Luma to train our image and video generative models improving their capabilities in the final step refining them to be better aligned with what our users expect.

Job Responsibility

  • Optimize Luma's image and video generative models through targeted fine-tuning to improve visual quality, instruction adherence, and overall performance metrics
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Luma's platforms, and execute targeted fine-tuning initiatives to address performance gaps and enhance user-facing capabilities
  • Conduct comprehensive side-by-side evaluations comparing model performance against leading market competitors, systematically analyzing the impact of post-training techniques on downstream performance metrics and identifying areas for improvement
  • Develop advanced post-training capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches
  • Architect data processing pipelines for large-scale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Research and deploy cutting-edge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks
  • Research emerging post-training methodologies in generative AI, evaluate their applicability to Luma's product ecosystem, and integrate promising techniques into our Post-training recipe

Requirements

  • Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Demonstrated ability to do independent research in Academic or Industry settings
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
  • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content
  • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation
  • Established track record of effective cross-functional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Engineer / Scientist - Post-training

8 matching positions

Research Engineer / Research Scientist, Post-Training

The Post-Training team is responsible for training and improving pre-trained mod...
Location
Location
United States , San Francisco
Salary
Salary:
295000.00 - 555000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of machine learning and machine learning applications
  • Working knowledge of relevant models, and building evaluations for model capability improvement
  • Comfortable diving into a large ML codebase to debug
  • Thrive in a dynamic and technically complex environment
  • Strong ML engineering skills and research experience, especially with novel and highly capable models
  • Passionate about product-driven research
Job Responsibility
Job Responsibility
  • Own and pursue a research agenda to improve model capability and performance
  • Collaborate closely with the other research and product teams, allowing customers to optimize their own models
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across our research stack
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale works with the industry’s leading AI labs to provide high quality data and...
Location
Location
United States , San Francisco; Seattle; New York
Salary
Salary:
252000.00 - 315000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning
  • Excellent written and verbal communication skills
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals
  • Previous experience in a customer facing role
Job Responsibility
Job Responsibility
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities
  • Design and experiment new approaches to preference optimization
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness
  • Publish research findings in top-tier AI conferences
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • commuter stipend
  • Fulltime
Read More
Arrow Right

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 5+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Demonstrated experiences in software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, Deep research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the Quality, Diversity, and Safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 3+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the quality, variety, and safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 1+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Demonstrated experience in software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, Deep research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the Quality, Diversity, and Safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Scientist, Post-Training (Tech Leadership)

Meta is seeking Research Scientists to join the Post-Training team within Meta S...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Ph.D. in Computer Science, Machine Learning, or a related technical field
  • 5+ years of experience in machine learning research, with a focus on deep learning, data alignment, NLP, or related areas
  • Demonstrated ability to lead technical research projects from conception to production
  • Collaborative communication skills and experience collaborating with technical leadership
Job Responsibility
Job Responsibility
  • Provide scientific leadership in designing novel methodologies for post-training data collection, curation, and synthetic data generation
  • Define data quality frameworks and alignment strategies that guide capability development across MSL, particularly for complex reasoning and agentic behaviors
  • Drive the scientific vision for eliciting high-quality data in expert domains (finance, legal, health, STEM) and complex agentic trajectories (Deep Research, Computer Use, UI generation)
  • Conduct research to develop and optimize post-training recipes that directly improve model quality
  • Partner with cross-functional research teams across product and model training to identify and prioritize gaps in model capabilities
  • Lead research workstreams that shape the long-term direction of data-centric AI at MSL, working independently while also contributing to team goals and organizational priorities
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

AI Research Scientist, Post-Training - Meta Superintelligence Labs

Meta is seeking Research Scientists to join the Post-Training team within Meta S...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Ph.D. in Computer Science, Machine Learning, or a related technical field
  • 3+ years of experience in machine learning research, with a focus on deep learning, data alignment, NLP, or related areas
  • Demonstrated ability to lead technical research projects from conception to production
  • Effective communication skills and experience collaborating with technical leadership
Job Responsibility
Job Responsibility
  • Design novel methodologies for post-training data collection, curation, and synthetic data generation
  • Define data quality frameworks and alignment strategies that guide capability development across MSL, particularly for complex reasoning and agentic behaviors
  • Drive the scientific vision for eliciting high-quality data in expert domains (finance, legal, health, STEM) and complex agentic trajectories (Deep research, computer use, UI generation)
  • Conduct research to develop and optimize post-training recipes that directly improve model quality
  • Partner with cross-functional research teams across product and model training to identify and prioritize gaps in model capabilities
  • Contribute to research workstreams that shape the long-term direction of data-centric AI at MSL, working independently while also contributing to team goals and organizational priorities
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Research Scientist, Safety Post Training

As the leading data and evaluation partner for frontier AI companies, Scale play...
Location
Location
United States , San Francisco, CA; New York, NY
Salary
Salary:
216000.00 - 270000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches
  • A track record of published research in machine learning, particularly in generative AI
  • At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development
  • Strong written and verbal communication skills to operate in a cross-functional team
Job Responsibility
Job Responsibility
  • Develop and apply post-training methods and interpretability techniques to make frontier AI systems safer, and better understood by researchers and policymakers
  • Design and run post-training pipelines to study how training choices affect model safety, robustness, and alignment properties
  • Develop interpretability-informed evaluations that reveal how and why models produce unsafe, deceptive, or otherwise undesirable behaviors, and use those insights to guide targeted mitigations
  • Collaborate with policymakers, engineers, and other researchers to translate post-training and interpretability findings into actionable safety standards, evaluation benchmarks, and best practices
What we offer
What we offer
  • comprehensive health, dental and vision coverage
  • retirement benefits
  • learning and development stipend
  • generous PTO
  • commuter stipend (eligible)
  • Fulltime
Read More
Arrow Right