CrawlJobs Logo

Applied AI Researcher, Post-Training

Distyl AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

130000.00 - 250000.00 USD / Year

Job Description:

The Post-Training team focuses on adapting foundation models to real-world performance and alignment requirements. Their work informs how Distyl leverages foundation models safely, effectively, and at scale across industries.

Job Responsibility:

  • Researchers develop and evaluate techniques such as supervised fine-tuning, preference optimization (DPO, RLHF, RLAIF), and continual adaptation to align models with Distyl’s enterprise systems
  • Researchers in Post-Training investigate new methods for aligning large models with human and system-level objectives. They explore trade-offs between generalization and specialization, data efficiency and robustness, capability and controllability

Requirements:

  • Deep Understanding of Post-training Techniques: Familiarity with supervised fine-tuning, preference optimization (RLHF/DPO), LoRA/PEFT, and instruction-tuning pipelines
  • Experience Adapting Frontier Models: You’ve tuned or adapted LLMs/SLMs to specialized domains or behaviors through data curation, reward modeling, or continual pretraining
  • Experience Building with Models, Not Just Building Models: We develop intelligent systems using models rather than training or fine-tuning them. Ideal candidates have expertise in compound AI systems, agentic collaboration, and associated techniques (ensembling, ReAct, graph-of-thoughts, etc.)
  • Proven Track Record of Research Results: Whether you’ve published in top journals, posted amazing work on twitter, or somewhere else we want to see what you've done
  • Uses AI Every Day: Before you can revolutionize someone else’s workflow, you need to revolutionize yours. You should be using tools like ChatGPT, Cursor, and Perplexity to accelerate your workflow
  • Strong Programming and Data Analysis Skills: While you might not consider yourself a software engineer you need to be able to build prototypes of your ideas and then perform the experiments to prove the effectiveness to a F500 Head of AI
  • Biases Towards Showing vs Telling: Our customers want to see the power of AI today vs discuss the most elegant idea that will take 5 years to realize
What we offer:
  • 100% covered medical, dental, and vision for employees and dependents
  • 401(k) with additional perks (e.g., commuter benefits, in‑office lunch)
  • Access to state‑of‑the‑art models, generous usage of modern AI tools, and real‑world business problems
  • Ownership of high‑impact projects across top enterprises
  • A mission‑driven, fast‑moving culture that prizes curiosity, pragmatism, and excellence
  • meaningful equity

Additional Information:

Job Posted:
March 08, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Applied AI Researcher, Post-Training

Research Scientist - Generative AI

As a Research Scientist in the Emergent Machine Intelligence Team at Hewlett Pac...
Location
Location
United States , Santa Barbara
Salary
Salary:
101900.00 - 234500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences.
Job Responsibility
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars.
What we offer
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development.
  • Fulltime
Read More
Arrow Right

Research Scientist - Generative AI

This role involves conducting high-quality research in generative AI, designing ...
Location
Location
United States
Salary
Salary:
101900.00 - 234500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-turning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars
What we offer
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Research Engineer

As a Research Engineer at Mercor, you’ll work at the intersection of engineering...
Location
Location
United States , San Francisco
Salary
Salary:
130000.00 - 500000.00 USD / Year
mercor.com Logo
Mercor
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong applied research background, with a focus on post-training and/or model evaluation
  • Strong coding proficiency and hands-on experience working with machine learning models
  • Strong understanding of data structures, algorithms, backend systems, and core engineering fundamentals
  • Familiarity with APIs, SQL/NoSQL databases, and cloud platforms
  • Ability to reason deeply about model behavior, experimental results, and data quality
  • Excitement to work in person in San Francisco, five days a week (with optional remote Saturdays), and thrive in a high-intensity, high-ownership environment
Job Responsibility
Job Responsibility
  • Work on post-training and RLVR pipelines to understand how datasets, rewards, and training strategies impact model performance
  • Design and run reward-shaping experiments and algorithmic improvements (e.g., GRPO, DAPO) to improve LLM tool-use, agentic behavior, and real-world reasoning
  • Quantify data usability, quality, and performance uplift on key benchmarks
  • Build and maintain data generation and augmentation pipelines that scale with training needs
  • Create and refine rubrics, evaluators, and scoring frameworks that guide training and evaluation decisions
  • Build and operate LLM evaluation systems, benchmarks, and metrics at scale
  • Collaborate closely with AI researchers, applied AI teams, and experts producing training data
  • Operate in a fast-paced, experimental research environment with rapid iteration cycles and high ownership
What we offer
What we offer
  • Generous equity grant vested over 4 years
  • A $20K relocation bonus (if moving to the Bay Area)
  • A $10K housing bonus (if you live within 0.5 miles of our office)
  • A $1K monthly stipend for meals
  • Free Equinox membership
  • Health insurance
  • Fulltime
Read More
Arrow Right

Research Engineer / Scientist - Post-training

At Luma, the Post-training team is responsible for unlocking creative control in...
Location
Location
United States , Palo Alto
Salary
Salary:
187500.00 - 395000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Demonstrated ability to do independent research in Academic or Industry settings
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
  • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content
  • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation
  • Established track record of effective cross-functional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists
Job Responsibility
Job Responsibility
  • Optimize Luma's image and video generative models through targeted fine-tuning to improve visual quality, instruction adherence, and overall performance metrics
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Luma's platforms, and execute targeted fine-tuning initiatives to address performance gaps and enhance user-facing capabilities
  • Conduct comprehensive side-by-side evaluations comparing model performance against leading market competitors, systematically analyzing the impact of post-training techniques on downstream performance metrics and identifying areas for improvement
  • Develop advanced post-training capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches
  • Architect data processing pipelines for large-scale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Research and deploy cutting-edge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks
  • Research emerging post-training methodologies in generative AI, evaluate their applicability to Luma's product ecosystem, and integrate promising techniques into our Post-training recipe
  • Fulltime
Read More
Arrow Right

AI Architect

We’re hiring an AI Architect to sit at the intersection of frontier AI research,...
Location
Location
United States , San Francisco; New York
Salary
Salary:
201600.00 - 241920.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep technical background in applied AI/ML: 5–10+ years in research, engineering, solutions engineering, or technical product roles working on LLMs or multimodal systems, ideally in high-stakes, customer-facing environments
  • Hands-on experience with model improvement workflows: demonstrated experience with post-training techniques, evaluation design, benchmarking, and model quality iteration
  • Ability to work on hard, ambiguous technical problems: proven track record of partnering directly with advanced customers or research teams to scope, reason through, and execute on deep technical challenges involving frontier models
  • Strong technical fluency: you can read papers, interrogate metrics, write or review complex Python/SQL for analysis, and reason about model-data trade-offs
  • Executive presence with world-class researchers and enterprise leaders
  • excellent writing and storytelling
  • Bias to action: you ship, learn, and iterate.
Job Responsibility
Job Responsibility
  • Translate research → product: work with client side researchers on post-training, evals, safety/alignment and build the primitives, data, and tooling they need
  • Partner deeply with core customers and frontier labs: work hands-on with leading AI teams and frontier research labs to tackle hard, open-ended technical problems related to frontier model improvement, performance, and deployment
  • Shape and propose model improvement work: translate customer and research objectives into clear, technically rigorous proposals—scoping post-training, evaluation, and safety work into well-defined statements of work and execution plans
  • Translate research into production impact: collaborate with customer-side researchers on post-training, evaluations, and alignment, and help design the data, primitives, and tooling required to improve frontier models in practice
  • Own the end-to-end lifecycle: lead discovery, write crisp PRDs and technical specs, prioritize trade-offs, run experiments, ship initial solutions, and scale successful pilots into durable, repeatable offerings
  • Lead complex, high-stakes engagements: independently run technical working sessions with senior customer stakeholders
  • define success metrics
  • surface risks early
  • and drive programs to measurable outcomes
  • Partner across Scale: collaborate closely with research (agents, browser/SWE agents), platform, operations, security, and finance to deliver reliable, production-grade results for demanding customers
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity based compensation.
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, AI Research Multi-modal Post-Training

Meta is seeking Research Scientist Interns in the Meta Superintelligence org. We...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, NLP, Reinforcement Learning (RL), Computer Vision, Artificial Intelligence, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in Python or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning the semantics of data at scale (images, video, text, audio, and other modalities)
  • Improve and propose new methods for post-training foundation models across the spectrum of techniques including reinforcement learning and supervised fine tuning
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Devise better data-driven models of image multi-modal understanding
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Applied Research - RL & Agents

Prime Intellect builds the infrastructure that frontier AI labs build internally...
Location
Location
United States , San Francisco
Salary
Salary:
Not provided
Prime Intellect
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment
  • Experience with agent frameworks and tooling (e.g. DSPy, LangGraph, MCP, Stagehand)
  • Familiarity with distributed training/inference frameworks (e.g., vLLM, sglang, Accelerate, Ray, Torch)
  • Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL
  • Passion for advancing the state-of-the-art in reasoning and building practical, agentic AI systems
  • Strong technical writing abilities (documentation, blogs, papers) and research taste
  • Eagerness to drive collaborations with external partners and engage with the broader open-source community
Job Responsibility
Job Responsibility
  • Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale
  • Building Robust Infrastructure: Developing the systems and frameworks that enable these agents to operate reliably, efficiently, and at massive scale
  • Bridge Between Applications & Research: Translate ambiguous objectives into clear technical requirements that guide product and research priorities
  • Prototype in the Field: Rapidly design and deploy agents, evals, and harnesses for real-world tasks to validate solutions
  • Application-Driven Research & Infrastructure: Shape the direction and feature set for verifiers, the Environments Hub, training services, and other research platform offerings
  • Build high‑quality examples, reference implementations, and “recipes” that make it easy for others to extend the stack
  • Prototype agents and eval harnesses tailored to real-world use cases and external systems
  • Pair with technical end‑users (research teams, infra‑heavy customers, open‑source contributors) to design environments, evals, and verifiers that reflect real workloads
  • Post-training & Reinforcement Learning: Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks
  • Build evaluations and harnesses and to measure reasoning, robustness, and agentic behavior in real-world workflows
What we offer
What we offer
  • Competitive Compensation + equity incentives
  • Flexible Work (San Francisco or hybrid-remote)
  • Visa Sponsorship & relocation support
  • Professional Development budget
  • Team Off-sites & conference attendance
  • Fulltime
Read More
Arrow Right

Software Engineer, AI Platform

Perplexity is seeking an experienced Software Engineer focusing on building the ...
Location
Location
United States , San Francisco, Palo Alto
Salary
Salary:
210000.00 - 385000.00 USD / Year
perplexity.ai Logo
Perplexity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming and data engineering skills, with proficiency in open source & distributed framework(AWS, Spark, Flink, Iceberg, DynamoDB)
  • Familiarity with cloud-based data services (e.g., AWS, RDS, DynamoDB), containerized infrastructure (e.g., EKS, Docker), and data streaming (Flink, Spark streaming, CDC)
  • Strong quantitative and engineering skills with experience in estimating performance at high scale
  • Experience supporting various ML/AI engineering teams to build scalable frameworks to accelerate R&D for frontier models and AI products
  • Experience iterating on improving LLM responses and set up proper evaluation framework or Judges to analysis performance holistically.
  • Self-motivated with a strong sense of ownership of systems and designs
  • 5+ years of industry experience in distributed systems or AI infrastructure
Job Responsibility
Job Responsibility
  • Collaborate closely with AI Product, Applied ML, Post-Training, and Data Science teams to design, build, and maintain scalable data pipelines and data lakes
  • Develop high-performance infrastructure that powers personalization features including memory, discover, and agentic products
  • Create a scalable, multi-modal evaluation platform for all Perplexity AI products, including personalization, pro search, labs, deep research, and Comet
  • Design tools and abstractions on foundational infrastructure to enhance personalization, analytics, recommendations, AI products, and post-training capabilities
  • Holistically improve engineering foundation to support rapid growth of Perplexity products and international user base.
What we offer
What we offer
  • equity
  • health
  • dental
  • vision
  • retirement
  • fitness
  • commuter and dependent care accounts
  • Fulltime
Read More
Arrow Right