Research Scientist: Pretraining

Generalist AI

Location:
United States, San Mateo

Contract Type:
Not provided

Salary:

200000.00 - 350000.00 USD / Year

Job Description:

You will build the base intelligence layer for robotics. We train large-scale robot foundation models from massive multimodal datasets spanning video, proprioception, action traces, language, and more. You will design and run the core large-scale training efforts that give our models fundamentally new general capabilities across embodiments, tasks, and environments. You will “live and breathe” all forms of robot data.

Job Responsibility:

  • Designing and executing large-scale pretraining runs for robot foundation models (transformer- and diffusion-based architectures)
  • Defining model architectures, objectives, and training curricula across multimodal robotic data (vision, action, state, language)
  • Developing scalable data mixtures and sampling strategies across petabyte-scale datasets
  • Guiding data collection operations towards new directions, as well as sourcing new datasets
  • Running ablations to understand scaling laws, data quality effects, and architecture tradeoffs
  • Collaborating closely with ML Infra and Systems to push cluster utilization, throughput, and reliability
  • Turning raw robotic interaction data into generalizable model capabilities

Requirements:

  • Have deep experience training large transformer or diffusion models at scale (generative models such as language, audio, or video models)
  • Have led or significantly contributed to multi-node, multi-GPU distributed training efforts
  • Have worked on scaling laws, optimization dynamics, and large-model failure modes
  • Have strong PyTorch fundamentals and comfort debugging at every layer of the stack
  • Care about both empirical rigor and raw iteration speed
  • Are excited about building general-purpose robot intelligence from first principles

What we offer:

Offers Equity

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Research Scientist: Pretraining

Sr. Applied Research Scientist

We are building AI to simulate the world through merging art and science. We bel...
Location:
United States
Salary:
280000.00 - 380000.00 USD / Year
Runway
Expiration Date
Until further notice
Requirements:
  • 4+ years of relevant ML engineering or research experience
  • Very strong programming skills and ability to write clean and maintainable research code
  • Deep interest in building human-in-the-loop systems for creativity
  • Passion for seeing research through from initial conception to eventual application
  • Experience mentoring and teaching other researchers
  • Strong communication, collaboration, and documentation skills
Job Responsibility:
  • Lead efforts in pretraining the next generation of Runway’s multimodal models
Employment Type:
Full-time

Research Scientist Intern, AI Research - Multimodal Pretraining

Meta is seeking Research Scientist Interns in the multimodal pretraining team in...
Location:
United States, Menlo Park
Salary:
7650.00 - 12134.00 USD / Month
Meta
Expiration Date
Until further notice
Requirements:
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Computer Vision, Artificial Intelligence, or relevant technical field
  • Past projects/publications in the general domain of neural scaling laws, model architectures, image/text modeling, vision-language modeling
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in PyTorch, Triton, or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility:
  • Perform research to advance the frontiers of multimodal (images, video, text, audio, and other modalities) pretraining, to develop the next generation of multimodal architectures
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development

Junior Research Infrastructure Engineer

We are seeking a Product-Minded Junior Research Infrastructure Engineer to join ...
Location:
United States, Sunnyvale
Salary:
Not provided
Meshy LLC
Expiration Date
Until further notice
Requirements:
  • 2+ years of experience in software engineering, backend development, or distributed systems
  • Strong programming skills in Python (Scala/Java/C++ a plus)
  • Familiarity with distributed frameworks (Spark, Dask, Ray) and cloud platforms (AWS/GCP/Azure)
  • Experience with workflow orchestration tools (Temporal, Celery, or Airflow)
  • Proficiency with Infrastructure as Code (Terraform) and CI/CD tools (GitHub Actions)
  • Experience building web applications or internal tools using React or Next.js
  • A 'product-first' mindset: an interest in how users interact with infrastructure and a desire to build clean, functional interfaces
Job Responsibility:
  • Participate in the design and implementation of distributed task orchestration systems using Temporal or Celery
  • Architect pipelines across cloud object storage (S3, GCS), data lakes, and metadata catalogs
  • Implement partitioning, sharding, and caching strategies to ensure data processing pipelines are resilient, highly available, and consistent
  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data (images, 3D/2D assets, binaries)
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics
  • Support preprocessing of unstructured assets (e.g., images, 3D/2D models, video) for training pipelines, including format conversion, normalization, augmentation, and metadata extraction
  • Implement validation and quality checks to ensure datasets meet ML training requirements
  • Collaborate with ML researchers to quickly adapt pipelines to evolving pretraining and evaluation needs
  • Use infrastructure-as-code (Terraform, Kubernetes, etc.) to manage scalable and reproducible environments
  • Manage data assets using Databricks Asset Bundles (DABs) and build rigorous CI/CD pipelines (GitHub Actions)
What we offer:
  • Competitive salary, equity, and benefits package
  • Opportunity to work with a talented and passionate team at the forefront of AI and 3D technology
  • Flexible work environment, with options for remote and on-site work
  • Opportunities for fast professional growth and development
  • An inclusive culture that values creativity, innovation, and collaboration
  • Unlimited, flexible time off
  • Stock options available for core team members
  • 401(k) plan for employees
  • Comprehensive health, dental, and vision insurance
  • The latest and best office equipment
Employment Type:
Full-time

Tech Lead - Pretraining Team, Wayve Foundation Model

This is a rare opportunity to lead foundational work at the intersection of larg...
Location:
United States, Sunnyvale
Salary:
Not provided
Wayve
Expiration Date
Until further notice
Requirements:
  • Leadership in data-centric AI: Experience leading research or engineering teams focused on dataset curation, filtering, or enrichment at scale, particularly for large-scale model pretraining.
  • Contributions to data benchmarks or tools: Involvement in projects like DataComp, LAION, DINO, MOLMO, or equivalent initiatives that define or evaluate pretraining dataset quality.
  • Deep understanding of distributed data processing: Strong working knowledge of frameworks such as Ray, Spark, Dask, or equivalent, and designing scalable, fault-tolerant data pipelines.
  • Hands-on deep learning expertise: Strong proficiency in PyTorch and a solid grasp of how data quality, distribution, and structure impact training dynamics and model generalisation.
  • Experimental mindset: Demonstrated ability to run and interpret data-centric experiments (e.g., small-scale trials, ablations) to inform large-scale model training.
  • Collaboration with research: Experience working closely with ML researchers and contributing to experimental design, pretraining strategies, or evaluation design.
  • Minimum 5 years of relevant industry experience: Including at least several years in data-heavy, model-driven environments involving deep learning at scale.
Job Responsibility:
  • Lead data curation, enrichment, and filtering efforts for large-scale pretraining of embodied models
  • Build and manage distributed data processing and ingestion pipelines across modalities
  • Partner with research teams to run data-centric experiments and influence model training strategy
  • Identify, integrate, and leverage third-party datasets to enhance pretraining and evaluation
  • Manage and mentor a team of engineers and data scientists to deliver scientific and technical impact
What we offer:
  • Attractive compensation with salary and equity
  • Immersion in a team of world-class researchers, engineers and entrepreneurs
  • A unique position to shape the future of autonomy and tackle the biggest challenge of our time
  • Bespoke learning and development opportunities
  • Relocation support with visa sponsorship
  • Flexible working hours - we trust you to do your job well, at times that suit you and your team
  • Benefits such as an onsite chef, workplace nursery scheme, private health insurance, therapy, daily yoga, onsite bar, large social budgets, unlimited L&D requests, enhanced parental leave, and more!
Employment Type:
Full-time

Foundational AI Research Scientist - FAIR

Meta is seeking Research Scientists to join its Fundamental AI Research (FAIR) o...
Location:
United States, Bellevue
Salary:
154000.00 - 217000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • A PhD in AI, computer science, data science, or related technical fields
  • First-authored publications at peer-reviewed conferences, such as ICML, NeurIPS, ICLR, ACL, EMNLP, and other similar venues, reflecting experience in LLM pretraining
  • 1+ years experience holding an industry, postdoctoral, faculty, or government researcher position
  • Research background in machine learning, artificial intelligence, computational statistics, applied mathematics, or related areas
  • Experience in developing and debugging in Python or similar programming languages
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility:
  • Perform research that enables learning the semantics of data (images, video, text, audio, and other modalities)
  • Perform research to advance the science and technology of intelligent machines
  • Work towards long-term ambitious research goals, while identifying intermediate milestones
  • Influence progress of relevant research communities by producing publications
  • Open source high quality code and produce reproducible research
What we offer:
  • bonus
  • equity
  • benefits

Research Scientist Intern, Embodied Foundation Models

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location:
United States, Sunnyvale
Salary:
Not provided
Wayve
Expiration Date
Until further notice
Requirements:
  • Currently pursuing a graduate degree in Computer Science, Machine Learning, Robotics, or related technical field
  • Proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc.)
  • Previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • Solid software engineering fundamentals, especially in Python
  • Previously used PyTorch or a similar library for deep learning (e.g. TensorFlow, JAX)
  • Experience with multi-node distributed training of large models
  • Interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • Previous publications in conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Job Responsibility:
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead high-impact research work and publish at a top-tier conference

Research Scientist Intern, Embodied Foundation Models (Evaluation)

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location:
United States, Sunnyvale
Salary:
Not provided
Wayve
Expiration Date
Until further notice
Requirements:
  • You are currently pursuing a graduate degree in Computer Science, Machine Learning, Robotics, or related technical field
  • You are proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc.)
  • You have previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • You have solid software engineering fundamentals, especially in Python
  • You have previously used PyTorch or a similar library for deep learning (e.g. TensorFlow, JAX)
  • You have experience with multi-node distributed training of large models
  • You are interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • You have previous publications at conferences such as CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, or ICRA
Job Responsibility:
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead high-impact research work and publish at a top-tier conference (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)

Research Scientist / Engineer – Realtime Interactive

At Luma, the Realtime Interactive team is responsible for building an entirely n...
Location:
United States, Palo Alto
Salary:
187500.00 - 395000.00 USD / Year
Luma AI
Expiration Date
Until further notice
Requirements:
  • Experience with fine-tuning large-scale generative models
  • Proficiency in PyTorch and distributed training frameworks
  • (Preferred) Strong background in methods for optimizing model inference (distillation, quantization, sparsity, compression, etc.)
  • (Preferred) Experience in gathering, processing, and annotating datasets
Job Responsibility:
  • Work on top of pretrained multimodal generative models to fine-tune and optimize them for realtime generation
  • Design novel algorithms and techniques to solve problems with autoregressive visual generation, long-range temporal consistency, and long-term memory
  • Develop interactive applications with tight latency constraints
  • Process data to develop advanced interactive capabilities and controls for World Modeling, such as controlling character and camera movement, audio, and more
Employment Type:
Full-time