Research Scientist: Pretraining

Generalist AI

Location:
United States, San Mateo

Contract Type:
Not provided

Salary:

200000.00 - 350000.00 USD / Year

Job Description:

You will build the base intelligence layer for robotics. We train large-scale robot foundation models from massive multimodal datasets spanning video, proprioception, action traces, language, and more. You will design and run the core large-scale training efforts that give our models fundamentally new general capabilities across embodiments, tasks, and environments. You will “live and breathe” all forms of robot data.

Job Responsibility:

  • Designing and executing large-scale pretraining runs for robot foundation models (transformer- and diffusion-based architectures)
  • Defining model architectures, objectives, and training curricula across multimodal robotic data (vision, action, state, language)
  • Developing scalable data mixtures and sampling strategies across petabyte-scale datasets
  • Guiding data collection operations towards new directions, as well as sourcing new datasets
  • Running ablations to understand scaling laws, data quality effects, and architecture tradeoffs
  • Collaborating closely with ML Infra and Systems to push cluster utilization, throughput, and reliability
  • Turning raw robotic interaction data into generalizable model capabilities

Requirements:

  • Deep experience training large transformer or diffusion models at scale (e.g., generative language, audio, or video models)
  • Have led or significantly contributed to multi-node, multi-GPU distributed training efforts
  • Have worked on scaling laws, optimization dynamics, and large-model failure modes
  • Have strong PyTorch fundamentals and comfort debugging at every layer of the stack
  • Care about both empirical rigor and raw iteration speed
  • Are excited about building general-purpose robot intelligence from first principles
What we offer:

Offers Equity

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work

Similar Jobs for Research Scientist: Pretraining

Sr. Applied Research Scientist

We are building AI to simulate the world through merging art and science. We bel...
Location:
United States
Salary:
280000.00 - 380000.00 USD / Year
Runway
Expiration Date
Until further notice
Requirements
  • 4+ years of relevant ML engineering or research experience
  • Very strong programming skills and ability to write clean and maintainable research code
  • Deep interest in building human-in-the-loop systems for creativity
  • Passion for seeing research through from initial conception to eventual application
  • Experience mentoring and teaching other researchers
  • Strong communication, collaboration, and documentation skills
Job Responsibility
  • Lead efforts in pretraining the next generation of Runway’s multimodal models
Employment Type:
Fulltime

Research Scientist Intern, AI Research - Multimodal Pretraining

Meta is seeking Research Scientist Interns in the multimodal pretraining team in...
Location:
United States, Menlo Park
Salary:
7650.00 - 12134.00 USD / Month
Meta
Expiration Date
Until further notice
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Computer Vision, Artificial Intelligence, or relevant technical field
  • Past projects/publications in the general domain of neural scaling laws, model architectures, image/text modeling, vision-language modeling
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in PyTorch, Triton, or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
  • Perform research to advance the frontiers of multimodal (images, video, text, audio, and other modalities) pretraining, to develop the next generation of multimodal architectures
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development

Research Engineer / Research Scientist - Foundations Retrieval Lead

The Foundations Research team works on high-risk, high-reward ideas that could s...
Location:
United States, San Francisco
Salary:
445000.00 - 555000.00 USD / Year
OpenAI
Expiration Date
Until further notice
Requirements
  • Proven experience leading high-performance teams of researchers or engineers in ML infrastructure or foundational research
  • Deep technical expertise in representation learning, embedding models, or vector retrieval systems
  • Familiarity with transformer-based LLMs and how embedding spaces can interact with language model objectives
  • Research experience in areas such as contrastive learning, supervised or unsupervised embedding learning, or metric learning
  • A track record of building or scaling large machine learning systems, particularly embedding pipelines in production or research contexts
  • A first-principles mindset for challenging assumptions about how retrieval and memory should work for large models
Job Responsibility
  • Lead research into embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning
  • Manage a team of researchers and engineers building end-to-end infrastructure for training, evaluating, and integrating embeddings into frontier models
  • Drive innovation in dense, sparse, and hybrid representation techniques, metric learning, and learning-to-retrieve systems
  • Collaborate closely with Pretraining, Inference, and other Research teams to integrate retrieval throughout the model lifecycle
  • Contribute to OpenAI’s long-term vision of AI systems with memory and knowledge access capabilities rooted in learned representations
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Employment Type:
Fulltime

Junior Research Infrastructure Engineer

We are seeking a Product-Minded Junior Research Infrastructure Engineer to join ...
Location:
United States, Sunnyvale
Salary:
Not provided
Meshy LLC
Expiration Date
Until further notice
Requirements
  • 2+ years of experience in software engineering, backend development, or distributed systems
  • Strong programming skills in Python (Scala/Java/C++ a plus)
  • Familiarity with distributed frameworks (Spark, Dask, Ray) and cloud platforms (AWS/GCP/Azure)
  • Experience with workflow orchestration tools (Temporal, Celery, or Airflow)
  • Proficiency with Infrastructure as Code (Terraform) and CI/CD tools (GitHub Actions)
  • Experience building web applications or internal tools using React or Next.js
  • A 'product-first' mindset: an interest in how users interact with infrastructure and a desire to build clean, functional interfaces
Job Responsibility
  • Participate in the design and implementation of distributed task orchestration systems using Temporal or Celery
  • Architect pipelines across cloud object storage (S3, GCS), data lakes, and metadata catalogs
  • Implement partitioning, sharding, and caching strategies to ensure data processing pipelines are resilient, highly available, and consistent
  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data (images, 3D/2D assets, binaries)
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics
  • Support preprocessing of unstructured assets (e.g., images, 3D/2D models, video) for training pipelines, including format conversion, normalization, augmentation, and metadata extraction
  • Implement validation and quality checks to ensure datasets meet ML training requirements
  • Collaborate with ML researchers to quickly adapt pipelines to evolving pretraining and evaluation needs
  • Use infrastructure-as-code (Terraform, Kubernetes, etc.) to manage scalable and reproducible environments
  • Manage data assets using Databricks Asset Bundles (DABs) and build rigorous CI/CD pipelines (GitHub Actions)
What we offer
  • Competitive salary, equity, and benefits package
  • Opportunity to work with a talented and passionate team at the forefront of AI and 3D technology
  • Flexible work environment, with options for remote and on-site work
  • Opportunities for fast professional growth and development
  • An inclusive culture that values creativity, innovation, and collaboration
  • Unlimited, flexible time off
  • Stock options available for core team members
  • 401(k) plan for employees
  • Comprehensive health, dental, and vision insurance
  • The latest and best office equipment
Employment Type:
Fulltime

Tech Lead - Pretraining Team, Wayve Foundation Model

This is a rare opportunity to lead foundational work at the intersection of larg...
Location:
United States, Sunnyvale
Salary:
Not provided
Wayve
Expiration Date
Until further notice
Requirements
  • Leadership in data-centric AI: Experience leading research or engineering teams focused on dataset curation, filtering, or enrichment at scale, particularly for large-scale model pretraining.
  • Contributions to data benchmarks or tools: Involvement in projects like DataComp, LAION, DINO, MOLMO, or equivalent initiatives that define or evaluate pretraining dataset quality.
  • Deep understanding of distributed data processing: Strong working knowledge of frameworks such as Ray, Spark, Dask, or equivalent, and designing scalable, fault-tolerant data pipelines.
  • Hands-on deep learning expertise: Strong proficiency in PyTorch and a solid grasp of how data quality, distribution, and structure impact training dynamics and model generalisation.
  • Experimental mindset: Demonstrated ability to run and interpret data-centric experiments (e.g., small-scale trials, ablations) to inform large-scale model training.
  • Collaboration with research: Experience working closely with ML researchers and contributing to experimental design, pretraining strategies, or evaluation design.
  • Minimum 5 years of relevant industry experience: Including at least several years in data-heavy, model-driven environments involving deep learning at scale.
Job Responsibility
  • Lead data curation, enrichment, and filtering efforts for large-scale pretraining of embodied models
  • Build and manage distributed data processing and ingestion pipelines across modalities
  • Partner with research teams to run data-centric experiments and influence model training strategy
  • Identify, integrate, and leverage third-party datasets to enhance pretraining and evaluation
  • Manage and mentor a team of engineers and data scientists to deliver scientific and technical impact
What we offer
  • Attractive compensation with salary and equity
  • Immersion in a team of world-class researchers, engineers and entrepreneurs
  • A unique position to shape the future of autonomy and tackle the biggest challenge of our time
  • Bespoke learning and development opportunities
  • Relocation support with visa sponsorship
  • Flexible working hours - we trust you to do your job well, at times that suit you and your team
  • Benefits such as an onsite chef, workplace nursery scheme, private health insurance, therapy, daily yoga, onsite bar, large social budgets, unlimited L&D requests, enhanced parental leave, and more!
Employment Type:
Fulltime

Foundational AI Research Scientist - FAIR

Meta is seeking Research Scientists to join its Fundamental AI Research (FAIR) o...
Location:
United States, Bellevue
Salary:
154000.00 - 217000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • A PhD in AI, computer science, data science, or related technical fields
  • First-authored publications at peer-reviewed conferences, such as ICML, NeurIPS, ICLR, ACL, EMNLP, and other similar venues, reflecting experience in LLM pretraining
  • 1+ years experience holding an industry, postdoctoral, faculty, or government researcher position
  • Research background in machine learning, artificial intelligence, computational statistics, applied mathematics, or related areas
  • Experience in developing and debugging in Python or similar programming languages
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Perform research that enables learning the semantics of data (images, video, text, audio, and other modalities)
  • Perform research to advance the science and technology of intelligent machines
  • Work towards long-term ambitious research goals, while identifying intermediate milestones
  • Influence progress of relevant research communities by producing publications
  • Open source high quality code and produce reproducible research
What we offer
  • bonus
  • equity
  • benefits

Member of Technical Staff, Multimodal Infrastructure

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructu...
Location:
United States, Mountain View
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience in multi-modal data processing: Strong proficiency in distributed data processing infrastructure (resource utilization management, fault tolerance, Ray and Spark) and CPU/GPU batch-processing optimizations
  • Experience with state-of-the-art model inference and serving frameworks
  • Experience with image/video/audio data processing
  • Experience with common data formats for efficient I/O
  • Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron, and DeepSpeed
  • Knowledge of auto-regressive and diffusion transformer models
  • Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
  • Proven experience in at least one of the following areas: image/video generation and editing; efficient architectures (e.g., MoE, window attention)
Job Responsibility
  • Design, develop and maintain large-scale multimodal data processing pipelines
  • Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
  • Design, develop and maintain large-scale multimodal model inference and serving frameworks
  • Work with research scientists and product engineers to solve infra-related problems
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
Employment Type:
Fulltime

Research Scientist Intern, Embodied Foundation Models

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location:
United States, Sunnyvale
Salary:
Not provided
Wayve
Expiration Date
Until further notice
Requirements
  • Currently pursuing a graduate degree in Computer Science, Machine Learning, Robotics, or related technical field
  • Proficient in at least one backend/systems programming language (e.g., Python, Ruby, Java)
  • Previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • Solid software engineering fundamentals, especially in Python
  • Previously used PyTorch or a similar library for deep learning (e.g., TensorFlow, JAX)
  • Experience with multi-node distributed training of large models
  • Interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • Previous publications in conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, COLM, RSS, ICRA, among others)
Job Responsibility
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead high-impact research and publish at a top-tier conference