Research Scientist / Engineer — Foundation Model Job at Luma AI (Palo Alto)

Research Scientist / Engineer – Foundation Model: Core Research

This is a rare and foundational opportunity to define the future of multimodal A...

Location

United States , Palo Alto

Salary:

250000.00 - 450000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

A Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Physics, or Mathematics is essential
A 'first-principles' intuition for scaling
Fluent in the language of frontier AI
Proven ability to design and rigorously analyze experiments and to articulate complex technical concepts effectively
Practical experience with distributed or high-performance computing environments, particularly managing and optimizing training runs on large-scale GPU clusters

Job Responsibility

Unified Modeling & Efficiency Drive the core research that powers all of Luma's products — co-designing multimodal representations, advancing core algorithms for long-context training, and establishing rigorous scaling laws to predict performance across compute budgets
Alignment & Evaluation Close the gap between training loss and user experience. Develop proxy tasks and automated metrics that serve as the compass for research decisions — ensuring our models optimize for what actually matters to users, not just benchmarks
Research Infrastructure Build the engine for high-velocity research. Maintain production-research parity, ensure reproducibility, and design systems for rapid experimentation — so that novel ideas go from hypothesis to validated result as fast as possible

Fulltime

Research Scientist, Foundation Model

You'll be among the first scientists developing an entirely new class of AI mode...

Location

Germany; United States , Freiburg; Berlin; San Francisco; New York

Salary:

Not provided

Prior Labs

Expiration Date

Until further notice

Requirements

PhD in Computer Science, Applied Mathematics, Statistics, Electrical Engineering, or a related field
Deep experience with ML frameworks, especially PyTorch and scikit-learn
Strong engineering fundamentals with excellent Python expertise
Experience in data-science and working with tabular data or time series
Publications at top-tier venues (NeurIPS, ICML, ICLR) or significant open-source contributions

Job Responsibility

Work on fundamental breakthroughs in AI
Shape the future of how organizations worldwide work with their most valuable data
Scaling our transformer architectures from 10K to 1M+ samples while maintaining performance
Building multimodal models that combine text and tabular understanding
Developing specialized architectures for time series, forecasting, and anomaly detection
Creating efficient inference methods for production deployment
Researching causal understanding in foundation models
Designing novel approaches for handling multiple related tables

What we offer

Competitive compensation package with meaningful equity
30 days of paid vacation + public holidays
Comprehensive benefits including healthcare, transportation, and fitness
Work with state-of-the-art ML architecture, substantial compute resources and with a world-class team

Fulltime

Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI

We build simple yet innovative consumer products and developer APIs that shape h...

Location

United States , New York

Salary:

228960.00 - 315360.00 USD / Year

Plaid

Expiration Date

Until further notice

Requirements

Strong applied ML research skills with production delivery experience
Depth in Transformers/LLMs, representation learning, or large-scale model training
Demonstrated ability to ship models to production (not just prototype)
Distributed training experience and strong Python + software engineering fundamentals
Fintech / financial data domain experience is a plus
External publications or open-source contributions is a plus

Job Responsibility

Building a foundation model on one of the world’s richest financial datasets that no one else has
Doing research that ships: moving from experimentation and prototypes to production systems serving real customers
Working across the full ML stack, from pretraining objectives and architectures to serving infrastructure and monitoring
Collaborating with a high-caliber team and seeing your work amplify the capabilities of multiple product teams
Helping hundreds of millions of consumers achieve greater financial freedom through data-driven products

Fulltime

Research Engineer / Research Scientist - Foundations Retrieval Lead

The Foundations Research team works on high-risk, high-reward ideas that could s...

Location

United States , San Francisco

Salary:

445000.00 - 555000.00 USD / Year

OpenAI

Expiration Date

Until further notice

Requirements

Proven experience leading high-performance teams of researchers or engineers in ML infrastructure or foundational research
Deep technical expertise in representation learning, embedding models, or vector retrieval systems
Familiarity with transformer-based LLMs and how embedding spaces can interact with language model objectives
Research experience in areas such as contrastive learning, supervised or unsupervised embedding learning, or metric learning
A track record of building or scaling large machine learning systems, particularly embedding pipelines in production or research contexts
A first-principles mindset for challenging assumptions about how retrieval and memory should work for large models

Job Responsibility

Lead research into embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning
Manage a team of researchers and engineers building end-to-end infrastructure for training, evaluating, and integrating embeddings into frontier models
Drive innovation in dense, sparse, and hybrid representation techniques, metric learning, and learning-to-retrieve systems
Collaborate closely with Pretraining, Inference, and other Research teams to integrate retrieval throughout the model lifecycle
Contribute to OpenAI’s long-term vision of AI systems with memory and knowledge access capabilities rooted in learned representations

What we offer

Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
401(k) retirement plan with employer match
Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
Mental health and wellness support
Employer-paid basic life and disability coverage
Annual learning and development stipend to fuel your professional growth
Daily meals in our offices, and meal delivery credits as eligible

Fulltime

Research Scientist / Engineer — Multimodal Agent

This is a rare and foundational opportunity to define the future of multimodal A...

Location

United States , Palo Alto

Salary:

250000.00 - 450000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

Strong foundation in machine learning, foundation models and agentic systems
Deep understanding of agentic systems and approaches in LLM/VLM reasoning, coding models, LLM/VLM tool calling
Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets)

Job Responsibility

Architect large-scale multimodal agentic models that use reasoning, planning, coding, and tool calling to achieve complex, multi-step multimodal work
Hillclimbing existing tasks and formulating new tasks through data
Design, implement, and run robust data pipelines for constructing, enriching, and filtering massive pixel datasets
Train large-scale multimodal models on massive datasets and GPU clusters
Define and build novel evaluation frameworks to measure multimodal agents

Fulltime

Research Scientist / Engineer — Video / Audio Generation

This is a rare and foundational opportunity to define the future of creative AI....

Location

United States , Palo Alto

Salary:

250000.00 - 450000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

Strong foundation in machine learning and generative modeling, with experience in video, audio, or multimodal domains
Deep understanding of autoregressive, diffusion/flow-based, or hybrid generative models, and their tradeoffs for long-horizon generation
Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets)

Job Responsibility

Architect large-scale video and audio generative models, focusing on strong temporal coherence and high perceptual quality
Design, implement, and run robust data pipelines for curating, filtering, and captioning massive video and audio datasets
Train large-scale video and audio generative models on massive datasets and GPU clusters
Define and build novel evaluation frameworks to measure realism, temporal consistency, controllability, and human-aligned creative quality

Fulltime

Research Scientist / Engineer – Pre-training / Scaling

At Luma, the Pre-Training / Scaling team is responsible for building the core mu...

Location

United States , Palo Alto

Salary:

187500.00 - 395000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

Expertise in Python and PyTorch with experience building ML models from scratch
Deep understanding of multimodal generative models and deep learning architectures
(Preferred) Strong research track record in generative AI with published work in top-tier venues preferred
(Preferred) Experience with large-scale distributed training systems

Job Responsibility

Lead cutting-edge research in multimodal foundation models spanning video, image, text, and audio
Design and implement novel algorithms, architectures, and techniques for large-scale generative AI models
Develop training methodologies for foundation models across thousands of GPUs
Research and implement state-of-the-art techniques in Autoregressive LLMs, Vision Language Models, and / or Diffusion Models
Collaborate with cross-functional teams to transition research into production systems

Fulltime

Research Scientist / Engineer – Training Infrastructure

Luma’s mission is to build multimodal AI to expand human imagination and capabil...

Location

United States , Palo Alto

Salary:

187500.00 - 395000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

Extensive experience with distributed PyTorch training and parallelisms in foundation model training
Deep understanding of GPU clusters, networking, and storage systems
Familiarity with communication libraries (NCCL, MPI) and distributed system optimization

Job Responsibility

Design, implement, and optimize efficient distributed training systems for models with thousands of GPUs
Research and implement advanced parallelization techniques (FSDP, Tensor Parallel, Pipeline Parallel, Expert Parallel)
Build monitoring, visualization, and debugging tools for large-scale training runs
Optimize training stability, convergence, and resource utilization across massive clusters

Fulltime

Select Country

Research Scientist / Engineer — Foundation Model

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?