AI Research Engineer, Scaling

United States, Palo Alto · 180000.00 - 300000.00 USD / Year · Job Posted December 14, 2025

Job Description

As a Research Engineer focused on Scaling, you will design and build robust infrastructure to support large-scale training, evaluation, and deployment across 1X’s fleet of robots. You will transform experimental systems into production-grade platforms optimized for throughput, latency, and performance across both datacenter and edge environments. Your work will be pivotal in enabling high-efficiency learning and inference, directly shaping the performance of our general-purpose humanoid robots.

Job Responsibility

  • Own and lead scaling of distributed training and inference systems
  • Ensure compute resources are optimized to make data the primary constraint
  • Enable massive training runs (1000+ GPUs) using robot data, with robust fault tolerance, experiment tracking, and distributed operations
  • Optimize inference throughput for datacenter use cases such as world models and diffusion engines
  • Reduce latency and enhance performance for on-device robot policies using techniques such as quantization, scheduling, and distillation

Requirements

  • Strong programming experience in Python and/or C++
  • Deep intuitive understanding of training and inference speed bottlenecks and scaling laws
  • A mindset aligned with extreme scale: a belief that scale is foundational to enabling humanoid robotics
  • Degree in Computer Science or a related field
  • Experience with distributed training frameworks (e.g., TorchTitan, DeepSpeed, FSDP/ZeRO), multi-node debugging, and experiment management
  • Proven skills in optimizing inference performance using graph compilers, batching/scheduling, and serving systems like TensorRT or equivalents
  • Familiarity with quantization strategies (PTQ, QAT, INT8/FP8) and tools such as TensorRT and bitsandbytes
  • Experience developing or tuning CUDA or Triton kernels with understanding of hardware-level optimization (vectorization, tensor cores, memory hierarchies)

What we offer

  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays

Similar Jobs for AI Research Engineer, Scaling

8 matching positions

AI Research Engineer - Social Products (Technical Leadership)

We're hiring Research Engineers to join teams across Meta working at the interse...
Location: United States, Bellevue
Salary: 219000.00 - 301000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience with large scale model training, implementing algorithms, and evaluating speech-based systems
  • 5+ YOE as an Applied AI Research Scientist or Applied AI Research Engineer
Job Responsibility
  • Contribute to the training of next-generation multimodal foundation models, advance their capabilities in understanding, generation, and grounding, and enable them for downstream product use-cases
  • Support creative data sourcing, high-quality pre/mid/post-training data curation, and scale and optimize data pipelines for multimodal large language models (LLMs)
  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation research, and prioritize research that can be directly applied to Meta's product development
What we offer
  • bonus
  • equity
  • Fulltime

AI Research Engineer, Language

Meta is seeking talented engineers to join our teams in building cutting-edge pr...
Location: United States, Redmond
Salary: 181000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • 2+ years of programming experience in a relevant language OR a PhD + 9 months programming experience in a relevant language
  • Experience building maintainable and testable codebases, including API design and unit testing techniques
  • Experience effectively utilizing AI technologies and tools (e.g., large language models, agents, etc.) to enhance workflows
Job Responsibility
  • Collaborate with cross-functional teams (product, design, operations, infrastructure) to build innovative AI-native application experiences
  • Build and integrate LLM / generative AI capabilities into product surfaces (mobile, web), including prompt engineering, structured prompting, and context management
  • Develop and maintain reusable software components for interfacing with back-end platforms, model serving/inference layers, and AI toolchains
  • Implement retrieval-augmented generation (RAG) patterns (e.g., embeddings + retrieval) and contribute to context-aware and personalized user experiences
  • Design/Contribute to agentic workflows and leverage AI tools and agents (including human-in-the-loop / expert-in-the-loop designs) to automate tasks and scale impact
  • Analyze, debug, and optimize code and systems for quality, efficiency, performance, reliability, and cost
  • Establish effective quality practices for AI features, including evaluation/QA for AI outputs, monitoring, and iterative improvement via feedback loops
  • Architect efficient and scalable systems that power complex applications and AI-enabled features, identify and resolve performance and scalability issues
  • Drive end-to-end execution of medium-to-large features with increasing independence, contribute to technical direction within the team
  • Establish ownership of components, features, or systems with comprehensive end-to-end understanding
What we offer
  • bonus
  • equity
  • benefits

AI Research Engineer

We’re hiring Research Engineers to join teams across Meta working at the interse...
Location: United States, Bellevue, WA +3 locations
Salary: 183997.00 - 257000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience as a formal technical lead, leading major technical initiatives with XFN impact, and/or influencing strategy across multiple teams
  • Impressive engineering background (PhD in ML not required)
  • Experience working in AI/ML environments
  • Can manage data pipelines and versioning
Job Responsibility
  • Contribute to the training of next-generation multimodal foundation models, advance their capabilities in understanding, generation, and grounding, and enable them for downstream product use-cases
  • Support creative data sourcing, high-quality pre/mid/post-training data curation, and scale and optimize data pipelines for multimodal large language models (LLMs)
  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation research, and prioritize research that can be directly applied to Meta’s product development
What we offer
  • bonus
  • equity
  • benefits

AI Research Engineer, Media - Meta Superintelligence Labs

We are seeking AI Researchers to join the Product and Applied Research (PAR) Med...
Location: United States, Menlo Park
Salary: 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 1+ year of industry research experience in LLM/NLP, computer vision, or related AI/ML models
  • Skilled in model training, data, or inference & efficiency for image, video, and/or related multimodal models
  • Proficient in media generation, understanding, and/or grounding
  • Experience owning and/or driving complex technical projects from end-to-end
  • Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark
Job Responsibility
  • Contribute to the training of next-generation multimodal foundation models, advance their capabilities in understanding, generation, and grounding, and enable them for downstream product use-cases
  • Support creative data sourcing, high-quality pre/mid/post-training data curation, and scale and optimize data pipelines for multimodal large language models (LLMs)
  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation research, and prioritize research that can be directly applied to Meta’s product development
What we offer
  • bonus
  • equity
  • benefits

AI Research Engineer

Build the future of offensive security with XBOW. Attackers are already using AI...
Salary: 150000.00 - 350000.00 USD / Year
Company: Xbow
Expiration Date: Until further notice
Requirements
  • Strong experience with building software around LLMs: prompting, agentic orchestration, fault-tolerance, and integration of LLM parts with hard-coded logic
  • Strong software engineering skills: architecting and building production-grade software that runs reliably and can be maintained
  • Experience with TypeScript or proven ability to learn a new programming language quickly
  • Strong skills in structured and independently-driven problem-solving. Able to work with incomplete information and rapidly test hypotheses
  • Comfortable with an energetic environment that mixes the fast-paced agile prioritisation of a startup with the curiosity mentality of a research lab
  • Eager to own projects and jump into the deep end, learning as you go. Curious, adaptable and collaborative
  • MSc or equivalent or higher in computer science, math, physics or machine learning
Job Responsibility
  • Build LLM-powered software that actually works, by designing prompt flows and orchestrations that ensure great performance with no false positives
  • Architect and build an AI-powered software stack that is production-grade, testable and maintainable
  • Design and build experiments and evaluation frameworks for performance testing of the system at scale. Conduct data analysis to draw conclusions
  • Collaborate with the rest of the AI team, with security experts, and both frontend and backend developers to create end-to-end systems that work and that customers love
  • Own projects end-to-end: from basic ideation and experimentation to deployment and production monitoring
  • Continuously conduct research on how to harness the advancements in LLMs to make our system better and faster
What we offer
  • Competitive salary and a generous equity package, making you a true owner of the company
  • Shape your role, lead the function, and grow with the company as we redefine cybersecurity
  • You will tackle technically complex challenges and play a pivotal role in the growth of our business, working alongside an amazing team and some of the world’s experts to shape how AI transforms cybersecurity
  • Fulltime

Research Engineer, AI for Science

OpenAI for Science is building the next great scientific instrument: an AI-power...
Location: United States, San Francisco
Salary: 295000.00 - 445000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Requirements
  • Have strong programming skills and enjoy building reliable, high-performance systems
  • Comfortable working in large distributed systems and at significant computational scale
  • Excited about OpenAI’s research direction and motivated by the real-world impact of AI
Job Responsibility
  • Design, implement, and improve large-scale distributed machine learning systems
  • Write robust, high-quality machine learning code and contribute to performance-critical components
  • Collaborate closely with researchers to translate ideas into scalable, production-ready systems
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime

AI Research Engineer, Enterprise Evaluations

Scale AI is seeking a technically rigorous and driven AI Research Engineer to jo...
Location: United States, San Francisco; New York
Salary: 179400.00 - 224250.00 USD / Year
Company: Scale
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Electrical Engineering, a related field, or equivalent practical experience
  • 2+ years of experience in Machine Learning or Applied Research, focused on applied ML systems or evaluation infrastructure
  • Hands-on experience with Large Language Models (LLMs) and Generative AI in professional or research environments
  • Strong understanding of frontier model evaluation methodologies and the current research landscape
  • Proficiency in Python and major ML frameworks (e.g., PyTorch, TensorFlow)
  • Solid engineering and statistical analysis foundation, with experience developing data-driven methods for assessing model quality
Job Responsibility
  • Partner with Scale’s Operations team and enterprise customers to translate ambiguity into structured evaluation data, guiding the creation and maintenance of gold-standard human-rated datasets and expert rubrics that anchor AI evaluation systems
  • Analyze feedback and collected data to identify patterns, refine evaluation frameworks, and establish iterative improvement loops that enhance the quality and relevance of human-curated assessments
  • Design, research, and develop LLM-as-a-Judge autorater frameworks and AI-assisted evaluation systems. This includes creating models that critique, grade, and explain agent outputs (e.g., RLAIF, model-judging-model setups), along with scalable evaluation pipelines and diagnostic tools
  • Pursue research initiatives that explore new methodologies for automatically analyzing, evaluating, and improving the behavior of enterprise agents, pushing the boundaries of how AI systems are assessed and optimized in real-world contexts
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity grant
  • Fulltime

AI Research Engineer - ML Engineering

At Helsing, we are pioneering the future of autonomous decision-making for defen...
Location: Germany; United Kingdom, Berlin; Munich; London
Salary: Not provided
Company: Helsing
Expiration Date: Until further notice
Requirements
  • Hold an MSc or PhD in Computer Science or STEM field, with a focus on Machine Learning and Deep Learning
  • Have strong software engineering skills in Python and fluency with modern DL frameworks (PyTorch/JAX/TensorFlow)
  • Are a clear communicator who can build from complex theoretical concepts and contribute to the company's internal engineering culture
  • Have a "first-principles" mindset
  • Have debugged production ML pipelines
Job Responsibility
  • Extend our highly integrated deep learning frameworks (built on top of PyTorch), making them efficient and easy to use for a wide range of use cases
  • Scale our current infrastructure and tooling stack to support faster and larger distributed training
  • Design data strategy to support large scale datasets and efficient storage, ensuring GPUs stay warm
What we offer
  • Competitive compensation and stock options
  • Relocation support
  • Social and education allowances
  • Regular company events and all-hands
  • A hands-on onboarding program
  • Fulltime