Research Engineer, Reinforcement Learning

United States, Palo Alto · Employment contract · 180000.00 - 250000.00 USD / Year · Job Posted December 01, 2025

Job Description

As a Research Engineer specializing in Reinforcement Learning, you will be responsible for teaching NEO new capabilities using RL algorithms. You'll work across simulation and real-world robots to build robust behaviors and deploy RL-trained skills into home environments. Your work will play a critical role in making our robots safer, more capable, and increasingly versatile.

Job Responsibility

  • Own the full stack of engineering tasks: from data engineering and model architecture to delivering polished products
  • Train NEO on a wide variety of manipulation and locomotion tasks
  • Collaborate with hardware teams to bridge the sim-to-real gap for policies trained in simulation
  • Partner with controls, quality assurance, and data collection teams to ship RL policies to production
  • Deploy reinforcement learning-trained skills into real-world home environments

Requirements

  • Strong programming experience in Python and/or C++
  • Proficiency with PyTorch
  • Hands-on experience with simulation platforms like Isaac Sim or MuJoCo
  • Experience training reinforcement learning policies, particularly for manipulation or locomotion
  • Ability to collaborate cross-functionally with hardware, control, data, and QA teams
  • Demonstrated experience addressing the sim-to-real gap

What we offer

  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays

Similar Jobs for Research Engineer, Reinforcement Learning

8 matching positions

Research Engineer - Reinforcement Learning

Building Open Superintelligence Infrastructure. Prime Intellect is building the ...
Location: United States, San Francisco
Salary: Not provided
Prime Intellect
Expiration Date: Until further notice
Requirements
  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for the inference or training of large-scale AI models
  • Deep expertise in distributed inference techniques and frameworks (e.g. vLLM, SGLang) for optimizing the performance and scalability of AI workloads
  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines
  • Passion for advancing the state-of-the-art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide
Job Responsibility
  • Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
  • Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances in compute and memory optimization techniques
  • Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL frameworks
  • Publish research in top-tier AI conferences such as ICML & NeurIPS
  • Distill highly technical project outcomes into approachable technical blog posts for our customers and developers
  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools and in synthetic data generation research, and proactively identify opportunities to enhance our platform's capabilities and user experience
What we offer
  • Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect
  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco
  • Visa sponsorship and relocation assistance for international candidates
  • Quarterly team off-sites, hackathons, conferences and learning opportunities
  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI
  • Fulltime

AI Research Engineer, Reinforcement Learning

As a Research Engineer focused on Reinforcement Learning, you will be responsibl...
Location: United States, Palo Alto
Salary: 180000.00 - 250000.00 USD / Year
1X Technologies
Expiration Date: Until further notice
Requirements
  • Strong programming experience in Python and/or C++ with familiarity using build tools such as Bazel
  • Proficiency with PyTorch
  • Hands-on experience with simulation platforms like Isaac Sim or MuJoCo
  • Experience training reinforcement learning policies, especially for manipulation or locomotion
  • Ability to collaborate cross-functionally with hardware, control, data, and QA teams
  • Demonstrated experience addressing the sim-to-real gap
Job Responsibility
  • Own the full stack of engineering tasks, from data engineering and model architecture to product deployment
  • Train NEO on a variety of manipulation and locomotion tasks
  • Collaborate with hardware teams to bridge the sim-to-real gap for policies trained in simulation
  • Partner with controls, QA, and data collection teams to ship RL policies to production
  • Deploy reinforcement learning-trained skills into real-world home environments
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Equity
  • Fulltime

AI Research Engineer - Reinforcement Learning

At Helsing we deliver AI-based capabilities and the enabling infrastructure that...
Location: Germany, Munich
Salary: Not provided
Helsing
Expiration Date: Until further notice
Requirements
  • Hold an MSc in machine learning with a speciality in reinforcement learning, multi-agent systems, automation and control, or robotics
  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently both internally and externally
  • Are passionate about keeping up-to-date with current research and enjoy reimplementing / extending papers on state-of-the-art Deep Learning-based approaches
  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust, Java, or modern C++, and experience deploying AI software to production including testing, QA, and monitoring
Job Responsibility
  • Design, train and deploy agents in complex multi-agent environments
  • Contribute to our reinforcement learning stack by implementing, improving and extending the current state of the art in multi-agent reinforcement learning
  • Be a part of impactful projects and collaborate with people across several teams and backgrounds to integrate cutting-edge ML/AI into our production systems
What we offer
  • Competitive compensation and stock options
  • Relocation support
  • Social and education allowances
  • Regular company events and all-hands to bring together employees as one team across Europe
  • A hands-on onboarding program (affectionately labelled “AI-duction”), in which you will be familiarising yourself with our tools and ML pipelines used across the company
  • Fulltime

Research Engineer in Reinforcement Learning

The Applied AI team at InstaDeep creates optimization solutions for large scale ...
Location: United Kingdom, London
Salary: 70000.00 - 90000.00 GBP / Year
InstaDeep
Expiration Date: Until further notice
Requirements
  • Professional experience in AI Research, Applied AI/ML or Mathematical Optimization
  • Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control
  • Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.
  • Attendance in the office 3 days a week is mandatory; we do not offer remote roles.
Job Responsibility
  • Work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations
  • Translate industrial knowledge into optimization problem statements
  • Work with InstaDeep colleagues to brainstorm and prototype new research ideas, and to iterate and improve on existing implementations by running large-scale experiments and deploying high-quality engineering
  • Engage in pre-sales activities and support our Business Development team with your unique combination of domain knowledge and AI expertise.
What we offer
  • Time to dedicate to personal development, as well as InstaDeep-provided education opportunities
  • Long-term incentive stock plans
  • Private Health insurance
  • Monthly Gym allowance
  • Fulltime

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale works with the industry’s leading AI labs to provide high quality data and...
Location: United States, San Francisco; Seattle; New York
Salary: 252000.00 - 315000.00 USD / Year
Scale
Expiration Date: Until further notice
Requirements
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning
  • Excellent written and verbal communication skills
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals
  • Previous experience in a customer facing role
Job Responsibility
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal settings
  • Design and experiment with new approaches to preference optimization
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness
  • Publish research findings in top-tier AI conferences
What we offer
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
  • Equity-based compensation
  • Commuter stipend
  • Fulltime

Senior Reinforcement Learning Engineer

Figure is an AI Robotics company developing a general purpose humanoid. Our Huma...
Location: United States, San Jose
Salary: 150000.00 USD / Year
Figure
Expiration Date: Until further notice
Requirements
  • Confident writing production quality code in PyTorch
  • Familiar with online and offline reinforcement learning algorithms: PPO, SAC, etc.
  • Experience tuning hyperparameters and cost functions for these RL algorithms
  • Familiarity with common RL techniques such as: domain randomization, curriculum learning, reward shaping, etc.
  • Familiarity with general ML evaluation tools such as TensorBoard, Weights&Biases, etc.
  • Strong mix of industry and research experience, ideally 5-7+ years of experience
Job Responsibility
  • Develop, train, and deploy reinforcement learning algorithms for locomotion and manipulation tasks
  • Build simulation infrastructure to support the training of locomotion and manipulation policies for a general purpose humanoid robot at a large scale
  • Collaborate with the controls team to integrate policies into the existing control stack
  • Define, test, and evaluate performance metrics for learned policies
  • Fulltime

Machine Learning Research Engineer - Robotics

Scale’s Robotics business unit is dedicated to solving the data bottleneck in Ph...
Location: United States, San Francisco
Salary: 218400.00 - 273000.00 USD / Year
Scale
Expiration Date: Until further notice
Requirements
  • Practical experience building and training VLA models and/or building robotics data
  • 3+ years of relevant industry experience in areas relating to: robotics, computer vision, embodied AI, sim-to-real, imitation learning, reinforcement learning, and vision-language-action models
  • PhD or equivalent experience in Machine Learning or Robotics
  • A track record of published research in robotics
  • Experience conducting data collection and performing evaluations
  • Strong written and verbal communication skills and the ability to work with cross-functional teams and customers
  • Intellectual curiosity, empathy, and ability to operate with a high degree of autonomy
Job Responsibility
  • Collaborate closely with Robotics customers to drive the industry forward in using VLA data
  • Develop ML pipelines to train/fine-tune models using Scale’s data
  • Conduct research on robotics data collection, cross-embodiment training, and policy fine-tuning
  • Develop novel methods for evaluating VLA models, including new robotics industry benchmarks
  • Partner with cross-functional stakeholders and Scale’s customers to improve data collection
  • Collaborate with product teams to bring ML outcomes to Scale’s platform
What we offer
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
  • Equity-based compensation
  • May be eligible for additional benefits such as a commuter stipend
  • Fulltime

AI Research Scientist, Reinforcement Learning

Meta's Fundamental AI Research lab is seeking a Research Scientist to drive foun...
Location: United States, New York
Salary: 122000.00 - 181000.00 USD / Year
Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Currently has or is in the process of obtaining a PhD degree in Artificial Intelligence, Computer Vision (3D), Physical AI, Machine Learning, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Research experience in at least one of the following research areas: reinforcement learning, representation learning, self-supervised learning, multimodal learning, robotics policy development, computer vision (3D), egocentric perception, embodied AI and/or LLMs, control theory, optimization algorithms
  • Experience in C/C++ and Python and deep learning frameworks (e.g., PyTorch, TensorFlow)
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Explore and develop novel post-training paradigms for LLMs using reinforcement learning
  • Explore and develop novel LLM post-training recipes using 3D data
  • Integrate large-scale simulation into LLM post-training
  • Explore mechanical, aerospace, civil, and other engineering disciplines and how to enable LLMs to solve key problems in these domains
What we offer
  • Bonus
  • Equity
  • Benefits
  • Fulltime