CrawlJobs Logo

Research Engineer in Reinforcement Learning

United Kingdom, London 70000.00 - 90000.00 GBP / Year · Job Posted March 22, 2026
Apply Position
Job Link Share

Job Description

The Applied AI team at InstaDeep creates optimization solutions for large scale real-world industrial problems. Each solution is built to be ready for full production use by our clients and it is the Applied AI team that owns all stages of this lifecycle: Within Applied AI, Research Engineering builds the AI optimization systems, MLOps sets up the training and inference infrastructure and the Software teams build graphical user interfaces and API servers. These teams work together to provide the end-to-end capabilities required to create scalable and high-performing optimization tools for the end user, all done 100% within InstaDeep. Our work spreads across many industries - from Energy to Logistics to Production Optimization, there are plenty of interesting topics!

Job Responsibility

  • Work closely with our clients to build a deep understanding of the use case and the constraints the operational experts face in their daily operations
  • Translate industrial knowledge into optimization problem statements
  • Work with InstaDeep colleagues to brainstorm, prototype new research ideas and to iterate and improve on existing implementations by running large-scale experiments and deploying high-quality engineering
  • Engage in pre-sales activities and to support our Business Development team with your acquired unique combination of domain knowledge and AI expertise.

Requirements

  • Professional experience in AI Research, Applied AI/ML or Mathematical Optimization
  • Proven experience in software development in Python, ideally in projects with codebases of production-grade quality, multiple contributors and version control
  • Candidates must have the right to work in the UK. Visa sponsorship is not available for this role.
  • Attendance in the office for 3 days a week mandatory, we do not offer remote roles.

Nice to have

  • Professional experience in Reinforcement Learning and / or dealing with combinatorial optimization problems
  • Hands-on experience in high-performance computing environments (Kubernetes, Ray etc.)
  • Domain expertise in the industries of Supply Chain, Manufacturing, Aviation, Energy

What we offer

  • Time to dedicate to personal development, as well as InstaDeep-provided education opportunities
  • Long-term incentive stock plans
  • Private Health insurance
  • Monthly Gym allowance

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Engineer in Reinforcement Learning

8 matching positions

Research Engineer - Reinforcement Learning

Building Open Superintelligence Infrastructure. Prime Intellect is building the ...
Location
Location
United States , San Francisco
Salary
Salary:
Not provided
Prime Intellect
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for the inference or training of large-scale AI models
  • Deep expertise in distributed inference techniques and frameworks (e.g. vllm, sglang) for optimizing the performance and scalability of AI workloads
  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines
  • Passion for advancing the state-of-the-art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide
Job Responsibility
Job Responsibility
  • Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
  • Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques
  • Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL frameworks
  • Publish research in top-tier AI conferences such as ICML & NeurIPS
  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers
  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform's capabilities and user experience
What we offer
What we offer
  • Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect
  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco
  • Visa sponsorship and relocation assistance for international candidates
  • Quarterly team off-sites, hackathons, conferences and learning opportunities
  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI
  • Fulltime
Read More
Arrow Right

AI Research Engineer, Reinforcement Learning

As a Research Engineer focused on Reinforcement Learning, you will be responsibl...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming experience in Python and/or C++ with familiarity using build tools such as Bazel
  • Proficiency with PyTorch
  • Hands-on experience with simulation platforms like Isaac Sim or MuJoCo
  • Experience training reinforcement learning policies, especially for manipulation or locomotion
  • Ability to collaborate cross-functionally with hardware, control, data, and QA teams
  • Demonstrated experience addressing the sim-to-real gap
Job Responsibility
Job Responsibility
  • Own the full stack of engineering tasks, from data engineering and model architecture to product deployment
  • Train NEO on a variety of manipulation and locomotion tasks
  • Collaborate with hardware teams to bridge the sim-to-real gap for policies trained in simulation
  • Partner with controls, QA, and data collection teams to ship RL policies to production
  • Deploy reinforcement learning-trained skills into real-world home environments
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Equity
  • Fulltime
Read More
Arrow Right

Research Engineer, Reinforcement Learning

As a Research Engineer specializing in Reinforcement Learning, you will be respo...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming experience in Python and/or C++
  • Proficiency with PyTorch
  • Hands-on experience with simulation platforms like Isaac Sim or MuJoCo
  • Experience training reinforcement learning policies, particularly for manipulation or locomotion
  • Ability to collaborate cross-functionally with hardware, control, data, and QA teams
  • Demonstrated experience addressing the sim-to-real gap
Job Responsibility
Job Responsibility
  • Own the full stack of engineering tasks: from data engineering and model architecture to delivering polished products
  • Train NEO on a wide variety of manipulation and locomotion tasks
  • Collaborate with hardware teams to bridge the sim-to-real gap for policies trained in simulation
  • Partner with controls, quality assurance, and data collection teams to ship RL policies to production
  • Deploy reinforcement learning-trained skills into real-world home environments
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

AI Research Engineer - Reinforcement Learning

At Helsing we deliver AI-based capabilities and the enabling infrastructure that...
Location
Location
Germany , Munich
Salary
Salary:
Not provided
helsing.ai Logo
Helsing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hold MSc in machine learning with a speciality in either reinforcement learning, multi-agent systems, automation and control, or robotics
  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently both internally and externally
  • Are passionate about keeping up-to-date with current research and enjoy reimplementing / extending papers on state-of-the-art Deep Learning-based approaches
  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust, Java, or modern C++, and experience deploying AI software to production including testing, QA, and monitoring
Job Responsibility
Job Responsibility
  • Design, train and deploy agents in complex multi-agent environments
  • Contribute to our reinforcement learning stack by implementing, improving and extending the current state of the art in multi-agent reinforcement learning
  • Be a part of impactful projects and will collaborate with people across several teams and backgrounds to integrate cutting edge ML/AI in our production systems
What we offer
What we offer
  • Competitive compensation and stock options
  • Relocation support
  • Social and education allowances
  • Regular company events and all-hands to bring together employees as one team across Europe
  • A hands-on onboarding program (affectionately labelled “AI-duction”), in which you will be familiarising yourself with our tools and ML pipelines used across the company
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale works with the industry’s leading AI labs to provide high quality data and...
Location
Location
United States , San Francisco; Seattle; New York
Salary
Salary:
252000.00 - 315000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning
  • Excellent written and verbal communication skills
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals
  • Previous experience in a customer facing role
Job Responsibility
Job Responsibility
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities
  • Design and experiment new approaches to preference optimization
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness
  • Publish research findings in top-tier AI conferences
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • commuter stipend
  • Fulltime
Read More
Arrow Right

Senior Reinforcement Learning Engineer

Figure is an AI Robotics company developing a general purpose humanoid. Our Huma...
Location
Location
United States , San Jose
Salary
Salary:
150000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Confident writing production quality code in PyTorch
  • Familiar with online and offline reinforcement learning algorithms: PPO, SAC, etc.
  • Experience tuning hyperparameters and cost functions for these RL algorithms
  • Familiarity with common RL techniques such as: domain randomization, curriculum learning, reward shaping, etc.
  • Familiarity with general ML evaluation tools such as TensorBoard, Weights&Biases, etc.
  • Strong mix of industry and research experience, ideally 5-7+ years of experience
Job Responsibility
Job Responsibility
  • Develop, train, and deploy reinforcement learning algorithms for locomotion and manipulation tasks
  • Build simulation infrastructure to support the training of locomotion and manipulation policies for a general purpose humanoid robot at a large scale
  • Collaborate with the controls team to integrate policies into the existing control stack
  • Define, test, and evaluate performance metrics for learned policies
  • Fulltime
Read More
Arrow Right

Machine Learning Research Engineer - Robotics

Scale’s Robotics business unit is dedicated to solving the data bottleneck in Ph...
Location
Location
United States , San Francisco
Salary
Salary:
218400.00 - 273000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Practical experience building training VLA models and/or building robotics data
  • 3+ years of relevant industry experience in areas relating to: robotics, computer vision, embodied AI, sim-to-real, imitation learning, reinforcement learning, and vision language actions models
  • PhD or equivalent experience in Machine Learning or Robotics
  • A track record of published research in robotics
  • Experience conducting data collection and performing evaluations
  • Strong written and verbal communication skills and the ability to work with cross-functional teams and customers
  • Intellectual curiosity, empathy, and ability to operate with a high degree of autonomy
Job Responsibility
Job Responsibility
  • Collaborate closely with Robotics customers to drive the industry forward in using VLA data
  • Develop ML pipelines to train/fine-tune models using Scale’s data
  • Conduct research on robotics data collection, cross-embodiment training, and policy fine-tuning
  • Develop novel methods for evaluating VLA models, including new robotics industry benchmarks
  • Partner with cross-functional stakeholders and Scale’s customers to improve data collection
  • Collaborate with product teams to bring ML outcomes to Scale’s platform
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • may be eligible for additional benefits such as a commuter stipend
  • Fulltime
Read More
Arrow Right

Ai Research Scientist, Reinforcement Learning

Meta's Fundamental AI Research lab is seeking a Research Scientist to drive foun...
Location
Location
United States , New York
Salary
Salary:
122000.00 - 181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Currently has or is in the process of obtaining a PhD degree in Artificial Intelligence, Computer Vision (3D), Physical AI, Machine Learning, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Research experience in at least one of the following research areas: reinforcement learning, representation learning, self-supervised learning, multimodal learning, robotics policy development, computer vision (3D), egocentric perception, embodied AI and/or LLMs, control theory, optimization algorithms
  • Experience in C/C++ and Python and deep learning frameworks (e.g., PyTorch, TensorFlow)
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Explore and develop novel post-training paradigms for LLMs using reinforcement learning
  • Explore and develop novel LLM post-training recipes using 3D data
  • Integrate large-scale simulation into LLM post-training
  • Explore mechanical, aerospace, civil, and other engineering disciplines and how to enable LLMs to solve key problems in these domains
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right