CrawlJobs Logo

Research Engineer - Reinforcement Learning

Prime Intellect

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Building Open Superintelligence Infrastructure. Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts. As a Research Engineer in our Reasoning team, you'll play a crucial role in shaping our technological direction, focusing on our test-time compute scaling research ideas. If you love working with synthetic data and teach LLMs reasoning abilities, this role is for you.

Job Responsibility:

  • Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
  • Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques
  • Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL frameworks
  • Publish research in top-tier AI conferences such as ICML & NeurIPS
  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers
  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform's capabilities and user experience

Requirements:

  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for the inference or training of large-scale AI models
  • Deep expertise in distributed inference techniques and frameworks (e.g. vllm, sglang) for optimizing the performance and scalability of AI workloads
  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines
  • Passion for advancing the state-of-the-art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide
What we offer:
  • Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect
  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco
  • Visa sponsorship and relocation assistance for international candidates
  • Quarterly team off-sites, hackathons, conferences and learning opportunities
  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer - Reinforcement Learning

Research Engineer, Reinforcement Learning

As a Research Engineer specializing in Reinforcement Learning, you will be respo...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming experience in Python and/or C++
  • Proficiency with PyTorch
  • Hands-on experience with simulation platforms like Isaac Sim or MuJoCo
  • Experience training reinforcement learning policies, particularly for manipulation or locomotion
  • Ability to collaborate cross-functionally with hardware, control, data, and QA teams
  • Demonstrated experience addressing the sim-to-real gap
Job Responsibility
Job Responsibility
  • Own the full stack of engineering tasks: from data engineering and model architecture to delivering polished products
  • Train NEO on a wide variety of manipulation and locomotion tasks
  • Collaborate with hardware teams to bridge the sim-to-real gap for policies trained in simulation
  • Partner with controls, quality assurance, and data collection teams to ship RL policies to production
  • Deploy reinforcement learning-trained skills into real-world home environments
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

Senior Reinforcement Learning Engineer

Figure is an AI Robotics company developing a general purpose humanoid. Our Huma...
Location
Location
United States , San Jose
Salary
Salary:
150000.00 - 400000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Confident writing production quality code in PyTorch
  • Familiar with online and offline reinforcement learning algorithms: PPO, SAC, etc.
  • Experience tuning hyperparameters and cost functions for these RL algorithms
  • Familiarity with common RL techniques such as: domain randomization, curriculum learning, reward shaping, etc.
  • Familiarity with general ML evaluation tools such as TensorBoard, Weights&Biases, etc.
  • Strong mix of industry and research experience, ideally 5-7+ years of experience
Job Responsibility
Job Responsibility
  • Develop, train, and deploy reinforcement learning algorithms for locomotion and manipulation tasks
  • Build simulation infrastructure to support the training of locomotion and manipulation policies for a general purpose humanoid robot at a large scale
  • Collaborate with the controls team to integrate policies into the existing control stack
  • Define, test, and evaluate performance metrics for learned policies
  • Fulltime
Read More
Arrow Right

AI Research Engineer - Reinforcement Learning

At Helsing we deliver AI-based capabilities and the enabling infrastructure that...
Location
Location
Germany , Munich
Salary
Salary:
Not provided
helsing.ai Logo
Helsing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hold MSc in machine learning with a speciality in either reinforcement learning, multi-agent systems, automation and control, or robotics
  • Have excellent communication skills and the ability to report and present research findings clearly and efficiently both internally and externally
  • Are passionate about keeping up-to-date with current research and enjoy reimplementing / extending papers on state-of-the-art Deep Learning-based approaches
  • Possess solid software engineering skills, writing clean and well-structured code in Python and/or languages like Rust, Java, or modern C++, and experience deploying AI software to production including testing, QA, and monitoring
Job Responsibility
Job Responsibility
  • Design, train and deploy agents in complex multi-agent environments
  • Contribute to our reinforcement learning stack by implementing, improving and extending the current state of the art in multi-agent reinforcement learning
  • Be a part of impactful projects and will collaborate with people across several teams and backgrounds to integrate cutting edge ML/AI in our production systems
What we offer
What we offer
  • Competitive compensation and stock options
  • Relocation support
  • Social and education allowances
  • Regular company events and all-hands to bring together employees as one team across Europe
  • A hands-on onboarding program (affectionately labelled “AI-duction”), in which you will be familiarising yourself with our tools and ML pipelines used across the company
  • Fulltime
Read More
Arrow Right

Applied Research Lead, Reinforcement Learning

We are building AI to simulate the world through merging art and science. We bel...
Location
Location
United States
Salary
Salary:
280000.00 - 380000.00 USD / Year
runwayml.com Logo
Runway
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of relevant engineering or research experience in applying reinforcement learning to align language, image, and/or video generation models
  • Very strong programming skills and ability to write clean and maintainable research code
  • Deep interest in building human-in-the-loop systems for creativity
  • Passion for seeing research through from initial conception to eventual application
  • Experience mentoring and teaching other researchers
  • Strong communication, collaboration, and documentation skills
Job Responsibility
Job Responsibility
  • Lead efforts in applying reinforcement learning based techniques to improve the quality and controllability of the models that power Runway’s research and tools
  • Fulltime
Read More
Arrow Right

Machine Learning Research Associate

The Machine Learning research team at Hewlett Packard Labs seeks highly motivate...
Location
Location
United States , Milpitas
Salary
Salary:
43.27 - 93.15 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Pursuing a Ph.D. degree (with significant research and innovation experience) in a relevant discipline (e.g. machine learning, computer science, electrical engineering, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in machine learning
  • Experience in deep learning, LLM, Agentic AI, and reinforcement learning research
  • Experience in developing deep learning software with high proficiency in data structures and algorithms
  • Experience in Machine Learning frameworks like PyTorch - required
  • Strong programming skills and experience with Python
  • Software development experience in Deep Learning, GPU acceleration, and Model Optimization
  • Demonstrated effective communication and collaboration skills
  • Demonstrated ability for original research papers published in top-tier conferences or journals.
Job Responsibility
Job Responsibility
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Work on cutting-edge machine learning research focusing on Large Language Models, Agentic AI, and Reinforcement Learning
  • Contribute along the full range from initial novel ideas to design, development, implementation, evaluation, and technology transfer
  • Publish in top AI conferences and workshops, including NeurIPS, AAAI, and ICML.
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Post Doctoral Machine Learning Research Scientist

The Core Machine Learning Research team within the Artificial Intelligence Resea...
Location
Location
United States , Milpitas
Salary
Salary:
47.75 - 72.00 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning
  • Extensive experience in deep learning research is required, preferably with Reinforcement Learning and Large Language Models
  • Experience in developing applications with deep learning frameworks like PyTorch with a high software proficiency
Job Responsibility
Job Responsibility
  • LLM and agentic architectures with refinements to enhance trust for complex applications and workflows
  • Multi-agent and multi-objective reinforcement learning for complex physical systems
  • Generative models and Optimization for scientific domains such as inertial confinement fusion
  • Scalable, safe AI systems that push the boundary of what’s possible in applied ML
What we offer
What we offer
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Specific programs catered to helping you achieve career goals
  • Inclusive working environment
  • Collaborations with top-tier research institutions, national labs, and global AI initiatives
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist

This role focuses on cutting-edge research and development in Artificial Intelli...
Location
Location
United States , Milpitas
Salary
Salary:
117500.00 - 270000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning for the dissertation
  • extensive experience in deep learning research, preferably in Large Language Models or Reinforcement Learning
  • experience developing applications with deep learning frameworks like PyTorch with a high software proficiency
  • strong programming skills in Python, data structures, and algorithms are required
  • experience with ML model optimization, GPU acceleration, heterogeneous computation, system software, and performance optimization desired
  • experience in Python Web Frameworks – Django, Flask - a plus but not required.
Job Responsibility
Job Responsibility
  • conducting research, developing solutions, and creating intellectual property in emerging fields like reinforcement learning, LLMs, digital twins, clean energy, data center optimization, and sustainability
  • developing advanced technologies for analysis, optimization, time series forecasting, uncertainty quantification, and control
  • providing thought leadership, collaborating internally and externally, and contributing to HPE’s strategy by identifying emerging technologies
  • publishing in top conferences like NeurIPS, AAAI, and ACL
  • developing patent applications
  • software development, GPU acceleration, model optimization, and real-time data streaming to create robust AI solutions for real-world use cases.
What we offer
What we offer
  • a competitive salary and extensive social benefits
  • diverse and dynamic work environment
  • work-life balance and support for career development
  • health and wellbeing programs
  • personal and professional development programs
  • diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right

Reinforcement learning intern

As a Reinforcement Learning Intern, you will help develop and implement learning...
Location
Location
France , Paris
Salary
Salary:
Not provided
enchanted.tools Logo
Enchanted Tools
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BSc holder in Robotics, Engineering, Computer Science, or related field
  • Coursework or project experience in reinforcement learning or learning-based control
  • Strong Python skills and knowledge of a deep learning framework PyTorch, JAX, or TensorFlow
  • Familiarity with simulation environments such as Isaac Sim, Mujoco, or Gazebo
  • Solid analytical and problem-solving abilities
Job Responsibility
Job Responsibility
  • Develop, debug, and test reinforcement learning algorithms for locomotion and navigation on a dynamically balancing base
  • Extend simulation environments (Isaac Sim / Isaac Lab) to support training and evaluation of RL policies
  • Integrate trained policies into the Mirokai software stack and validate them on physical robots
  • Analyze performance, stability, and sim-to-real transfer aspects
  • Stay up to date with recent research in reinforcement learning for robotics
Read More
Arrow Right