Senior Reinforcement Learning Engineer

Figure

Location:
United States, San Jose

Contract Type:
Not provided

Salary:

150000.00 - 400000.00 USD / Year

Job Description:

Figure is an AI robotics company developing a general-purpose humanoid. Our humanoid is designed for corporate tasks, targeting labor shortages and jobs that are undesirable or unsafe. We are based in San Jose, CA and require 5 days/week of in-office collaboration. It’s time to build.

We are looking for a Senior- or Staff-level Reinforcement Learning Engineer. You will own the development, training, and deployment of new reinforcement learning algorithms for our humanoid robot, and build the infrastructure to support training policies at large scale.

Job Responsibility:

  • Develop, train, and deploy reinforcement learning algorithms for locomotion and manipulation tasks
  • Build simulation infrastructure to support large-scale training of locomotion and manipulation policies for a general-purpose humanoid robot
  • Collaborate with the controls team to integrate policies into the existing control stack
  • Define, test, and evaluate performance metrics for learned policies

Requirements:

  • Confident writing production-quality code in PyTorch
  • Familiar with online and offline reinforcement learning algorithms such as PPO and SAC
  • Experience tuning hyperparameters and cost functions for these RL algorithms
  • Familiarity with common RL techniques such as domain randomization, curriculum learning, and reward shaping
  • Familiarity with general ML evaluation tools such as TensorBoard and Weights & Biases
  • Strong mix of industry and research experience, ideally 5-7+ years of experience

Nice to have:

  • Experience transferring policies learned in simulation to robot hardware
  • Experience training locomotion policies for quadrupedal or bipedal robots

Additional Information:

Job Posted:
December 08, 2025

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Senior Reinforcement Learning Engineer

Senior Machine Learning Engineer

Groupon is a marketplace where customers discover new experiences and services e...
Location:
Spain, Madrid; Valencia
Salary:
Not provided
Groupon
Expiration Date
Until further notice
Requirements
  • 5–8+ years hands-on experience building and deploying ML models in production, ideally for recommender, ranking, or personalization systems
  • Expertise in Python (and optionally Java/Scala), ML frameworks (PyTorch, TensorFlow, XGBoost), feature engineering, and data transformation
  • Solid background in cloud (GCP strongly preferred), container orchestration (Docker, Kubernetes), and modern data/feature pipelines
  • Skilled at structuring ambiguous problems and navigating fast-changing priorities—ready to build with minimal legacy constraints
  • Comfortable communicating complex technical concepts clearly in remote team environments (professional English)
Job Responsibility
  • Lead the full ML model lifecycle—feature engineering, model design, training, deployment, monitoring, and ongoing improvement
  • Architect and implement scalable ranking, retrieval, and personalization models using state-of-the-art ML frameworks (e.g., PyTorch, TensorFlow)
  • Build robust, production-ready ML data pipelines and infrastructure (Python, GCP, Docker/Kubernetes)
  • Integrate ML models into high-traffic distributed systems; ensure observability, CI/CD, and real-time performance
  • Collaborate closely with Product and Data Engineering to deeply understand business needs and translate them into measurable user impact
  • Set technical standards and mentor less-experienced colleagues as an emerging ML leader in our scale-up environment
  • Experiment with advanced techniques (embeddings, deep learning, reinforcement learning) and champion an evidence-driven, AI-first culture
What we offer
  • Greenfield Impact: Architect the backbone of Groupon’s revitalized search and recommendations from the ground up—with your work seen by millions
  • AI-First Scale-Up Vibe: Join a driven, supportive team amid exciting transformation—where speed, ambition, and technical influence matter
  • Career Launchpad: Be the ML architect/leader you’ve always wanted to be, with clear pathways to technical or team leadership as we grow
  • Global Collaboration: Work cross-functionally with international colleagues and senior leadership. EMEA time zone overlap preferred for maximum impact

Senior AI and Machine Learning Engineer

We are seeking a Senior AI/ML & Innovation Engineer who will be leading initiative...
Location:
United States, Aguadilla
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 7-10 years’ experience
  • Deep understanding of machine learning algorithms, such as linear regression, decision trees, support vector machines, random forests, deep learning models (e.g., neural networks), and reinforcement learning
  • A strong foundation in mathematics and statistics
  • Proficiency in programming languages such as Python, R, or Java
  • Strong understanding of GitHub Copilot, Cursor, n8n, vibe coding, Windsurf, and similar technologies
  • Experience in cloud infrastructure (AWS, Azure, etc.)
  • Knowledge of open source, Linux, etc.
  • Understanding of DevOps and SRE
  • Advanced knowledge and experience in deep learning
Job Responsibility
  • Conducts research and stays up to date with the latest advancements in AI and machine learning technologies, frameworks, and algorithms
  • Collaborates with cross-functional teams to understand business requirements and design AI and machine learning solutions
  • Develops, implements, and optimizes machine learning models and algorithms
  • Deploys machine learning models into production environments
  • Monitors the performance of deployed models
  • Organizes and leads comprehensive design review sessions
  • Works collaboratively with the engineering manager and team lead to set design and implementation standards
  • Regularly leads meetings
  • Provides technical leadership, mentorship, and guidance to junior team members
  • Develops and delivers strategic presentations and reports to senior stakeholders
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment Type:
Full-time

Senior Principal Machine Learning Engineer - LLM Post-Training and Optimization

Atlassian is seeking a highly skilled and experienced Senior Principal Machine L...
Location:
United States, Mountain View
Salary:
243100.00 - 407200.00 USD / Year
Atlassian
Expiration Date
Until further notice
Requirements
  • Ph.D. or Master’s degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field
  • 8+ years of experience in machine learning, with a focus on large-scale model development and optimization
  • Deep expertise in LLM and transformer architectures (e.g., GPT, BERT, T5)
  • Strong proficiency in Python and ML frameworks such as PyTorch, JAX, or TensorFlow
  • Experience with distributed training techniques and large-scale data processing pipelines
  • Proven track record of deploying machine learning models in production environments
  • Familiarity with model optimization techniques, including quantization, pruning, and knowledge distillation
  • Strong problem-solving skills and ability to work in a fast-paced, collaborative environment
  • Excellent communication skills and ability to translate technical concepts for diverse audiences
Job Responsibility
  • Lead the fine-tuning and post-training optimization of large language models (LLMs) for diverse applications
  • Develop and implement techniques for model compression, quantization, pruning, and knowledge distillation to optimize performance and reduce computational costs
  • Conduct research on advanced techniques in transfer learning, reinforcement learning, and prompt engineering for LLMs
  • Design and execute rigorous benchmarking and evaluation frameworks to assess model performance across multiple dimensions
  • Collaborate with infrastructure teams to optimize LLM deployment pipelines, ensuring scalability and efficiency in production environments
  • Stay at the forefront of advancements in LLM technologies, sharing insights, driving innovation within the team, and leading agile development
  • Mentor other team members, facilitate workshops within and across teams, and foster a culture of technical excellence and continuous learning
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
Employment Type:
Full-time

Associate Director, Reinforcement Learning (ML)

Lead Amgen’s strategy and execution for Reinforcement Learning from Human Feedba...
Location:
United States, Thousand Oaks; Jacksonville
Salary:
Not provided
Amgen
Expiration Date
Until further notice
Requirements
  • Doctorate degree and 3 years of Computer Science, IT or related field experience; or
  • Master’s degree and 5 years of Computer Science, IT or related field experience; or
  • Bachelor’s degree and 7 years of Computer Science, IT or related field experience; or
  • Associate’s degree and 12 years of Computer Science, IT or related field experience; or
  • High school diploma / GED and 14 years of Computer Science, IT or related field experience
  • Deep, hands-on expertise in Reinforcement Learning from Human Feedback (RLHF) and/or advanced reinforcement learning, including reward modeling, policy optimization, exploration strategies, and offline/online evaluation
  • Demonstrated experience deploying RLHF or RL systems into production for real-world applications (e.g., large language models, recommendation systems, decision support tools, or workflow automation), ideally in healthcare, life sciences, or other regulated domains
  • Strong background in modern machine learning and deep learning, with practical experience in Python and frameworks such as PyTorch or TensorFlow, and familiarity with LLM ecosystems and tooling
  • Experience driving sophisticated, cross-functional initiatives, collaborating with non-technical stakeholders (e.g., physicians, scientists, commercial leaders, compliance, legal) and translating needs into impactful AI solutions
  • Strong ability to communicate complex technical topics simply, tailoring content to senior executives and non-technical audiences
Job Responsibility
  • Lead the design and development of RLHF systems including reward modeling, policy optimization, safety and alignment mechanisms, and evaluation frameworks for large language models and other AI systems
  • Drive hands-on technical execution, particularly for high-impact projects, reviewing architectures, experimentation plans, and code, and helping the team navigate scientific and engineering trade-offs
  • Establish best-practice pipelines for human feedback, partnering closely with internal customer teams to define feedback protocols, annotation quality standards, and governance for RLHF data
  • Define and track success metrics for RLHF systems, balancing offline and online evaluation, A/B tests, safety and robustness criteria, and business or scientific outcomes
  • Collaborate across Amgen leaders to ensure RLHF solutions are aligned with strategy, compliant with policy, and integrated into real workflows
  • Partner with Data, Platform and Technology teams to ensure that RLHF workloads are supported by scalable data platforms, model hosting, experimentation infrastructure, and MLOps best practices
  • Champion responsible and compliant AI, working with Legal, Compliance, and Information Security to implement governance around human feedback, data usage, model behavior, transparency, and risk management in a regulated environment
  • Communicate insights and influence senior stakeholders, creating clear narratives, roadmaps, and recommendations that help executives understand RLHF trade-offs, risks, and opportunities
What we offer
  • A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
  • A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
  • Stock-based long-term incentives
  • Award-winning time-off plans
  • Flexible work models where possible

Senior AI Engineer

This role will be tasked with applying machine learning/deep learning to the aut...
Location:
United States, Belmont
Salary:
170000.00 - 210000.00 USD / Year
Volkswagen AG
Expiration Date
Until further notice
Requirements
  • 6-8 years of professional experience (post-graduate degree) preferred
  • 4+ years of deep learning experience (post-graduate degree) preferred
  • Master's degree in Computer Science or equivalent
  • PhD strongly preferred
  • Strong knowledge of different machine learning algorithms
  • Proficiency in deep learning techniques and frameworks
  • Strong understanding of traditional machine learning algorithms and their applications
  • Expertise in computer vision, including object detection, image segmentation, and image recognition
  • Proficiency in NLP techniques, including sentiment analysis, text generation, and language understanding models
  • Experience with multimodal language modeling and applications
Job Responsibility
  • Applying machine learning/deep learning to the automotive industry
  • Maintaining and enhancing existing machine learning modules for autonomous vehicles
  • Designing and implementing new machine-learning-based approaches built on existing frameworks
  • Keeping up to speed with the state of the art of academic research and technology in the industry
  • Coordinating with engineers at the ICC and in Germany on the development of autonomous driving software
  • Transferring technologies and solutions to Volkswagen Group development divisions
  • Developing technical specifications and documentation
  • Representing Volkswagen Group in the technical community, such as at conferences
Employment Type:
Full-time

Senior ML Engineer

As a Senior ML Engineer at Provectus, you'll be responsible for designing, devel...
Location:
Colombia, Medellín; Bogotá; Cali; Barranquilla; Bucaramanga
Salary:
Not provided
Provectus
Expiration Date
Until further notice
Requirements
  • ML Fundamentals: supervised, unsupervised, and reinforcement learning
  • Model Development: feature engineering, model training, evaluation, hyperparameter tuning, and validation
  • ML Frameworks: classical ML libraries, TensorFlow, PyTorch, or similar frameworks
  • Deep Learning: CNNs, RNNs, Transformers
  • LLM Applications: Experience building production LLM-based applications
  • Prompt Engineering: Ability to design effective prompts and chain-of-thought strategies
  • RAG Systems: Experience building retrieval-augmented generation architectures
  • Vector Databases: Familiarity with embedding models and vector search
  • LLM Evaluation: Experience with evaluation metrics and techniques for LLM outputs
  • Python: Advanced proficiency in Python for ML applications
Job Responsibility
  • Design and implement end-to-end ML solutions from experimentation to production
  • Build scalable ML pipelines and infrastructure
  • Optimize model performance, efficiency, and reliability
  • Write clean, maintainable, production-quality code
  • Conduct rigorous experimentation and model evaluation
  • Troubleshoot and resolve complex technical challenges
  • Mentor junior and mid-level ML engineers
  • Conduct code reviews and provide constructive feedback
  • Share knowledge through documentation, presentations, and workshops
  • Collaborate with cross-functional teams (DevOps, Data Engineering, SAs)

Senior AI Engineer

We are seeking a Senior AI Engineer (L4, Individual Contributor) to design, buil...
Location:
India, Chennai
Salary:
Not provided
Arcadia
Expiration Date
Until further notice
Requirements
  • 12+ years of professional software engineering experience
  • 3+ years in AI/ML development
  • Strong expertise in Python, PyTorch/TensorFlow, scikit-learn, and ML tooling (MLflow, LangChain)
  • Proficiency with SQL, cloud services (AWS), containers (Docker, Kubernetes), and distributed systems
  • Understanding of modern AI research (LLMs, diffusion models, transformers)
  • Experience deploying ML models in production with CI/CD
  • Strong analytical skills, ability to balance speed and rigor in experimentation
  • A passion for sustainability and the clean-energy mission
  • Experience building agentic pipelines with the latest models from Anthropic, Google, OpenAI, and more
Job Responsibility
  • Integrate with LLMs and apply expert prompt engineering to get accurate results from the models with minimal hallucination
  • Design and train ML/AI models (forecasting, NLP, graph learning, generative AI) to improve data quality, cost effectiveness, and system scalability
  • Deploy and optimize models for large-scale production workloads using Python-based services in AWS/Kubernetes environments
  • Build robust, automated data pipelines and MLOps workflows for continuous training and deployment
  • Research and experiment with modern AI methods (transformers, foundation models, reinforcement learning) and adapt them to energy-sector challenges, including but not limited to utility statements
  • Drive performance improvements in model accuracy, latency, and cost efficiency
  • Collaborate with Product, SRE, and Analytics teams to deliver AI-enabled features across Arcadia’s platform
  • Write clean, maintainable code, contribute to architecture reviews, and mentor junior engineers
  • Build true agentic workflows with multi-step processing incorporating RAG pipelines and MCPs
What we offer
  • Competitive compensation and employee stock options
  • Hybrid/remote-first working model (India-based role, with global collaboration)
  • Flexible leave policy
  • Comprehensive medical insurance (self + family members)
  • Annual performance cycle + quarterly recognition awards
  • A supportive, diverse engineering culture grounded in empathy, teamwork, and innovation
Employment Type:
Full-time

Senior AI Software Engineer

AnaVation is seeking a Senior Agentic-AI Software engineer to join our team that...
Location:
United States, Chantilly
Salary:
Not provided
AnaVation
Expiration Date
Until further notice
Requirements
  • Active TS/SCI clearance within the last 24 months
  • BA/BS in Computer Science or related field
  • BS + 10 years or MS + 8 years of experience in computer science, AI, machine learning, or related field
  • 5+ years of experience in AI/ML development
  • At least 2 years focused on Agentic AI or autonomous systems
  • Proven track record of deploying production-grade AI systems
  • Strong problem-solving skills
  • Ability to work in a fast-paced, collaborative environment
Job Responsibility
  • Design, develop, and deploy advanced Agentic AI systems that autonomously perform complex tasks, make decisions, and interact with dynamic environments
  • Collaborate with cross-functional teams to deliver scalable, efficient, and ethical AI solutions
  • Architect and implement agentic AI systems capable of autonomous decision-making, task planning, and execution
  • Design and integrate multi-agent systems to solve complex problems
  • Develop and fine-tune large language models (LLMs) and reinforcement learning (RL) models
  • Implement robust APIs and interfaces to integrate AI agents with external systems
  • Optimize AI models for performance, scalability, and low-latency inference
  • Conduct rigorous testing, validation, and monitoring of AI agents
  • Collaborate with product managers, data scientists, and software engineers
  • Stay updated on latest advancements in Agentic AI, LLMs, and RL
What we offer
  • Generous cost sharing for medical insurance for employee and dependents
  • 100% company paid dental insurance for employees and dependents
  • 100% company paid long-term and short-term disability insurance
  • 100% company paid vision insurance for employees and dependents
  • 401k plan with generous match and 100% immediate vesting
  • Competitive Pay
  • Generous paid leave and holiday package
  • Tuition and training reimbursement
  • Life and AD&D Insurance
Employment Type:
Full-time