CrawlJobs Logo

Staff Research Scientist - RL and Agents

Switzerland, Zurich · Job Posted February 17, 2026
Apply Position
Job Link Share

Job Description

We are building a team focusing on personal multi-agent systems, that will go out in the world and act on our behalf. The team covers a broad range of topics, including cooperative and social LLM agents, multi-agent planning and reasoning, self-improvement and evolutionary techniques, multi-agent reinforcement learning, human-agent collaboration, simulation environments and tool use.

Job Responsibility

  • Conduct applied research to advance the state of the art in Language / Multimodal Models and Agentic systems
  • Consistently and sustainably advance the state of the art for your problem, including setting and executing against roadmaps for 6-month plus timeframes
  • Collaborate with different cross-functional teams across the globe in research and product

Requirements

  • PhD in Computer Science or a related field with published projects in the fields of machine learning, deep learning, robotics, large language models and/or computer vision
  • Proven development skills in Deep Learning, working with PyTorch or TensorFlow
  • Experience developing LLM algorithms or infrastructure in Python or C/C++
  • First-authored publications at peer-reviewed conferences, e.g. ICLR, ICML, CVPR, ECCV, ICCV, NeurIPS, SIGGRAPH

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Research Scientist - RL and Agents

8 matching positions

Member of technical staff - Research - Agent

About H: H exists to push the boundaries of superintelligence with agentic AI. B...
Location
Location
France; United Kingdom , Paris; London
Salary
Salary:
Not provided
hcompany.ai Logo
H Company
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Senior Experience: Previous demonstrable role(s) as a Staff, Principal, or Senior Engineer (or equivalent Research Scientist) in a Frontier AI Lab with a proven track record of leading complex, end-to-end AI/ML projects from conception to production
  • Education / Publication: Preferably PhD (or equivalent research experience) in Machine Learning, Computer Science, or a related field, preferably with a strong publication record (e.g., NeurIPS, ICML, ICLR) in Computer Science
  • Core Expertise: Deep theoretical and practical expertise in Agentic AI and proven experience building, scaling, and shipping solutions involving foundation models (LLMs/VLMs)
  • Soft Skills: Collaborative: Enjoys collaboration and thrives in a teamwork-oriented, fast-paced research environment
  • High-Impact Communicator: Possesses impactful communication skills, with the ability to bridge the gap between research and engineering and articulate complex ideas clearly
  • Mission-Driven: Genuinely eager to explore and solve the new engineering and research challenges at the frontier of agentic AI
Job Responsibility
Job Responsibility
  • Research & Leadership: Design and develop new agents, proposing new research directions, e.g., combining state-of-the-art RL with foundation models (LLMs/VLMs)
  • Algorithm & Systems Design: Design, implement, and scale complex, high-performance systems for training large-scale agents. This includes both the foundational infrastructure and the novel algorithms, reward models, and sophisticated training environments
  • Research-to-Production: Collaborate closely with researchers and engineers to implement, test, and productionize new agent logics, learning algorithms, and system architectures
  • Evaluation & Reliability: Create, manage, and scale massive benchmarks and evaluation systems to rigorously track agent capabilities. You will own system reliability, scalability, and observability for our entire research infrastructure
  • Mentorship & Standards: Mentor and guide other engineers and researchers on the team, fostering technical excellence. You will establish and enforce engineering standards, tooling, and best practices for both code and research design
  • Innovation: Conduct thorough code and design reviews, champion technical innovation, and proactively address technical debt to accelerate the R&D lifecycle
What we offer
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic, and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime
Read More
Arrow Right

Staff / Principal Research Scientist

Inworld is a product-oriented research lab of top AI researchers and engineers, ...
Location
Location
United Kingdom
Salary
Salary:
140000.00 - 200000.00 GBP / Year
inworld.ai Logo
Inworld AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Foundation models: training, new architectures, RL, reward modeling, scaling
  • Evaluation: benchmarks, eval loops, quality measurement, LLM-as-judge, failure analysis
  • Frontier topics: multimodal models, agents, tool use, test-time compute, world models
  • Published research at ICML, ICLR, NeurIPS, EMNLP, ACL, or AAAI
  • PhD in ML/NLP — or equivalent practical experience you can point to
  • Public work: non-trivial AI side projects, interdisciplinary experiments, open-source contributions
  • Full-stack research ownership: you frame the question, run the experiments, write the paper, ship the result
What we offer
What we offer
  • equity and benefits
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Integration/RL Team (Research Engineer)

The integration team is responsible for developing and scaling machine learning ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extremely strong software engineering skills
  • Value test-driven development methods, clean code, and strive to reduce technical debts at all levels
  • Proficiency in Python and related ML frameworks such as JAX, Pytorch and/or XLA/MLIR
  • Experience using and debugging large-scale distributed training strategies (memory/speed profiling)
  • [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray)
  • [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance
  • [Bonus] Experience in ML, LLM and RL academic research
Job Responsibility
Job Responsibility
  • Design and write high-performing and scalable software for training models
  • Develop new tools to support and accelerate research and LLM training
  • Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem
  • Craft and implement techniques to improve performance and speed up our training cycles, both on SFT, offline preference, and the RL regime
  • Research, implement, and experiment with ideas on our cluster and data infrastructure
  • Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

E-Bike Delivery Cyclist (Rider own bike)

Location
Location
United Kingdom , Manchester
Salary
Salary:
8.00 - 12.71 GBP / Hour
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Passion for cycling and delivering great service
  • Own a safe, road-worthy bike and helmet (if required by store)
  • Able to use your own smartphone for delivery app use
  • Friendly, reliable, and able to work independently
  • Able to work flexible hours, including evenings and weekends
  • Right to work relevant to store location
  • Previous experience is a bonus, but not essential
Job Responsibility
Job Responsibility
  • Deliver pizzas and menu items to customers promptly and safely
  • Provide excellent customer service at every doorstep
  • Operate your bike in accordance with road safety regulations and company policies
  • Use the Domino's Delivery App on your smartphone to manage orders
  • Support the in-store team during quieter delivery periods (e.g. cleaning, restocking)
  • Represent Domino's positively in the community
What we offer
What we offer
  • Competitive hourly pay + per-delivery payment + tips
  • 28 days paid holiday per year (includes BH, pro rata for part time)
  • Flexible working hours to suit your lifestyle
  • Staff discount on our delicious food
  • Staff meals (conditions apply)
  • Occasional Business Use insurance provided whilst out riding (store can provide more details)
  • Company pension scheme (where eligible)
  • Family Leave policies in place
  • Paid training and clear career progression pathway with linked pay increases
  • Supportive, inclusive, and fun team environment
  • Parttime
Read More
Arrow Right
New

Public Area Cleaner

An Epic Icon needs an Epic Team! Ayers Rock Resort is searching for a Public Are...
Location
Location
Australia , Yulara
Salary
Salary:
Not provided
voyages.com.au Logo
Voyages Indigenous Tourism
Expiration Date
June 17, 2026
Flip Icon
Requirements
Requirements
  • Excellent attention to detail
  • Ability to work efficiently in a fast-paced environment while maintaining a positive attitude towards guests and colleagues
  • Previous experience in a similar role is a bonus
  • This role requires full Australian driver's license - Preferred manual
  • National Criminal History Check is a mandatory step in the recruitment process
Job Responsibility
Job Responsibility
  • Cleaning assigned Public Areas by following a repeating daily task list
  • Performing Regular Deep cleans on assigned areas
  • Using observation and Initiative to clean Public Areas Items not listed on task list
What we offer
What we offer
  • Discounted accommodation
  • Competitive pay
  • Resort discounts
  • Delicious on-shift meals
  • Relocation assistance payment of up to $700
  • Access to staff pool, gym, and Residents Club
  • Incentive and bonus program accessible after a year of service including rental discounts, $700 vacation bonus, and resort vouchers
  • Fulltime
Read More
Arrow Right
New

B2 Aircraft Mechanic

We are currently seeking a skilled B2 Aircraft Mechanic specialising in Avionics...
Location
Location
United Kingdom , Lasham
Salary
Salary:
44000.00 GBP / Year
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Previous experience working as a B2 aircraft mechanic specialising in avionics
  • Proficient in troubleshooting avionics systems and interpreting technical manuals and schematics
  • Strong knowledge of relevant aviation regulations, including CAA Part-66 or equivalent
  • Excellent communication and teamwork skills, with the ability to work effectively in a collaborative environment
  • Detail-oriented with a commitment to quality and safety
Job Responsibility
Job Responsibility
  • Perform scheduled and unscheduled maintenance tasks on avionics systems, including navigation, communication, and electronic instrumentation
  • Repair or replace faulty components and wiring harnesses to ensure the safe and reliable operation of aircraft systems
  • Install, configure, and calibrate avionics equipment in accordance with manufacturer specifications and regulatory requirements
  • Document maintenance activities accurately and maintain comprehensive records of all work performed
  • Collaborate with other maintenance personnel and support teams to ensure timely completion of tasks and compliance with safety standards
  • Stay current with advancements in avionics technology and participate in training programs as needed
What we offer
What we offer
  • Competitive salary
  • Generous holiday allowance and company pension scheme
  • Opportunities for career development and training
  • Dynamic and supportive work environment with opportunities for advancement
  • Fulltime
Read More
Arrow Right
New

HR Generalist | People & Culture Advisor / Manager

A different kind of People & Culture role — in a place like no other! Ayers Rock...
Location
Location
Australia , Yulara
Salary
Salary:
Not provided
voyages.com.au Logo
Voyages Indigenous Tourism
Expiration Date
June 17, 2026
Flip Icon
Requirements
Requirements
  • Experience working in HR / P&C in Australia
  • Strong generalist experience across the employee lifecycle
  • Sound knowledge of Australian employment obligations and industrial instruments
  • Experience managing employee relations and workplace investigations
  • Confidence coaching and advising people leaders
  • An interest in workforce development, career development and cross-skilling
  • Experience in hospitality, tourism, retail or other operational environments is highly regarded.
Job Responsibility
Job Responsibility
  • Support leaders and teams across diverse operations
  • Exposure across employee relations, performance management, workplace investigations, workforce capability and leader coaching
  • Partner closely with leaders and teams
  • End-to-end responsibility for HR and people matters across areas of responsibility
What we offer
What we offer
  • Discounted accommodation
  • Tax benefits
  • Resort discounts
  • Delicious on-shift meals
  • Casual multi-hire opportunities
  • Relocation assistance payment
  • Access to staff pool, gym, and Residents Club
  • After one year: rental discounts
  • $700 vacation bonus
  • Resort vouchers for work anniversary
  • Fulltime
Read More
Arrow Right
New

Staff Accountant

Robert Half has partnered with a well-respected local company to locate a Staff ...
Location
Location
United States , Auburn
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Accounting, Finance, Business or similar
  • 5+ years General Ledger accounting experience
  • QuickBooks
  • Journal Entries
  • Reconciliation
  • Advanced Excel skills
What we offer
What we offer
  • medical plan
  • vision
  • dental
  • FSA
  • HSA
  • Group Life Insurance
  • 401k with match
  • paid vacation and holidays
  • Fulltime
Read More
Arrow Right