Staff Research Scientist - RL and Agents

Meta

Location:
Switzerland, Zurich

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We are building a team focused on personal multi-agent systems that will go out into the world and act on our behalf. The team covers a broad range of topics, including cooperative and social LLM agents, multi-agent planning and reasoning, self-improvement and evolutionary techniques, multi-agent reinforcement learning, human-agent collaboration, simulation environments, and tool use.

Job Responsibility:

  • Conduct applied research to advance the state of the art in Language / Multimodal Models and Agentic systems
  • Consistently and sustainably advance the state of the art for your problem, including setting and executing against roadmaps for timeframes of six months or more
  • Collaborate with different cross-functional teams across the globe in research and product

Requirements:

  • PhD in Computer Science or a related field with published projects in the fields of machine learning, deep learning, robotics, large language models and/or computer vision
  • Proven development skills in Deep Learning, working with PyTorch or TensorFlow
  • Experience developing LLM algorithms or infrastructure in Python or C/C++
  • First-authored publications at peer-reviewed conferences, e.g. ICLR, ICML, CVPR, ECCV, ICCV, NeurIPS, SIGGRAPH

Additional Information:

Job Posted:
February 17, 2026

Similar Jobs for Staff Research Scientist - RL and Agents

Member of technical staff - Research - Agent

About H: H exists to push the boundaries of superintelligence with agentic AI. B...
Location:
France; United Kingdom, Paris; London
Salary:
Not provided
H Company
Expiration Date:
Until further notice
Requirements:
  • Senior Experience: Previous demonstrable role(s) as a Staff, Principal, or Senior Engineer (or equivalent Research Scientist) in a Frontier AI Lab with a proven track record of leading complex, end-to-end AI/ML projects from conception to production
  • Education / Publication: Preferably PhD (or equivalent research experience) in Machine Learning, Computer Science, or a related field, preferably with a strong publication record (e.g., NeurIPS, ICML, ICLR) in Computer Science
  • Core Expertise: Deep theoretical and practical expertise in Agentic AI and proven experience building, scaling, and shipping solutions involving foundation models (LLMs/VLMs)
  • Soft Skills: Collaborative: Enjoys collaboration and thrives in a teamwork-oriented, fast-paced research environment
  • High-Impact Communicator: Possesses impactful communication skills, with the ability to bridge the gap between research and engineering and articulate complex ideas clearly
  • Mission-Driven: Genuinely eager to explore and solve the new engineering and research challenges at the frontier of agentic AI
Job Responsibility:
  • Research & Leadership: Design and develop new agents, proposing new research directions, e.g., combining state-of-the-art RL with foundation models (LLMs/VLMs)
  • Algorithm & Systems Design: Design, implement, and scale complex, high-performance systems for training large-scale agents. This includes both the foundational infrastructure and the novel algorithms, reward models, and sophisticated training environments
  • Research-to-Production: Collaborate closely with researchers and engineers to implement, test, and productionize new agent logics, learning algorithms, and system architectures
  • Evaluation & Reliability: Create, manage, and scale massive benchmarks and evaluation systems to rigorously track agent capabilities. You will own system reliability, scalability, and observability for our entire research infrastructure
  • Mentorship & Standards: Mentor and guide other engineers and researchers on the team, fostering technical excellence. You will establish and enforce engineering standards, tooling, and best practices for both code and research design
  • Innovation: Conduct thorough code and design reviews, champion technical innovation, and proactively address technical debt to accelerate the R&D lifecycle
What we offer:
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic, and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime

Staff / Principal Research Scientist

Inworld is a product-oriented research lab of top AI researchers and engineers, ...
Location:
United Kingdom
Salary:
140000.00 - 200000.00 GBP / Year
Inworld AI
Expiration Date:
Until further notice
Requirements:
  • Foundation models: training, new architectures, RL, reward modeling, scaling
  • Evaluation: benchmarks, eval loops, quality measurement, LLM-as-judge, failure analysis
  • Frontier topics: multimodal models, agents, tool use, test-time compute, world models
  • Published research at ICML, ICLR, NeurIPS, EMNLP, ACL, or AAAI
  • PhD in ML/NLP — or equivalent practical experience you can point to
  • Public work: non-trivial AI side projects, interdisciplinary experiments, open-source contributions
  • Full-stack research ownership: you frame the question, run the experiments, write the paper, ship the result
What we offer:
  • equity and benefits
  • Fulltime

Member of Technical Staff, Integration/RL Team (Research Engineer)

The integration team is responsible for developing and scaling machine learning ...
Location:
Salary:
Not provided
Cohere
Expiration Date:
Until further notice
Requirements:
  • Extremely strong software engineering skills
  • Values test-driven development and clean code, and strives to reduce technical debt at all levels
  • Proficiency in Python and related ML frameworks such as JAX, PyTorch and/or XLA/MLIR
  • Experience using and debugging large-scale distributed training strategies (memory/speed profiling)
  • [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray)
  • [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance
  • [Bonus] Experience in ML, LLM and RL academic research
Job Responsibility:
  • Design and write high-performing and scalable software for training models
  • Develop new tools to support and accelerate research and LLM training
  • Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem
  • Craft and implement techniques to improve performance and speed up our training cycles across the SFT, offline preference, and RL regimes
  • Research, implement, and experiment with ideas on our cluster and data infrastructure
  • Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
What we offer:
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime

Aerial Groundman

Groundman is an entry-level position in the line of progression for Telecommunic...
Location:
United States, Clarksburg
Salary:
Not provided
Southeast Utilities of Georgia LLC
Expiration Date:
Until further notice
Requirements:
  • A positive can-do attitude and openness to trying things in new ways
  • GRIT values – Guts, Reliability, Innovation, and Teamwork
  • High school diploma or equivalent
  • Ability to take direction and instruction from supervisors
  • Ability to work well with others in a team environment
  • Ability to communicate clearly using radios, verbal signals, and hand signals
  • Must be able to lift and carry materials of all shapes and sizes, weighing 40 to 90 pounds
  • Must be able to stand or walk for long periods of time
  • Must be able to stoop, kneel, crouch and crawl while performing work
  • Must be able to ascend and descend ladders
Job Responsibility:
  • Assists Lineman in loading, unloading, layout and preparing tools/supplies/equipment/materials needed at each job site
  • Performs manual labor type tasks as directed by the Lineman, Foreman, or Supervisor such as cleaning truck, digging ditches, tamping, cutting brush and moving materials
  • Assists with Traffic Control and Safe Operations
  • Handles physically demanding construction duties
  • Drives a company-provided vehicle from the yard to the job site and back
  • Performs other duties as assigned
What we offer:
  • Medical, Dental & Vision Benefits
  • 401(k) Program with a 4% company match
  • Free Wellness Resources & Marketplace Discounts
  • Paid Maternity & Parental Leave
  • Paid Basic Life Insurance & Voluntary Options
  • Fulltime

Runner

Join one of the UK's leading hospitality businesses as a Runner. Create unforget...
Location:
United Kingdom, Bristol
Salary:
12.71 - 14.69 GBP / Hour
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • Great people skills
  • Excellent organisational and multitasking abilities
  • Ability to maintain a positive attitude in a fast-paced airport environment
  • Flexibility in hours; airport shifts can start at 3am
  • Ability to provide a 5-year work/education/personal reference history
  • Ability to undertake a criminal record check
Job Responsibility:
  • Create unforgettable guest experiences through delivering warm welcomes and being a supportive team player
What we offer:
  • Free meals on shift
  • Up to 30% discount at all our brands with no limit on number of guests
  • Duty free discounts excluding alcohol and cigarettes
  • Access to a great discount platform, saving you money on everyday purchases
  • Wagestream platform to access your wages as they are earned
  • Superb training and development, apprenticeships open to all
  • Fulltime

Fiber Splicing Technician

The Fiber Splicing Technician splices, tests, troubleshoots, and repairs fiber o...
Location:
United States, Jacksonville, FL
Salary:
Not provided
Southeast Utilities of Georgia LLC
Expiration Date:
Until further notice
Requirements:
  • Ability to do physical labor, climb ladders and poles, work in confined spaces, etc.
  • Ability to read blueprints for job specifications in placement of cables, location of utilities, etc.
  • Must be able to distinguish between different colors for connection of fibers
  • Must be a self-starter and internally motivated to achieve corporate, department, and personal objectives
  • Must have excellent interpersonal communication skills (oral and written) and be a team player
  • Must possess excellent problem solving and decision-making skills
  • Ability to use basic math skills to compute measurements, figure ohms, timecards, etc.
  • Ability to listen and to follow directions
  • Must be organized, able to multitask, and maintain a neat and safe work environment
  • Ability to work in high places on ladders and poles
Job Responsibility:
  • End-of-Line Network Testing: Testing light levels at the end of fiber networks for balanced signal between points is crucial to ensure proper signal transmission and network performance
  • Cable Prep: Ensure that cables are properly labeled and organized to minimize errors during installation
  • Build Fiber Cases and Enclosures: Constructing fiber cases and enclosures involves creating protective housings for fiber optic equipment and connections
  • Splice Color for Color Fibers: When splicing fiber optic cables together, technicians match the colors of the fibers to ensure that the correct fibers are connected
  • Cable Management: Organize the cables neatly within the enclosure, ensuring that they are properly routed and secured to prevent tangling or damage
  • Sealing and Placement of Case: Ensuring that the enclosure is properly closed and secured to protect the equipment and cables inside from environmental elements
  • Read Prints and Maps: Examine Splice Matrix and redline maps to understand fiber routes and assignments
  • Splice off color and multiple cables in one enclosure
  • Jumping Trays: Routing fibers from one tray to another, possibly to optimize cable management, minimize signal loss, or create efficient pathways within the enclosure
  • Splice Tie Point and Jumper Cases: Splicing fibers at a tie point involves connecting fibers from different cables

Director, IT PMO, Greater China

Lead the execution of the Continent's IT Operations and Technology & Digital str...
Location:
China, Shanghai
Salary:
Not provided
Marriott Bonvoy
Expiration Date:
Until further notice
Requirements:
  • 4-year degree from an accredited university in Information Technology, Hotel Management, or a related major
  • 3 years of project management experience
  • 3 years of experience in a similar role, with exposure to diverse cultures and values across the Greater China Region
  • Proven negotiation skills
  • Hotel operations experience
Job Responsibility:
  • Lead the execution of the Continent's IT Operations and Technology & Digital strategy
  • Ensure effective communication and systematic tracking of initiative progress
  • Oversee departmental delivery activities
  • Actively manage complex or delayed projects
  • Responsible for key IT communication efforts, including conferences, events, and other marketing initiatives
  • Serve as the technology interface for cross-disciplinary initiatives, providing technical evaluation and recommendations
  • Foster a culture of MI and take responsibility for Associate Engagement within the IT Departments
  • Drive talent-related initiatives, particularly learning and development, along with talent resourcing efforts for the APIT field team in program development and rollout
  • Accountable for delivering results aligned with the balanced scorecard and established annual goals
  • Responsible for strategic planning, execution, and communication of continental events
  • Fulltime

Floor & Bar Team Member

At Buzzworks, we’re more than just a hospitality group; we’re a community. With ...
Location:
United Kingdom, North Berwick
Salary:
8.00 - 12.71 GBP / Hour
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • Passionate about great food and great people
  • Love to meet new people
  • Ability to work as part of a team in a fast-paced environment
Job Responsibility:
  • Creating memorable moments
  • Learn skills for future success
What we offer:
  • Flexible working – shifts that work around your life
  • Training & Development
  • Stream App – access to pay, retail discounts and great savings
  • 40% staff discount across all Buzzworks venues
  • Internal progression opportunities
  • Extra holidays after 1, 3, and 5 years' service
  • Wellbeing support – Employee Assistance, wellness hub, discounted gym