This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We’re looking for an experienced machine learning researcher / engineer who can help us push the frontiers of agentic LLM systems. As a part of the team, you will help drive exploration and development of agentic techniques and have the opportunity to build the models that power our agentic solutions. Agentic LLM systems are being deployed widely across enterprise companies including through Cohere’s North platform. In this role, you’ll be working with a team developing new strategies for training models for advanced agent capabilities including reasoning, tool use, and memory. This includes developing data-generation techniques for post-training (SFT and RL*) Cohere’s models. Model advancements have direct impacts on North and other Cohere products creating an exciting opportunity where core model development leads to direct product advancements.
Job Responsibility:
Design and develop novel agentic solutions
Improve upon SOTA on hard agentic tasks
Research the next-generation of on-line learning-from-experience self-improvement
Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
Work with an amazing team of researchers and engineers pushing the boundaries
Requirements:
Have a PhD in computer science or related field or similar industry research experience
Strong software engineering skills
Proficiency in Python and experience with ML-related code (e.g., pytorch, numpy, etc.)
Experience with LLMs and agentic frameworks
Experience with post-training LLMs (SFT, PEFT, or RL*)
Experience with building synthetic data generation pipelines
What we offer:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend