Agentic LLM systems are being deployed widely across enterprises, including through Cohere’s North platform. The Next Generation Agents team explores new modeling techniques to improve agent capabilities (e.g., deep research, learning from experience, continual learning, and memory). We work in an empirical, research-driven manner to develop production solutions. Much of the work involves pushing beyond the current state of the art in settings where we know this will bring value to our customers. As part of this team, you will help drive the exploration and development of agentic techniques. You will have the opportunity to build the models that power our agentic solutions. This includes developing data-generation techniques for post-training Cohere’s models (SFT and RL*).
Job Responsibilities:
Design and develop novel agentic solutions
Improve upon SOTA on hard agentic tasks
Research the next generation of online learning-from-experience and self-improvement techniques
Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
Work with an amazing team of researchers and engineers pushing the boundaries
Requirements:
Strong software engineering skills
Proficiency in Python and some experience with ML-related code (e.g., PyTorch, NumPy)
Experience with LLMs and agentic frameworks
Experience with post-training LLMs (SFT, PEFT, or RL*)
Experience with building synthetic data generation pipelines
What we offer:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend