Member of Technical Staff, Post-Training Job at Cohere

Job Description

Advance the state of the art for model post training, ship state of the art models to production, and bridge the gap between research and production. We have one of the highest ratio of compute to engineers in the world. We do not delineate strongly between engineering and research. Everyone will contribute to writing production code and supporting our research effort depending on individual interest and organisational needs. We have all the compute, data, and talent available for you to do your best work.

Job Responsibility

Design and write high-performant and scalable software for training models
Consistently post-train the models to reach SOTA level performance
Coordinate with other specialist teams (Agentic, Code…) to produce models that have strong all encompassing performance
Craft and implement techniques to improve the performance and results of our training cycles both on the SFT and the RL regime
Research, implement, and experiment with ideas on our supercompute and data infrastructure
Learn from and work with the best researchers in the field

Requirements

Extremely strong software engineering skills
Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR
Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray)
Experience using large-scale distributed training strategies
Hands on experience on training large model at scale
Hands on experience with the post training phase of model training, with a strong emphasis on performance optimisation

Nice to have

paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Cohere - All Job Offers

Select Country

Member of Technical Staff, Post-Training

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Member of Technical Staff, Post-Training

Member of Technical Staff - Post-Training

Member of Technical Staff - Post-Training

Member of Technical Staff, AI Safety Post-Training

Member of Technical Staff - Post Training - MAI Superintelligence Team

Member of Technical Staff - Post Training, Applied

Member of Technical Staff - Post Training, Reinforcement Learning

Member of Technical Staff - Pre Training - MAI Superintelligence Team

Member of Technical Staff, AI Data - MAI Superintelligence Team

Our AI answers in your language