As a Performance Engineer on the Pre-Training team, you will be responsible for optimizing the performance of our advanced language models and systems. The team's primary focus is improving key model training metrics, such as training throughput, while ensuring high accelerator utilization. It combines expertise in software engineering, machine learning, and low-level kernel design and development to build robust systems and enhance model performance. You will identify and remove performance bottlenecks, develop cutting-edge training and profiling tools that support Cohere's mission of providing efficient and reliable language understanding and generation capabilities, and drive innovation in the field of natural language processing.
Job Responsibilities:
Design and write high-performance, scalable software for training
Understand architectural modifications and design choices and their effects on training throughput and quality
Write low-level CUDA and Triton kernels to squeeze every last bit of performance from our accelerators
Research, implement, and experiment with ideas on our supercompute and data infrastructure
Learn from and work with the best researchers in the field
Requirements:
Extremely strong software engineering skills
Proficiency in Python and related ML frameworks such as JAX, PyTorch, and XLA/MLIR
Experience writing kernels for GPUs using CUDA, Triton, etc.
Experience using large-scale distributed training strategies
Familiarity with autoregressive sequence models, such as Transformers
Nice to have:
Papers published at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)
What we offer:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, with offices in Toronto, New York, San Francisco, London, and Paris, as well as a co-working stipend