Member of Technical Staff, Pretraining evaluations

Cohere

Contract Type:
Not provided

Salary:
Not provided

Job Description:

As a Member of Technical Staff in the pretraining evals team, you will play a key role in helping us make modelling decisions based on experimental outcomes for our large language models (LLMs). Your primary focus will be on developing better ways to measure base model progress. This can include implementing new/better evaluations for base model capabilities, finding ways to reduce noise in our current model evaluations, or developing evaluation benchmarks that measure model progress at all model scales, among other directions.

Job Responsibility:

  • Deeply understand each individual evaluation task in our base model evaluation suite, with a clear idea of what each task measures and of its strengths and limitations
  • Suggest and implement improvements to our base model evaluation suite, whether by adding new tasks to measure unmeasured model capabilities or removing redundant or low-signal tasks
  • Improve the statistical understanding of our evals and improve the signal-to-noise ratio of our evaluation suite

Requirements:

  • Familiarity with base model evaluations and how they differ from evaluations of post-trained models
  • Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
  • Ability to convey statistical information effectively to a broad audience using visualizations and easy-to-understand numbers
  • Extremely strong software engineering skills
  • Proficiency in programming languages such as Python and in ML frameworks (e.g., PyTorch, TensorFlow, JAX)
  • Excellent communication skills to collaborate effectively with cross-functional teams and present findings
  • One or more papers at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

What we offer:

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for Member of Technical Staff, Pretraining evaluations

Member of Technical Staff, Evaluations Engineering

Microsoft AI is looking for a Member of Technical Staff, Evaluations Engineer to...
Location:
United States, Mountain View
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience with generative AI
  • Experience with distributed computing
  • Experience in leading technical projects and supporting architectural decisions with data
Job Responsibility:
  • Develop and tune scalable pretraining software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
  • Benchmark GB200 and AMD MIxxx GPU clusters
  • Gather data and insights to develop the pretraining compute roadmap
  • Care deeply about conversational AI and its deployment
  • Actively contribute to the development of AI models that are powering our innovative products
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
Employment Type:
Fulltime

Member of Technical Staff - Pretraining Text Data

We are seeking engineers and researchers to join our Pretraining Text Data team,...
Location:
United Kingdom, London
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.) OR equivalent experience.
  • 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
  • Proficiency in statistics and exploratory data analysis methods.
Job Responsibility:
  • Create high-quality datasets for training and evaluation
  • Run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
  • Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
  • Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.
  • Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
  • Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
  • Embody our culture and values.
Employment Type:
Fulltime

Member of Technical Staff, AI Platform Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location:
United States, Mountain View
Salary:
119800.00 - 234700.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, TypeScript, Python, C, C++, C#, Java OR equivalent experience
  • Bachelor’s degree in computer science, or related technical discipline AND 6+ years technical engineering experience building web services with coding in languages including, but not limited to: Python, Golang, Java/Scala, Rust
  • 6+ years' experience in building and releasing production software at the platform level
  • Deep experience with all of the following languages: Golang, Java/Scala, Typescript (React/Next.js)
  • Experience in model pretraining, post-training, evaluation, and inference
  • Experience using machine learning frameworks, including experience using, deploying, and scaling large language models, either personally or professionally
  • Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders
  • Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Experience going from zero-to-one as well as working with developed systems
Job Responsibility:
  • Design, develop, and maintain platform-level software solutions
  • Collaborate with cross-functional teams to integrate AI capabilities into various products
  • Ensure the reliability, scalability, and performance of platform components
  • Stay updated with the latest advancements in AI and engineering
  • Work alongside the technical staff and AI researchers to improve model development flows
  • Embody our Culture and Values
Employment Type:
Fulltime

Member of Technical Staff - Machine Learning

As a Member of Technical Staff - Machine Learning, you will work to create LLM m...
Location:
United States, Mountain View
Salary:
163000.00 - 296400.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Demonstrated engineering experience or research experience (e.g. creating or leading the creation of a feature in a different company, complex graduate work, research papers, or other experience)
  • Experience prompting, evaluating, and working with large language models
  • Experience writing production-quality Python code
Job Responsibility:
  • Own and pursue a research agenda to improve model capability and performance for agentive application
  • Collaborate closely with the other research and product teams, from pretraining to model hosting to unlock new model capabilities
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across our research stack
Employment Type:
Fulltime

Member of Technical Staff - Machine Learning

As a Member of Technical Staff - Machine Learning, you will work to create LLM m...
Location:
United States, Mountain View
Salary:
100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's degree in Computer Science or related technical field AND 1+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Doctorate in Computer Science, Machine Learning, Human-Centered AI or related field and experience in areas such as finetuning models with supervision or reinforcement learning, understanding and fixing data quality and curation, or working with collaborators on creating new products
  • Experience in machine learning, software engineering
  • Effective communicator and great teammate
  • Takes the initiative, is user-centered and enjoys building world-class AI experiences and products in a fast-paced environment
Job Responsibility:
  • Own and pursue a research agenda to improve model capability and performance for agentive application
  • Collaborate closely with the other research and product teams, from pretraining to model hosting to unlock new model capabilities
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across our research stack
  • Work to create LLM models for general purpose capabilities and for products
  • Develop new methods to train core LLM capabilities (including agentive), collect data, evaluate LLMs, create data flywheels, build tooling for LLM training/evals, write production-quality code, and create new user-facing features
  • Create Reinforcement Learning data, fine-tune or train classifiers, or engineer prompts to create SOTA foundation models and support Microsoft products and the Cloud API
Employment Type:
Fulltime

Member of Technical Staff, AI Post-Training

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...
Location:
Switzerland, Zürich
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Expertise in post-training of AI models
  • Demonstrated experience in large-scale AI
  • Passionate about conversational AI and its deployment
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
  • Proven research track record in a domain-related field, supported by exceptional papers
Job Responsibility:
  • Develop data collection, evaluation, and finetuning methods for models
  • Design hypotheses and experiment plans for rapidly iterating on model performance
  • Prototype new model features and capabilities and collaborate with engineers and researchers across Microsoft AI to make them a reality
  • Collaborate with pretraining and product platform teams to establish good vertical integration and ship models that Copilot users love
  • Embody our culture and values
Employment Type:
Fulltime

Member of Technical Staff, AI Post-Training

At Microsoft AI, we are on a mission to develop the most cutting-edge algorithms...
Location:
United Kingdom, London
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Expertise in post-training of AI models
  • Demonstrated experience in large-scale AI
  • Passionate about conversational AI and its deployment
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
  • Proven research track record in a domain-related field, supported by exceptional papers
Job Responsibility:
  • Develop data collection, evaluation, and finetuning methods for models
  • Design hypotheses and experiment plans for rapidly iterating on model performance
  • Prototype new model features and capabilities and collaborate with engineers and researchers across Microsoft AI to make them a reality
  • Collaborate with pretraining and product platform teams to establish good vertical integration and ship models that Copilot users love
  • Embody our culture and values
Employment Type:
Fulltime

Member of Technical Staff - ML Research Engineer, Data

Our Data team powers Liquid Foundation Models across pre-training, vision, audio...
Location:
United States, San Francisco; Boston
Salary:
Not provided
Liquid AI
Expiration Date:
Until further notice
Requirements:
  • Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
  • Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
  • Track record of learning new technical domains quickly
  • 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Job Responsibility:
  • Build and maintain data processing, filtering, and selection pipelines at scale
  • Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
  • Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
  • Design and run evaluations and ablations to measure each dataset's impact on model performance
  • Monitor public datasets across text, vision, and audio domains
  • Collaborate with pre-training, vision, and audio teams on modality-specific data needs
What we offer:
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
Employment Type:
Fulltime