Member of Technical Staff, Synthetic Data

Cohere

Location:

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As a Machine Learning Engineer specializing in synthetic data, you will play a pivotal role in developing the synthetic data pipeline that is crucial to Cohere’s advanced language models. Your responsibilities will encompass the end-to-end management of synthetic data, including maintaining and optimizing the synthetic data pipeline, data analysis and generation, as well as conducting data ablations and model evaluation to gauge data quality. You will work with diverse web data and code data and transform them using generative models to improve token efficiency and model quality. By combining research and engineering, you will bridge the gap between raw data and cutting-edge AI models, directly contributing to improvements in critical training metrics like throughput and accelerator utilization.

Job Responsibility:

  • Design and build scalable inference pipelines that run on large GPU clusters
  • Conduct data ablations to assess data quality and experiment with data mixtures to enhance model performance
  • Research and implement innovative synthetic data curation methods, leveraging Cohere’s infrastructure to drive advancements in natural language processing
  • Collaborate with cross-functional teams, including researchers and engineers, to ensure data pipelines meet the demands of cutting-edge language models

Requirements:

  • Strong software engineering skills, with proficiency in Python and experience building data pipelines
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools
  • Experience working with LLMs through work projects, open-source contributions or personal experimentation
  • Familiarity with LLM inference frameworks such as vLLM and TensorRT
  • Experience working with large-scale datasets, including web data, code data, and multilingual corpora
  • A passion for bridging research and engineering to solve complex data-related challenges in AI model training

Nice to have:

  • Papers at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

What we offer:
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for Member of Technical Staff, Synthetic Data

Member of Technical Staff - ML Research Engineer, Data

Our Data team powers Liquid Foundation Models across pre-training, vision, audio...
Location:
United States, San Francisco; Boston
Salary:
Not provided
Liquid AI
Expiration Date:
Until further notice
Requirements:
  • Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
  • Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
  • Track record of learning new technical domains quickly
  • 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Job Responsibility:
  • Build and maintain data processing, filtering, and selection pipelines at scale
  • Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
  • Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
  • Design and run evaluations and ablations to measure each dataset's impact on model performance
  • Monitor public datasets across text, vision, and audio domains
  • Collaborate with pre-training, vision, and audio teams on modality-specific data needs
What we offer:
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
Employment Type:
Fulltime

Member of Technical Staff - Forward Deployed Engineer

You will work directly on customer engagements that generate revenue. This is ha...
Location:
United States, San Francisco, Boston
Salary:
Not provided
Liquid AI
Expiration Date:
Until further notice
Requirements:
  • Hands-on fine-tuning experience with modern LLMs (last 12-18 months): LoRA, PEFT, DPO, instruction tuning, or similar
  • Strong ML fundamentals: you understand how models learn, fail, and improve
  • Experience generating or curating training data to address model gaps
  • Autonomous coding and debugging skills in Python and PyTorch
  • Proficiency with open-source ML ecosystem (Hugging Face transformers, datasets, accelerate)
  • Fine-tunes models: You have hands-on experience with techniques like LoRA, PEFT, DPO, instruction tuning, or RLHF. You've written training loops, not just API calls
  • Works with modern architectures: Your experience includes models released in the last 12-18 months (Llama 3.x, Mistral, Gemma, Qwen, etc.), not just BERT or classical ML
  • Generates and curates data: You've created synthetic training data to address specific model failure modes. You understand how data quality drives model performance
  • Debugs methodically: When a model underperforms, you diagnose whether it's a data problem, architecture problem, or training problem, and you fix it
  • Ships to customers: You can translate ambiguous customer requirements into concrete technical specs and deliver against quality metrics
Job Responsibility:
  • Fine-tune LFMs on customer data to hit quality and latency targets for on-device and edge deployments
  • Generate and curate training data to address specific model failure modes
  • Run experiments, track metrics, and iterate until customer success criteria are met
  • Translate ambiguous customer requirements into concrete technical specifications
  • Provide analytics to commercial teams for contract structuring and pricing
  • Work across text, vision, and audio modalities as customer needs require
What we offer:
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
Employment Type:
Fulltime

Member of Technical Staff, Next Generation Agents

Agentic LLM systems are being deployed widely across enterprise companies includ...
Location:
Salary:
Not provided
Cohere
Expiration Date:
Until further notice
Requirements:
  • Strong software engineering skills
  • Proficiency in Python and some experience with ML-related code (e.g., PyTorch, NumPy)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility:
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next generation of online learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer:
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
Employment Type:
Fulltime

Member of Technical Staff, Agents Modeling

We’re looking for an experienced machine learning researcher / engineer who can ...
Location:
United States, New York
Salary:
Not provided
Cohere
Expiration Date:
Until further notice
Requirements:
  • Have a PhD in computer science or related field or similar industry research experience
  • Strong software engineering skills
  • Proficiency in Python and experience with ML-related code (e.g., PyTorch, NumPy)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility:
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next generation of online learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer:
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
Employment Type:
Fulltime

Member of Technical Staff - ML Engineer / Scientist (JP Localization)

At Liquid, we’re not just building AI models—we’re redefining the architecture o...
Location:
Japan, Tokyo
Salary:
Not provided
Liquid AI
Expiration Date:
Until further notice
Requirements:
  • Deep understanding of the Japanese model evaluation landscape and familiarity with Japanese pre-training data sources
  • Experience using modeling and inference tools such as Hugging Face inference, vLLM, and cloud APIs
Job Responsibility:
  • Identify, collect, and curate diverse high-quality Japanese text, audio, and multimodal datasets
  • Design methods to synthetically generate or augment Japanese training data when needed
  • Ensure datasets meet enterprise-grade quality, coverage, and compliance requirements
  • Train and fine-tune language and vision models to achieve state-of-the-art performance for Japanese enterprise use cases
  • Adapt existing LFMs for Japanese language, culture, and enterprise-specific workflows
  • Implement evaluation frameworks to benchmark model quality on Japanese datasets
  • Design evaluation datasets and metrics for Japanese enterprise applications
  • Conduct thorough error analysis and iteratively improve model performance
  • Ensure robustness, fairness, and reliability in Japanese-language outputs
What we offer:
  • Hands-on experience with state-of-the-art technology at a leading AI company
  • The opportunity to directly shape foundation model performance in one of the world’s most complex and nuanced languages
  • A collaborative, fast-paced environment where your work drives the next generation of LFMs
Employment Type:
Fulltime

Engineer I, EHS

You, as Engineer I, EHS, will support supervisors by providing tools and advice ...
Location:
Costa Rica, Cartago
Salary:
Not provided
Baxter
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in occupational health and safety, or equivalent experience, or higher education in related areas
  • Minimum of 1 year of experience in related fields
  • Advanced English; bilingual desirable
  • Certification in ISO 14001:2015, ISO 45001:2018 and ISO 50001 standards (desirable)
  • Internal Auditor Certificate (desirable)
Job Responsibility:
  • Promote compliance with quality requirements, Good Manufacturing Practices (GMPs) and Good Documentation Practices (GDP)
  • Promote and participate in ICare's initiatives: if you see something, do something
  • Respond to and resolve in a timely manner the problems that may affect the quality, safety or efficiency of the product or the regulatory consistency of the process
  • Continuously seek regulatory compliance in your process
  • Stay aligned with EHS rules and procedures
  • Ensure the people in your charge follow them
  • Provide essential training sessions on EHS issues
  • Promote a culture of safety and ensure compliance with all guidelines the organization deems vital to meeting the applicable legal and/or corporate requirements of your area, protecting your own health and safety, that of the personnel in your charge, and the environment
  • Build, coordinate and implement the company's Ergonomics program
  • Build, coordinate and implement the company's Environmental program (Management of waste, wastewater, air emissions, drinking water, identification and evaluation of environmental aspects, assessment of relevant regulations, training, among others)

Store Operator

GrainCorp Feeds is a national animal feed business, based in Hamilton NZ, dealin...
Location:
New Zealand, Waharoa
Salary:
Not provided
GrainCorp
Expiration Date:
Until further notice
Requirements:
  • Current forklift licence with F and W endorsement
  • Experience working in a warehousing / logistics role
  • Good communication skills
  • Ability to follow direction
  • Ability to work collaboratively and autonomously
  • Capable of lifting up to 20kg in weight
  • Solid understanding of safety standards
Job Responsibility:
  • Bagging and pallet stacking
  • Moving / loading palletised stock
What we offer:
  • Ongoing training and safety programs
  • Paid Parental leave and birthday leave
  • Employee referral bonus scheme
  • Standard Southern Cross Health Insurance that covers employee, spouse and kids up to the age of 18
  • Family Inclusive Workplace accredited employer, committed to supporting you both on and off the job
Employment Type:
Fulltime

Marketing Manager

We are looking for a dynamic Marketing Manager to spearhead social media and dig...
Location:
United States, Los Angeles
Salary:
Not provided
Robert Half
Expiration Date:
Until further notice
Requirements:
  • 5-7 years of marketing experience with a strong emphasis on social media and digital strategies
  • Proven experience in building and growing brand presence, preferably within the fashion or footwear industries
  • Expertise in content strategy, platform trends, and performance analytics
  • Strategic thinker with the ability to manage day-to-day marketing activities effectively
  • Excellent collaboration skills with a history of working cross-functionally, especially with eCommerce teams
  • Proficiency in email campaigns, digital marketing, and enhancing brand awareness
  • Strong understanding of social media platforms and their role in driving business growth
Job Responsibility:
  • Develop and implement comprehensive social media strategies to establish a strong and consistent brand presence across multiple platforms
  • Collaborate with the eCommerce team to ensure marketing efforts align seamlessly with sales objectives and customer journey goals
  • Plan and oversee content calendars, digital campaigns, and product launches to drive engagement and conversions
  • Lead wholesale marketing strategies, including organizing events at brick-and-mortar locations to boost brand visibility
  • Monitor and analyze campaign performance metrics, optimizing strategies to increase traffic, engagement, and revenue
  • Identify and pursue opportunities for expanding the brand's digital footprint, including partnerships, new platforms, and innovative content formats
  • Refine and maintain brand guidelines to ensure consistent messaging across all marketing channels
  • Work closely with creative teams, influencers, and external partners to produce high-quality content that aligns with the brand identity
What we offer:
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan