Staff Research Engineer, Model Efficiency

Cohere

Location:

Contract Type:
Not provided

Salary:

Not provided

Job Description:

Large Language Models (LLMs) continue to push the boundaries of what AI systems can do — but inference is still the bottleneck. The Model Efficiency team is responsible for pushing the limits of LLM inference efficiency across our foundation models. We explore and ship breakthroughs across the model execution stack, including: model architecture and MoE routing optimization; decoding and inference-time algorithm improvements; software/hardware co-design for GPU acceleration; performance optimization without compromising model quality.

Job Responsibility:

Develop, prototype, and deploy techniques that materially improve how fast and efficiently our models run in production.

Requirements:

  • Have a PhD in Machine Learning or a related field
  • Understand LLM architecture, and how to optimize LLM inference given resource constraints
  • Have significant experience with one or more techniques that enhance model efficiency
  • Strong software engineering skills
  • An appetite to work in a fast-paced high-ambiguity start-up environment
  • Publications at top-tier conferences and venues (ICLR, ACL, NeurIPS)
  • Passion to mentor others

What we offer:

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for Staff Research Engineer, Model Efficiency

Sr. Staff Thermal Design and Modeling Engineer

The Enphase Energy Storage and Systems Innovation team in the office of the CTO ...
Location
United States, Austin
Salary:
110000.00 - 167000.00 USD / Year
Enphase Energy
Expiration Date
Until further notice
Requirements
  • BS/MS/PhD Mechanical Engineering or closely related discipline
  • Minimum experience: BS and 5+ years / MS and 3+ years / PhD with thermal design experience for a Senior Engineer
  • BS and 8+ years / MS and 6+ years / PhD and 3+ years thermal design experience for a Staff Engineer
  • BS and 12+ years / MS and 8+ years / PhD and 5+ years thermal design experience for a Sr. Staff Engineer
  • Significant experience designing active and passive thermal management solutions within a product and with key thermal components such as heat sinks, air flow systems, coolant systems, multi-phase cooling, phase change materials, and thermal interface materials
  • Excellent understanding of conduction, convection, and radiation
  • Experience with analytical approaches and simulation tools for thermal modeling (e.g., Solidworks Flow Sim, Fluent, Icepak, COMSOL, etc.)
  • High proficiency with an enterprise CAD package (Solidworks preferred)
  • Experience designing test plans, building experimental setups, collecting data, and analyzing results for thermal performance
  • Familiarity with design for manufacturability and reliability
Job Responsibility
  • Design, integrate, and validate thermal management solutions for Enphase energy storage and systems products
  • Build, update, and analyze computational models including heat transfer, combustion, and air flow to simulate thermal performance in energy storage and systems products
  • Specify key functional and performance requirements; design and execute verification test plans
  • Design high quality mechanical and electromechanical part and assembly designs efficiently and with good modeling practices
  • Produce 2D drawings and 3D CAD models, evaluate parts, and collaborate with a multi-disciplinary design team, suppliers, and contract manufacturing partners to develop product prototypes
  • Research, analyze, implement, and test advanced thermal management technologies to support innovative energy storage and system product designs
  • Complete documentation for design, testing, and analysis
  • Hands-on fabrication of electromechanical assemblies and prototypes
  • Work with a multi-disciplinary team to test and troubleshoot prototype designs
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime

Member of Technical Staff, Research

As a Member of Technical Staff on the Research team, you’ll push the boundaries ...
Location
United States, San Mateo
Salary:
175000.00 - 240000.00 USD / Year
Fireworks AI
Expiration Date
Until further notice
Requirements
  • Research background in Artificial Intelligence, Machine Learning, Physics, or similar field
  • Experience solving analytical problems using analytic and quantitative approaches
  • Experience communicating research to audiences with different backgrounds
  • Experience coding in C/C++, Python, or other similar languages
Job Responsibility
  • Conduct foundational research to advance the capabilities, efficiency, and reliability of LLMs and multimodal systems
  • Design, implement, and evaluate novel model architectures, training methods, and optimization techniques
  • Collaborate with engineering teams to transition research prototypes into production-grade systems
  • Analyze empirical results, identify performance bottlenecks, and iterate quickly to improve model quality
  • Contribute to internal research strategy by identifying high-impact opportunities and emerging trends in AI
What we offer
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Fulltime

Sr. Staff Mechanical Design Engineer

The Enphase Energy Storage and Systems Innovation team in the office of the CTO ...
Location
United States, Austin
Salary:
110000.00 - 167000.00 USD / Year
Enphase Energy
Expiration Date
Until further notice
Requirements
  • BS/MS/PhD Mechanical Engineering or closely related discipline
  • Minimum experience: BS and 5+ years / MS and 3+ years / PhD with mechanical design experience for a Senior Engineer
  • BS and 8+ years / MS and 6+ years / PhD and 3+ years mechanical design experience for a Staff Engineer
  • BS and 12+ years / MS and 8+ years / PhD and 5+ years mechanical design experience for a Sr. Staff Engineer
  • Significant experience in product and part mechanical design
  • Expertise leveraging various high volume manufacturing approaches such as sheet metal stamping, die casting, extrusion, forging, laser welding, CNC machining, thermoforming, silicone/rubber molding, injection molding, etc.
  • Proficiency with structural FEA and analyses for vibration, fatigue, and load bearing
  • Experience designing products for manufacturability and reliability
  • High proficiency with an enterprise CAD package (Solidworks preferred)
  • Outdoor product design experience for various ingress protection environments
Job Responsibility
  • Design high quality mechanical and electromechanical part and assembly designs efficiently and with good modeling practices
  • Produce 2D drawings and 3D CAD models, evaluate parts, and collaborate with a multi-disciplinary design team, suppliers, and contract manufacturing partners to develop product prototypes
  • Specify key functional and performance requirements; design and execute verification test plans
  • Perform mechanical analysis, structural FEA, vibration and shock analysis, FMEA, and design verification testing to inform designs and resolve issues and failures in prototype products
  • Research, analyze, implement, and test new materials and technologies to support innovative energy storage and system product designs
  • Complete documentation for design, testing, and analysis
  • Hands-on fabrication of electromechanical assemblies and prototypes
  • Work with a multi-disciplinary team to test and troubleshoot prototype designs
  • Mentor junior staff
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime

Staff Machine Learning Engineer

Are you excited by the challenge of pushing the boundaries of what modern AI mod...
Location
United States, Austin
Salary:
Not provided
Orbis Consultants
Expiration Date
Until further notice
Requirements
  • Advanced degree in Computer Science or a related field (PhD preferred)
  • 6+ years of industry experience building and deploying machine learning models at scale
  • Deep expertise in LLMs, Mixture-of-Experts architectures, and modern ML frameworks such as PyTorch or TensorFlow
  • Demonstrated innovation through impactful research, patents, or production-grade ML systems
  • Ability to lead complex, cross-functional technical initiatives
  • Strong problem-solving skills and a passion for pushing the boundaries of AI
Job Responsibility
  • Architect, train, and optimize cutting-edge LLMs and MoE-based systems
  • Experiment with novel algorithms to improve efficiency, scalability, and model performance
  • Collaborate closely with engineering and product teams to deploy ML capabilities into production
  • Contribute to pioneering research in ML and NLP, driving methodological advancements
  • Mentor engineers and help shape technical best practices across the organisation
What we offer
  • Competitive compensation and performance incentives
  • Comprehensive medical, dental, and vision benefits
  • Monthly wellness stipend + annual continuing education credit
  • A flexible work environment and unlimited approved PTO
  • Parental and bereavement leave and other employee support programs
  • Relocation support available
  • Fulltime

Senior Staff Machine Learning Engineer (AI Agent)

At Cresta, the AI Agent team is on a mission to create state-of-the-art AI Agent...
Location
United States; Canada
Salary:
Not provided
Cresta
Expiration Date
Until further notice
Requirements
  • Bachelor’s Degree in Computer Science, Mathematics, or a related field
  • Master’s or Ph.D. preferred, or equivalent professional experience
  • 7+ years of hands-on industry experience with AI and machine learning
  • 3+ years of experience working with LLMs in large-scale production environments
  • Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs
  • Proven leadership in designing and deploying AI solutions at scale
  • Extensive practical knowledge of modern machine learning frameworks and technologies (e.g., PyTorch, TensorFlow, Hugging Face, NumPy)
  • Experience with distributed systems and cloud-based AI infrastructure
  • Strong problem-solving and strategic thinking abilities
  • Proven ability to lead cross-functional teams and work collaboratively to deliver innovative AI solutions in production
Job Responsibility
  • Design, develop, and deploy Cresta’s AI Agent solutions and proprietary models
  • Focus on practical AI challenges such as improving reasoning, planning capabilities, and evaluation in real-world scenarios
  • Collaborate with cross-functional teams including front-end and back-end software engineers to integrate AI Agents into Cresta’s customer solutions
  • Lead initiatives to scale AI systems for production environments, ensuring performance and reliability across use cases
  • Contribute to solving cutting-edge problems in AI and help define the future roadmap for Cresta’s AI Agents
  • Innovate and research ways to improve security, cost-efficiency, and reliability of AI systems
What we offer
  • Variety of medical, dental, and vision plans
  • Paid parental leave
  • Monthly Health & Wellness allowance
  • Work from home office stipend
  • Lunch reimbursement for in-office employees
  • PTO: 3 weeks in Canada
  • Base salary, equity, and a variety of benefits
  • Fulltime

Senior Research Engineer, Model Evaluation

Evaluation is critical to making progress in scaling intelligence. As models con...
Location
United States; Canada; United Kingdom, Toronto; New York; Seattle; San Francisco; London
Salary:
Not provided
Cohere
Expiration Date
Until further notice
Requirements
  • You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)
  • You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.
  • You have deep experience building with and around LLMs, and you have built tools for analyzing and understanding their performance
  • You have strong software engineering skills
Job Responsibility
  • Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities
  • Conduct research to push the state of the art in LLM evaluation methods, including training LLM judges, improving evaluation efficiency, and scalably building high-quality datasets
  • Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO
  • Learn from and work with the best researchers and engineers in the field
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime

Member of Technical Staff, Integration/RL Team (Research Engineer)

The integration team is responsible for developing and scaling machine learning ...
Location
Salary:
Not provided
Cohere
Expiration Date
Until further notice
Requirements
  • Extremely strong software engineering skills
  • Value test-driven development and clean code, and strive to reduce technical debt at all levels
  • Proficiency in Python and related ML frameworks such as JAX, PyTorch and/or XLA/MLIR
  • Experience using and debugging large-scale distributed training strategies (memory/speed profiling)
  • [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray)
  • [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance
  • [Bonus] Experience in ML, LLM and RL academic research
Job Responsibility
  • Design and write high-performing and scalable software for training models
  • Develop new tools to support and accelerate research and LLM training
  • Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem
  • Craft and implement techniques to improve performance and speed up our training cycles across the SFT, offline preference, and RL regimes
  • Research, implement, and experiment with ideas on our cluster and data infrastructure
  • Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime

Member of Technical Staff, Data Engineering

As a Data Engineer specializing in pretraining data, you will play a pivotal rol...
Location
Salary:
Not provided
Cohere
Expiration Date
Until further notice
Requirements
  • Strong software engineering skills, with proficiency in Python and experience building data pipelines
  • Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools
  • Experience working with large-scale web datasets like CommonCrawl
  • A passion for bridging research and engineering to solve complex data-related challenges in AI model training
Job Responsibility
  • Design and build scalable data pipelines to ingest, parse, filter, and optimize diverse web datasets
  • Conduct data ablations to assess data quality and experiment with data mixtures to enhance model performance
  • Develop robust data modeling techniques to ensure datasets are structured and formatted for optimal training efficiency
  • Research and implement innovative data curation methods, leveraging Cohere’s infrastructure to drive advancements in natural language processing
  • Collaborate with cross-functional teams, including researchers and engineers, to ensure data pipelines meet the demands of cutting-edge language models
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime