CrawlJobs Logo

Member of Technical Staff, Training Infra Engineer

cohere.com Logo

Cohere

Location Icon

Location:

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Contribute in and provide strong support for model training pipelines, ship state of the art models to production, and bridge the gap between research and production. We have one of the highest ratio of compute to engineers in the world. We do not delineate strongly between engineering and research. Everyone will contribute to writing production code and supporting our research effort depending on individual interest and organizational needs. We have all the compute, data, and talent available for you to do your best work.

Job Responsibility:

  • Design and write high-performant and scalable software for training
  • Improve our training setup from an infrastructure and codebase performance standpoint
  • Craft and implement tools to speed up our training cycles and improve the overall efficacy of our training infrastructure
  • Research, implement, and experiment with ideas on our supercompute and data infrastructure
  • Learn from and work with the best researchers in the field

Requirements:

  • Extremely strong software engineering skills
  • Proficiency in Python and related ML frameworks such as JAX, Pytorch and XLA/MLIR
  • Experience with distributed training infrastructures (Kubernetes, Slurm) and associated frameworks (Ray)
  • Experience using large-scale distributed training strategies
  • Hands on experience on training large model at scale and having contributed to the tooling and/or setup of the training infrastructure

Nice to have:

paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)

What we offer:
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of Technical Staff, Training Infra Engineer

New

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location
Location
United States , Palo Alto
Salary
Salary:
90000.00 - 300000.00 USD / Year
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python
  • strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility
Job Responsibility
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right
New

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location
Location
United States , Chevy Chase; New York City; Palo Alto
Salary
Salary:
115000.00 - 300000.00 USD / Year
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python
  • strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility
Job Responsibility
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Data Infra - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...
Location
Location
United States , Mountain View
Salary
Salary:
163000.00 - 296400.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 8+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR equivalent experience
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 12+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 15+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR equivalent experience
  • 4+ years experience with data governance, data compliance and/or data security
  • Passionate about the role of data in large-scale AI model training
  • Thrive in a highly collaborative, fast-paced environment
  • Have a high degree of expertise and pay close attention to details
Job Responsibility
Job Responsibility
  • Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
  • Own and maintain critical data infrastructures, including spark, ray, vector databases, and others
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Production Engineering Manager, Rotational Network Engineering (RNE) Program

This is unique role within Meta's Infrastructure organization. Meta’s Rotational...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of Networking, System Administration, Software Engineering or Product Development experience
  • Familiarity with source control, software development cycles and practices
  • Experience with launching and iterating on product, services, tools or technical frameworks
  • Experience managing an engineering team
  • B.S. in Engineering or equivalent experience
  • Analytical and troubleshooting skills
Job Responsibility
Job Responsibility
  • Build a plan for each team member with technical leads and mentors from Network Infrastructure, Backbone, and Datacenter Engineering
  • Establish and foster fruitful working relationships with various stakeholders and teams within Network Infra
  • Manage and grow multi-disciplinary recruiting plans across universities and industry
  • Develop and manage work plans from recruiting, to mentor and task selection to team assignment
  • Manage expectations of all interested parties: define clear program roadmap with key deliverables and milestone dates, maintain program information wiki pages, and identify and communicate risks and adjustments to the overall program to meet recruitment demands from Network Engineering teams
  • Understand the network product delivery cycle
  • Work closely with dedicated recruiting staff to expand the team, including sourcing candidates, interviewing candidates, participating in conferences/events, and on-boarding new employees
  • Influence Network Infrastructure teams for their buy in to the program, obtain agreement to provide mentors and projects and to consider rotational engineers as one of their hiring pipelines
  • Enable and unblock engineers through coaching, learning, and mentorship programs
  • Responsible for people management of a team of engineers, providing performance reviews, continual feedback, coaching and career growth for direct reports
Read More
Arrow Right
New

Pharmacy Technician

We’re building a world of health around every individual — shaping a more connec...
Location
Location
United States , Studio City
Salary
Salary:
19.46 USD / Hour
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
April 10, 2026
Flip Icon
Requirements
Requirements
  • Must comply with any state board of pharmacy requirements or laws governing the practice of pharmacy, which includes but is not limited to, age, education, and licensure/certification
  • If the state board of pharmacy does not address or mandate a minimum age requirement, must be at least 16 years of age
  • If the state board of pharmacy does not address or mandate a minimum educational requirement, must have a high school diploma or equivalent, or be actively enrolled in high school or high school equivalency program
  • State-level licensure and national certification requirements vary by state
  • Regular and predictable attendance, including nights and weekends
  • Ability to complete required training within designated timeframe
  • Attention and Focus: Ability to concentrate on a task over a period of time
  • Ability to pivot quickly from one task to another to meet patient and business needs
  • Ability to confirm prescription information and label accuracy, ensuring patient safety
  • Customer Service and Team Orientation: Actively look for ways to help people, and do so in a friendly manner
Job Responsibility
Job Responsibility
  • Support the pharmacy team in delivering operational and service excellence
  • Assist the pharmacy team to ensure that pharmacy operations run smoothly, our patients’ prescriptions are filled promptly, safely, and accurately, and we are providing caring service that exceeds patient expectations
  • Operate as part of the pharmacy team through consistent application of Standard Operating Procedures (SOPs), best practices, and effective communication
  • Following pharmacy workflow procedures at each pharmacy workstation (i.e., production, pick-up, drive-thru, and drop-off) for safe and accurate prescription fulfillment
  • Contributing to positive patient experiences by showing empathy and genuine care: creating heartfelt and personalized moments while serving patients at pick-up, drive-thru, and over the phone
  • keeping patients healthy by offering immunizations and other services at the register and over the phone
  • and demonstrating compassionate care by solving or escalating patient problems
  • Completing basic inventory activities, as permitted by law, and as directed by the pharmacy leadership team, such as accurately putting away medication deliveries and completing cycle counts, returns-to-stocks, waiting bin inventories, etc.
  • Contributing to a high-performing team, embracing a growth mindset, and being receptive to feedback
  • actively seeking opportunities to expand clinical and technical knowledge needed to better assist patients
What we offer
What we offer
  • Affordable medical plan options
  • a 401(k) plan (including matching company contributions)
  • an employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility
  • Fulltime
Read More
Arrow Right
New

Research Fellow

The research will advance the use of multiple systems estimations in humanitaria...
Location
Location
United Kingdom , London
Salary
Salary:
45728.00 - 51872.00 GBP / Year
lshtm.ac.uk Logo
London School of Hygiene & Tropical Medicine
Expiration Date
March 03, 2026
Flip Icon
Requirements
Requirements
  • Postgraduate degree, ideally a doctoral degree, in a relevant topic
  • Proven expertise in data science or related fields
  • Strong skills in quantitative analysis applied to large or complex datasets
  • Knowledge of social media data sources, including accessibility, platform characteristics, ethical considerations
  • Experience developing or applying search strategies and lexicons
  • Proficiency in fitting and validating statistical models or machine learning algorithms
  • Advanced skills in R and/or Python for data processing and analysis
  • Familiarity with natural language processing techniques, such as text pre-processing (tokenization, stemming/lemmatization) and feature extraction
Job Responsibility
Job Responsibility
  • Advance the use of multiple systems estimations in humanitarian and public health contexts
  • Develop reproducible, multilingual workflows for social media analysis
  • Build data pipelines in R/Python
  • Create open-source tools for text and feature extraction
  • Assess and mitigate biases in social media data
  • Design evidence-based heuristics to guide researchers in applying these methods effectively
What we offer
What we offer
  • Annual leave entitlement is 30 working days per year, pro rata for part-time staff
  • Discretionary “Wellbeing Days”
  • Membership of the Pension Scheme
Read More
Arrow Right
New

Senior Manager, Staff Counsel

GEICO is seeking a Senior Manager of multiple Staff Counsel office activities in...
Location
Location
United States , Nashville; Knoxville; Memphis
Salary
Salary:
Not provided
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Juris Doctor degree REQUIRED
  • Must be licensed in good standing to practice law in applicable jurisdictions and meet and maintain licensing requirements including mandatory Continuing Legal Education (CLE) requirements where applicable
  • Must have a minimum of 10 years of litigation experience, including insurance defense or personal injury
  • Must be able to travel as required, including but not limited, to attend trials, hearings, depositions, management meetings and conferences
  • Must be able, with or without accommodation, to perform the essential functions which include, but are not limited to, thinking (concentrating, focusing, assimilating information), reading, writing, listening, typing, speaking, bending, reaching, lifting, and standing for extended periods
  • Must be able to communicate in a professional manner in person, via telephone and written correspondence/email
  • Must be able to document files in a clear, concise, professional written manner, to be understood by customers, clients, co-workers and other employees of the organization
  • Must be able to follow complex instructions, resolve conflicts or facilitate conflict resolution, and have strong organization/priority setting and multi-tasking skills
  • Must demonstrate successful performance in handling primary trial responsibility for cases of significant severity and complexity
Job Responsibility
Job Responsibility
  • Manages subordinates in all activities relating to the defense of lawsuits and against GEICO insureds in liability and property damage cases, and on behalf of GEICO in UM/UIM and Subrogation suits
  • Interviews and/or approves job applicants for employment
  • Conducts and/or reviews associate Performance Appraisals
  • Initiates or approves salary adjustments, performance ratings, and other personnel changes
  • Counsels associates and take disciplinary action or terminate the employment of associates as appropriate
  • May represent GEICO insureds in liability cases, and UM/UIM, subrogation, and PD suits filed in courts of limited and unlimited jurisdiction
  • Research laws and prepare legal briefs, opinions, and memoranda
  • Renders opinions on liability, damage, and value as requested by the Claims Department
  • May prepare and handle pleadings, motions, and discovery, to include depositions/examinations before trial and examinations under oath, and other deadlines
  • Trains and supervises less experienced attorneys, including assisting attorneys as first and second chair counsel, and/or observing attorneys at trials and arbitrations
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right
New

Clinical Reviewer Specialist

The Clinical Reviewer Specialist role involves conducting clinical reviews to pr...
Location
Location
Philippines , Manila
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 1-3 years of experience in processing appeals or utilization management
  • US Registered Nurse (USRN)
  • Knowledge of utilization management process
  • Knowledge of NCQA, Medicaid regulations
  • Good communication (Demonstrate strong reading comprehension and writing skills)
  • Able to work independently, strong analytic skills
  • Required shift timings - US daytime
Job Responsibility
Job Responsibility
  • Performs clinical reviews needed to resolve and process appeals by reviewing medical records and clinical data to determine medical necessity for services in accordance with policies, guidelines, and National Committee for Quality Assurance (NCQA) standards
  • Prepares case reviews for Medical Directors by researching the appeal, reviewing applicable criteria, and analyzing the basis for the appeal
  • Ensures timely review, processing, and response to appeal in accordance with State, Federal and NCQA standards
  • Communicates with providers, facilities and other departments regarding appeal requests
  • Generates appropriate appeals resolution communication and reporting for the member and provider in accordance with company policies, State, Federal and NCQA standards
  • Works with leadership to increase the consistency, efficiency, and appropriateness of responses of all appeal requests
  • Partners with interdepartmental teams to improve clinical appeals processes and procedures to prevent recurrences based on industry best practices
  • Uses sound judgement, especially in non-routine appeals, to make decisions to keep the appeal process moving forward in accordance with contractual timeliness standards
  • Maintain files on individual appeals by gathering, analyzing and reporting verbal and written member and provider appeals
  • Review claim appeal for reconsideration and recommend approvals/denials based on determination level or prepare for medical review presentation
Read More
Arrow Right