Senior ML Platform Engineer Job at Whoop (Boston)

Senior ML Platform Engineer, AI Platform

We are seeking a skilled and passionate ML Platform Engineer to join our team an...

Location

Singapore , Singapore

Salary:

Not provided

Airwallex

Expiration Date

Until further notice

Requirements

5+ years in backend software development
at least 2 years focused on AI/ML Platform or MLOps infrastructure
deep expertise in MLOps practices, including automated deployment pipelines, model optimization, and production lifecycle management
proven experience designing and implementing low-latency model serving solutions
proficiency in Python
skill in writing high-quality, maintainable code
experience designing and developing large-scale distributed systems with high concurrency, low-latency inference, and high availability
excellent communication and mentoring abilities
a relevant degree in Computer Science, Mathematics or related fields

Job Responsibility

Platform Development: Design, build, and maintain the end-to-end MLOps platform using Kubernetes and Cloud Services
Infrastructure as Code (IaC): Use Terraform or similar tools to manage, provision, and scale all ML-related infrastructure securely and efficiently
Pipeline Automation: Implement and optimize CI/CD/CT (Continuous Integration, Delivery, Training) pipelines to automate model training, testing, packaging, and deployment using tools like Argo and Kubeflow Pipelines
Serving Infrastructure: Build highly available, low-latency, and high-throughput model serving infrastructure
Observability: Implement robust monitoring, alerting, and logging solutions to track infrastructure health, model performance, and data/model drift
Tooling & Support: Evaluate, integrate, and support ML tools such as Feature Stores and distributed model training pipelines
Security & Compliance: Ensure platform security, implement RBAC (Role-Based Access Control), and manage secrets for sensitive data and production environments
Collaboration: Work closely with Data Scientists and ML Engineers to understand their needs and provide technical guidance on best practices for scaling their models

Fulltime

Senior Platform Engineer, ML Data Systems

We’re looking for an ML Data Engineer to evolve our eval dataset tools to meet t...

Location

United States , Mountain View

Salary:

137871.00 - 172339.00 USD / Year

Khan Academy

Expiration Date

Until further notice

Requirements

Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
5 years of Software Engineering experience with 3+ of those years working with large ML datasets, especially those in open-source repositories such as Hugging Face
Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
Experience with data versioning tools (e.g., DVC, LakeFS) and cloud storage systems
Familiarity with machine learning workflows — from training data preparation to evaluation
Familiarity with the architecture and operation of large language models, and a nuanced understanding of their capabilities and limitations
Attention to detail and an obsession with data quality and reproducibility
Motivated by the Khan Academy mission “to provide a free world-class education for anyone, anywhere.”
Proven cross-cultural competency skills demonstrating self-awareness, awareness of others, and the ability to adopt inclusive perspectives, attitudes, and behaviors to drive inclusion and belonging throughout the organization.

Job Responsibility

Evolve and maintain pipelines for transforming raw trace data into ML-ready datasets
Clean, normalize, and enrich data while preserving semantic meaning and consistency
Prepare and format datasets for human labeling, and integrate results into ML datasets
Develop and maintain scalable ETL pipelines using Airflow, DBT, Go, and Python running on GCP
Implement automated tests and validation to detect data drift or labeling inconsistencies
Collaborate with AI engineers, platform developers, and product teams to define data strategies in support of continuously improving the quality of Khan’s AI-based tutoring
Contribute to shared tools and documentation for dataset management and AI evaluation
Inform our data governance strategies for proper data retention, PII controls/scrubbing, and isolation of particularly sensitive data such as offensive test imagery.

What we offer

Competitive salaries
Ample paid time off as needed
8 pre-scheduled Wellness Days in 2026 occurring on a Monday or a Friday for a 3-day weekend boost
Remote-first culture that caters to your time zone, with flexibility as needed
Generous parental leave
An exceptional team that trusts you and gives you the freedom to do your best
The chance to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
Opportunities to connect through affinity, ally, and social groups
401(k) + 4% matching & comprehensive insurance, including medical, dental, vision, and life.

Fulltime

Senior ML Engineer - AI Platform & Agents

We are building agentic AI into the core of our product and need someone who can...

Location

France , Bordeaux

Salary:

Not provided

PhantomBuster

Expiration Date

Until further notice

Requirements

5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
Experience with cloud platforms for model training and deployment, especially AWS
Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
Fluency in English

Job Responsibility

Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
Identify, source, and refine datasets to allow tuning models, powering retrieval pipelines, or expanding agentic workflows
Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
Monitor, debug, and continuously improve deployed models and AI tools
Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy

What we offer

International team
Fun team building events
€40/month for remote work
Flexible working time
Home office budget up to €1500
100% of an Alan Blue subscription
Lunch vouchers - €8 (50% The Phantom Company) / worked day
Partnership with MokaCare
€70 a month benefit for entertainment expenses
Book Allowance and Sharing Program

Senior ML Engineer - AI Platform & Agents

Join PhantomBuster as a Senior ML Engineer to build agentic AI with AWS Bedrock,...

Location

France; Spain; Portugal

Salary:

Not provided

PhantomBuster

Expiration Date

Until further notice

Requirements

5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
Experience with cloud platforms for model training and deployment, especially AWS
Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
Fluency in English

Job Responsibility

Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
Identify, source, and refine datasets to allow tuning models, powering retrieval pipelines, or expanding agentic workflows
Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
Monitor, debug, and continuously improve deployed models and AI tools
Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy

What we offer

Fully remote working environment (France, Spain, or Portugal)
Real ownership: you will define how agentic AI is built at PhantomBuster, not follow someone else's decisions
Freedom to research and adopt new technologies as the space evolves & to make an impact at a small, self-funded, and profitable tech startup by laying the foundation for machine learning and AI
Collaborative and open-minded culture based on rationality, humility, honesty, and long-term thinking
International team
Fun team building events
€40/month for remote work
Flexible working time
Home office budget up to €1500
100% of an Alan Blue subscription (French-based contracts)

Fulltime

Senior Software Engineer, ML Platform

We’re looking for a software engineer to join Parafin’s Infrastructure team and ...

Location

United States , San Francisco

Salary:

230000.00 - 265000.00 USD / Year

Parafin

Expiration Date

Until further notice

Requirements

5+ years of software engineering experience, including experience on ML platform/MLOps systems (training, deployment, and/or feature pipelines)
Strong Python
solid software design and testing fundamentals
Proficiency with SQL
hands-on Spark/PySpark experience
Knowledge of ML fundamentals—probability & statistics, supervised vs. unsupervised learning, bias/variance & regularization, feature engineering, model evaluation metrics, validation strategies, and production concerns like drift, stability, and monitoring
Expertise with modern data/ML stacks—AWS, Databricks (workflows, lakehouse, MLflow/registry, Model Serving), and Airflow (or equivalent orchestration)
Experience building real-time systems (service design, caching, rate limiting, backpressure) and batch pipelines at scale
Practical knowledge of feature-store concepts (offline/online stores, backfills, point-in-time correctness), model registries, experiment tracking, and evaluation frameworks
Strong problem-solving skills and a proactive attitude toward ownership and platform health

Job Responsibility

Turn notebooks into software
Decompose data scientist training/inference notebooks into reusable, tested components (libraries, pipelines, templates) with clear interfaces and documentation
Create developer-friendly ML abstractions
Build SDKs, CLIs, and templates that make it simple to define features, train/evaluate models, and deploy to batch or real-time targets with minimal boilerplate
Build our real-time ML inference platform
Stand up and scale low-latency model serving
Expand batch ML inference
Improve scheduling, parallelism, cost controls, observability, and failure/rollback for large-scale batch scoring and post-processing
Own and expand the feature store
Design offline/online feature definitions, high read/write throughput, and consistent offline/online semantics

What we offer

Equity grant
Medical, dental & vision insurance
Work from home flexibility
Unlimited PTO
Commuter benefits
Free lunches
Paid parental leave
401(k)
Employee assistance program

Fulltime

Senior ML Inference Engineer - Platform

The Model Deployment & Inference Solutions team in GM AV deploys machine learnin...

Location

United States , Austin; Mountain View

Salary:

128700.00 - 261300.00 USD / Year

General Motors

Expiration Date

Until further notice

Requirements

BS, MS, or PhD in Computer Science or a related technical field
3+ years of relevant industry experience
Strong fundamentals and excellent coding ability in Python
Experience building or operating production platform or infrastructure systems where reliability, observability, and extensibility matter
Experience with ML model deployment, inference integration, model optimization workflows, or model serving infrastructure, with at least one prior context where you owned the path from a trained model to a running inference workload
Experience using coding agents (Cursor, Claude Code, GitHub Copilot, or equivalent) as part of your engineering workflow
Experience designing clean, well-tested software with clear interfaces and good abstractions
Strong cross-team collaboration skills

Job Responsibility

Design, build, and operate the ML deployment platform that automates the path from trained model to on-vehicle inference
Drive cross-organization model deployments to the autonomous vehicle stack, partnering with model development teams to take high-value models from training to production on-vehicle
Build agentic tools that diagnose and fix deployment-blocking issues, automating workflows currently performed manually by engineers
Build the developer experience that ML model development teams use day to day: tooling, dashboards, automation, and observability
Drive shift-left validation that surfaces deployment risk (compile, runtime, parity, latency) early in the model development cycle
Build platform tools that integrate the work of our sister teams (kernels, compiler, reduced precision and parity) so their optimization wins land directly in the deployment workflow
Partner with the team's Performance pillar and model development teams across the AV organization

What we offer

Medical
Dental
Vision
Health Savings Account
Flexible Spending Accounts
Retirement savings plan
Sickness and accident benefits
Life insurance
Paid vacation & holidays
Tuition assistance programs

Fulltime

Senior Software Engineer, ML Data Platform

DUTIES: Develop fast, robust, and spike-resistant data consumption, data mining...

Location

United States , Detroit

Salary:

216418.50 USD / Year

General Motors

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Electronic Engineering, Management Information Systems, or related field of study and Five (5) years of experience as a Software Engineer, Programmer Analyst, or related occupation
Five (5) years of experience with: Building petabyte (PB) scale data management systems
Optimizing those data processing clusters for cost efficiency and performance
Building serving systems capable of delivering data at high-throughput, low-latency and high QPS (Queries Per Second) in a cost-efficient and spike-resilient manner
Building scalable infrastructure on the cloud with Python, Java, or Scala
Writing SQL queries for analytic purposes.

Job Responsibility

Develop fast, robust, and spike-resistant data consumption, data mining, and processing tools for the entire company
Develop orchestration for large-scale post-processing, and computational pipelines
Participate in the development, optimization and productionization of the next generation data processing platform using Beam and Spark in the cloud
Build self-serve capabilities to help customers to adopt the next generation data processing platform
Use the latest cloud technologies to own, design, implement, and test scalable distributed data systems in the cloud
Champion engineering excellence by continuously improving systems and processes
Own technical projects from start to finish, contribute to the team’s product roadmap, and be responsible for major technical decisions and tradeoffs
Effectively participate in the team’s planning, code reviews, and design discussions
Consider the effects of projects across multiple teams and proactively manage conflicts
Work with partner teams and orgs to achieve cross-organizational goals and satisfy broad requirements

What we offer

An incentive pay program offers payouts based on company performance, job level, and individual performance

Fulltime

Senior Software Engineer - Matching ML Platform

Uber is looking for a Software Engineer to join our Matching ML Platform team. T...

Location

United States , Seattle; San Francisco; Sunnyvale

Salary:

202000.00 - 224000.00 USD / Year

Uber

Expiration Date

Until further notice

Requirements

5+ years experience working on the full software life cycle including gathering requirements, project planning, solution design, coding/implementation, testing, rollout/deployment and best practices as an individual contributor
Experience with ML in production systems
Experience coding in a general-purpose programming language (e.g., C/C++, Java, Python, Go, C#)
Fast and passionate learner
Strong collaboration, documentation and communication skills

Job Responsibility

Build and scale a low-latency platform powering millions of real-time match decisions per second
Identify opportunities to improve the performance and health of various ML systems
Design modular systems that accelerate product innovation without rework
Optimize for fairness, efficiency, and marketplace health at global scale
Collaborate across product, infra, and ML teams to deliver business-critical impact

What we offer

Eligible to participate in Uber's bonus program
May be offered an equity award & other types of comp
All full-time employees are eligible to participate in a 401(k) plan
Eligible for various benefits

Fulltime
