CrawlJobs Logo

Senior Platform Engineer, ML Data Systems

United States, Mountain View 137871.00 - 172339.00 USD / Year · Job Posted December 09, 2025
Apply Position
Job Link Share

Job Description

We’re looking for an ML Data Engineer to evolve our eval dataset tools to meet the growing platform needs of AI-based tutoring at Khan Academy. We’re looking for someone who can gather internal requirements, design schema based on well-known dataset patterns, and deploy, document, and train people on an internal dataset management framework. The systems you design will need to integrate with trace management and human labeling APIs. You’ll work closely with other AI engineers, platform developers, and labeling teams to ensure our data is clean, representative, and ready for both human and automated evaluation. This role bridges ML operations, data engineering and data science— enabling our AI systems to learn from reliable, well-structured datasets that reflect the diversity and nuance of real learners.

Job Responsibility

  • Evolve and maintain pipelines for transforming raw trace data into ML-ready datasets
  • Clean, normalize, and enrich data while preserving semantic meaning and consistency
  • Prepare and format datasets for human labeling, and integrate results into ML datasets
  • Develop and maintain scalable ETL pipelines using Airflow, DBT, Go, and Python running on GCP
  • Implement automated tests and validation to detect data drift or labeling inconsistencies
  • Collaborate with AI engineers, platform developers, and product teams to define data strategies in support of continuously improving the quality of Khan’s AI-based tutoring
  • Contribute to shared tools and documentation for dataset management and AI evaluation
  • Inform our data governance strategies for proper data retention, PII controls/scrubbing, and isolation of particularly sensitive data such as offensive test imagery.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
  • 5 years of Software Engineering experience with 3+ of those years working with large ML datasets, especially those in open-source repositories such as Hugging Face
  • Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
  • Experience with data versioning tools (e.g., DVC, LakeFS) and cloud storage systems
  • Familiarity with machine learning workflows — from training data preparation to evaluation
  • Familiarity with the architecture and operation of large language models, and a nuanced understanding of their capabilities and limitations
  • Attention to detail and an obsession with data quality and reproducibility
  • Motivated by the Khan Academy mission “to provide a free world-class education for anyone, anywhere.”
  • Proven cross-cultural competency skills demonstrating self-awareness, awareness of other, and the ability to adopt inclusive perspectives, attitudes, and behaviors to drive inclusion and belonging throughout the organization.

Nice to have

  • Experience with labeling platforms (e.g., Label Studio, Scale AI, Toloka) or human-in-the-loop systems
  • Understanding of ML evaluation techniques, including prompt-based and generative model metrics
  • Exposure to MLOps practices such as model registry, feature store, or continuous evaluation
  • Background in education technology or other human-centered AI applications.

What we offer

  • Competitive salaries
  • Ample paid time off as needed
  • 8 pre-scheduled Wellness Days in 2026 occurring on a Monday or a Friday for a 3-day weekend boost
  • Remote-first culture - that caters to your time zone, with open flexibility as needed, at times
  • Generous parental leave
  • An exceptional team that trusts you and gives you the freedom to do your best
  • The chance to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
  • Opportunities to connect through affinity, ally, and social groups
  • 401(k) + 4% matching & comprehensive insurance, including medical, dental, vision, and life.

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Platform Engineer, ML Data Systems

8 matching positions

Senior Software Engineer, ML Data Platform

DUTIES: Develop fast, robust, and spike-resistant data consumption, data mining...
Location
Location
United States , Detroit
Salary
Salary:
216418.50 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Electronic Engineering, Management Information Systems, or related field of study and Five (5) years of experience as a Software Engineer, Programmer Analyst, or related occupation
  • Five (5) years of experience with: Building Peta Byte (PB) scale data management systems
  • Optimizing those data processing clusters for cost efficiency and performance
  • Building serving systems capable of delivering data at high-throughput, low-latency and high QPS (Queries Per Second) in a cost-efficient and spike-resilient manner
  • Building scalable infrastructure on the cloud with Python, Java, or Scala
  • Writing SQL queries for analytic purposes.
Job Responsibility
Job Responsibility
  • Develop fast, robust, and spike-resistant data consumption, data mining, and processing tools for the entire company
  • Develop orchestration for large-scale post-processing, and computational pipelines
  • Participate in the development, optimization and productionization of the next generation data processing platform using Beam and Spark in the cloud
  • Build self-serve capabilities to help customers to adopt the next generation data processing platform
  • Use the latest cloud technologies to own, design, implement, and test scalable distributed data systems in the cloud
  • Champion engineering excellence by continuously improving systems and processes
  • Own technical projects from start to finish, contribute to the team’s product roadmap, and be responsible for major technical decisions and tradeoffs
  • Effectively participate in team’s planning, code reviews and design discussions
  • Consider the effects of projects across multiple teams and proactively manage conflicts
  • Work with partner teams and orgs to achieve cross-organizational goals and satisfy broad requirements
What we offer
What we offer
  • An incentive pay program offers payouts based on company performance, job level, and individual performance
  • Fulltime
Read More
Arrow Right

Senior Staff Data Engineer- ML & AI Platform

At Marktplaats, data is at the heart of everything we do, but Intelligence is wh...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
adevinta.com Logo
Adevinta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience with a specific focus on the intersection of Data Engineering, MLOps, and AI Infrastructure
  • Deep knowledge of Spark internals, structured streaming, and performance tuning for large-scale data processing
  • Proven experience architecting end-to-end ML platforms for Traditional ML (Classic MLOps) while actively enabling the organization on Generative AI concepts
  • Strong background in building automated pipelines and ensuring system observability
  • Practical experience building infrastructure for Large Language Models, including managing the complexity of chaining models and tools
  • Solid experience serving models at low latency and high concurrency using containerized solutions
  • Ability to speak the language of AI/ML Engineers and effectively bridge the gap between experimental code and production systems
  • Expert level Python
  • Experience with PyTorch, Terraform, Terragrunt, Docker, Kubernetes, GitHub Actions, Datadog
  • Experience with Databricks AI Stack: MLflow, Mosaic AI, Unity Catalog, Feature Store, Databricks Model Serving, Vector Databases
Job Responsibility
Job Responsibility
  • Lead the evolution of our Machine Learning & AI Platform, designing the architecture for AI Agents and establishing patterns for Vector Databases
  • Act as a first mover: validate new Databricks features and integrate them into the platform
  • Write the guidelines for GenAI development, helping teams transition from notebook experiments to production-grade LLM applications
  • Design the Feature Store, manage the Model Registry, and set up the infrastructure for Vector Search and RAG (Retrieval Augmented Generation) workflows
  • Elevate the technical bar of the team, mentoring Staff and Senior engineers on design patterns, code quality, and architectural decisions
  • Translate complex requirements from ML Engineers and Data Scientists into robust engineering tickets and infrastructure roadmaps
What we offer
What we offer
  • An attractive Base Salary
  • Participation in our Short Term Incentive plan (annual bonus)
  • Work From Anywhere: Enjoy up to 20 days a year of working from anywhere
  • A 24/7 Employee Assistance Program for you and your family
  • Fulltime
Read More
Arrow Right

Senior Staff Data Engineer- ML & AI Platform

At Marktplaats, data is at the heart of everything we do, but Intelligence is wh...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
adevinta.com Logo
Adevinta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience with a specific focus on the intersection of Data Engineering, MLOps, and AI Infrastructure
  • Deep knowledge of Spark internals, structured streaming, and performance tuning for large-scale data processing
  • Proven experience architecting end-to-end ML platforms for Traditional ML (Classic MLOps) while actively enabling the organization on Generative AI concepts
  • Strong background in building automated pipelines and ensuring system observability
  • Practical experience building infrastructure for Large Language Models, including managing the complexity of chaining models and tools
  • Solid experience serving models at low latency and high concurrency using containerized solutions
  • Ability to speak the language of AI/ML Engineers and effectively bridge the gap between experimental code and production systems
Job Responsibility
Job Responsibility
  • Lead the evolution of our Machine Learning & AI Platform, designing the architecture for AI Agents and establishing patterns for Vector Databases
  • Act as a first mover, validate new Databricks features and integrate them into the platform
  • Write the guidelines for GenAI development, helping teams transition from notebook experiments to production-grade LLM applications
  • Design the Feature Store, manage the Model Registry, and set up the infrastructure for Vector Search and RAG (Retrieval Augmented Generation) workflows
  • Elevate the technical bar of the team, mentoring Staff and Senior engineers on design patterns, code quality, and architectural decisions
  • Translate complex requirements from ML Engineers and Data Scientists into robust engineering tickets and infrastructure roadmaps
What we offer
What we offer
  • An attractive Base Salary
  • Participation in our Short Term Incentive plan (annual bonus)
  • Work From Anywhere: Enjoy up to 20 days a year of working from anywhere
  • A 24/7 Employee Assistance Program for you and your family
  • A collaborative environment with an opportunity to explore your potential and grow
  • Fulltime
Read More
Arrow Right

Senior ML Platform Engineer, AI Platform

We are seeking a skilled and passionate ML Platform Engineer to join our team an...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
airwallex.com Logo
Airwallex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in backend software development
  • at least 2+ years focus on AI/ML Platform or MLOps infrastructure
  • deep expertise in MLOps practices, including automated deployment pipelines, model optimization, and production lifecycle management
  • proven experience designing and implementing low-latency model serving solutions
  • proficiency in Python
  • skill in writing high-quality, maintainable code
  • experience in design and development of large-scale distributed, high concurrency, low-latency inference, high availability systems
  • excellent communication and mentoring abilities
  • a relevant degree in Computer Science, Mathematics or related fields
Job Responsibility
Job Responsibility
  • Platform Development: Design, build, and maintain the end-to-end MLOps platform using Kubernetes and Cloud Services
  • Infrastructure as Code (IaC): Use Terraform or similar tools to manage, provision, and scale all ML-related infrastructure securely and efficiently
  • Pipeline Automation: Implement and optimize CI/CD/CT (Continuous Integration, Delivery, Training) pipelines to automate model training, testing, packaging, and deployment using tools like Argo and Kubeflow Pipelines
  • Serving Infrastructure: Build highly available, low-latency, and high-throughput model serving infrastructure
  • Observability: Implement robust monitoring, alerting, and logging solutions to track infrastructure health, model performance, and data/model drift
  • Tooling & Support: Evaluate, integrate, and support ML tools such as Feature Stores and distributed model training pipelines
  • Security & Compliance: Ensure platform security, implement RBAC (Role-Based Access Control), and manage secrets for sensitive data and production environments
  • Collaboration: Work closely with Data Scientists and ML Engineers to understand their needs and provide technical guidance on best practices for scaling their models
  • Fulltime
Read More
Arrow Right

Senior ML Platform Engineer

At WHOOP, we're on a mission to unlock human performance and healthspan. WHOOP e...
Location
Location
United States , Boston
Salary
Salary:
150000.00 - 210000.00 USD / Year
whoop.com Logo
Whoop
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s Degree in Computer Science, Engineering, or a related field
  • or equivalent practical experience
  • 5+ years of experience in software engineering with a focus on ML infrastructure, cloud platforms, or MLOps
  • Strong programming skills in Python, with experience in building distributed systems and REST/gRPC APIs
  • Deep knowledge of cloud-native services and infrastructure-as-code (e.g., AWS CDK, Terraform, CloudFormation)
  • Hands-on experience with model deployment platforms such as AWS SageMaker, Vertex AI, or Kubernetes-based serving stacks
  • Proficiency in ML lifecycle tools (MLflow, Weights & Biases, BentoML) and containerization strategies (Docker, Kubernetes)
  • Understanding of data engineering and ingestion pipelines, with ability to interface with data lakes, feature stores, and streaming systems
  • Proven ability to work cross-functionally with Data Science, Data Platform, and Software Engineering teams, influencing decisions and driving alignment
  • Passion for AI and automation to solve real-world problems and improve operational workflows
Job Responsibility
Job Responsibility
  • Architect, build, own, and operate scalable ML infrastructure in cloud environments (e.g., AWS), optimizing for speed, observability, cost, and reproducibility
  • Create, support, and maintain core MLOps infrastructure (e.g., MLflow, feature store, experiment tracking, model registry), ensuring reliability, scalability, and long-term sustainability
  • Develop, evolve, and operate MLOps platforms and frameworks that standardize model deployment, versioning, drift detection, and lifecycle management at scale
  • Implement and continuously maintain end-to-end CI/CD pipelines for ML models using orchestration tools (e.g., Prefect, Airflow, Argo Workflows), ensuring robust testing, reproducibility, and traceability
  • Partner closely with Data Science, Sensor Intelligence, and Data Platform teams to operationalize and support model development, deployment, and monitoring workflows
  • Build, manage, and maintain both real-time and batch inference infrastructure, supporting diverse use cases from physiological analytics to personalized feedback loops for WHOOP members
  • Design, implement, and own automated observability tooling (e.g., for model latency, data drift, accuracy degradation), integrating metrics, logging, and alerting with existing platforms
  • Leverage AI-powered tools and automation to reduce operational overhead, enhance developer productivity, and accelerate model release cycles
  • Contribute to and maintain internal platform documentation, SDKs, and training materials, enabling self-service capabilities for model deployment and experimentation
  • Continuously evaluate and integrate emerging technologies and deployment strategies, influencing WHOOP’s roadmap for AI-driven platform efficiency, reliability, and scale
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Senior Data Engineer - Platform Team

Our team of dedicated Data Engineers delivers comprehensive data engineering sol...
Location
Location
Israel , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or master’s degree in computer science, Math, Software Engineering, Computer Engineering, or related field
  • At least 6+ years’ experience in business analytics, data science, data modelling OR/AND Data Engineering work
  • Architecting, building, and maintaining end-to-end, high-throughput data systems and their supporting services
  • Designing efficient data structures and database schemas and working with distributed systems architecture
  • Experience working with large data sets using SQL/Azure Data Lake/Spark/Data bricks, etc., to derive actionable insights is highly desirable
  • Experience in Building AI \ ML-OPS tools for data platforms
  • Developing for continuous integration and automated deployments
Job Responsibility
Job Responsibility
  • Design, develop, and maintain data pipelines, ETL workflows, and back-end services for real-time decisioning, reporting, optimization, data collection, and related functions
  • Develop AI \ ML workflows to support queries and automate data driven processes
  • Experiment with and recommend new technologies that simplify or improve the tech stack
  • Participate in an on-call rotation and work occasional off-hours
  • Mentor data engineers, guide complex design decisions, and foster a culture of engineering excellence, and continuous learning across the organization
  • Fulltime
Read More
Arrow Right

Senior Data Platform Engineer

We are looking for a Senior Data Platform Engineer to help build and scale the n...
Location
Location
United States , San Diego
Salary
Salary:
143000.00 - 215000.00 USD / Year
resmed.com Logo
ResMed
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in a STEM field or equivalent practical experience
  • Significant hands-on experience as a data engineer or senior software engineer working on data-intensive systems (typically 5–8+ years)
  • Strong SQL expertise and experience with data modeling on large-scale analytical platforms (Snowflake preferred)
  • Proven experience building and operating production data pipelines using Python and cloud services
  • Proficiency with dbt or similar transformation and analytics engineering tools
  • Solid software engineering fundamentals, including system design, debugging, performance optimization, and maintainable code practices
  • Experience with Git/GitHub workflows, including pull requests, code reviews, and collaborative development
  • Hands-on experience building or working with CI/CD pipelines (GitHub Actions preferred), including automated testing and deployments
  • Ability to work effectively across both data engineering and analytics engineering responsibilities
  • Strong hands-on experience building and operating data systems on AWS, including designing cloud-native architectures and working with services such as S3, IAM, EC2/ECS/EKS, Lambda, Glue, EMR, or related AWS data and compute services
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable data pipelines for ingestion, transformation, and delivery using Python, SQL, Spark, APIs, and modern cloud-native tools
  • Develop high-quality analytics and data models in Snowflake using dbt or similar frameworks, with a focus on performance, correctness, and maintainability
  • Apply strong software engineering practices to data systems, including modular design, testing, code reviews, and version control
  • Implement automation, monitoring, and observability to ensure reliable and resilient data pipelines in production
  • Collaborate closely with product managers, analytics engineers, data scientists, and application engineers to deliver data products that drive business and clinical outcomes
  • Support advanced analytics and ML use cases by building feature pipelines and data foundations for classical ML models and emerging AI-driven workloads
  • Contribute to shared standards, patterns, and best practices across the data engineering organization through hands-on contributions and technical collaboration
What we offer
What we offer
  • comprehensive medical, vision, dental, and life, AD&D, short-term and long-term disability insurance, sleep care management, Health Savings Account (HSA), Flexible Spending Account (FSA), commuter benefits, 401(k), Employee Stock Purchase Plan (ESPP), Employee Assistance Program (EAP), and tuition assistance
  • Employees accrue fifteen days Paid Time Off (PTO) in their first year of employment, receive 11 paid holidays plus 3 floating days and are eligible for 14 weeks of primary caregiver or two weeks of secondary caregiver leave when welcoming new family members
  • Fulltime
Read More
Arrow Right

Senior ML Engineer - AI Platform & Agents

We are building agentic AI into the core of our product and need someone who can...
Location
Location
France , Bordeaux
Salary
Salary:
Not provided
phantombuster.com Logo
PhantomBuster
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
  • Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
  • Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
  • Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
  • Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
  • Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
  • Experience with cloud platforms for model training and deployment, especially AWS
  • Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
  • Fluency in English
Job Responsibility
Job Responsibility
  • Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
  • Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
  • Identify, source, and refine datasets to allow tuning models, powering retrieval pipelines, or expanding agentic workflows
  • Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
  • Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
  • Monitor, debug, and continuously improve deployed models and AI tools
  • Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
  • Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy
What we offer
What we offer
  • International team
  • Fun team building events
  • €40/month for remote work
  • Flexible working time
  • Home office budget up to €1500
  • 100% of an Alan Blue subscription
  • Lunch vouchers - €8 (50% The Phantom Company) / worked day
  • Partnership with MokaCare
  • €70 a month benefit for entertainment expenses
  • Book Allowance and Sharing Program
Read More
Arrow Right