CrawlJobs Logo

Machine Learning Data Engineer - Systems & Retrieval

United States, Palo Alto · Job Posted January 13, 2026
Apply Position
Job Link Share

Job Description

As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation upon which intelligent systems are trained, retrieved, and reasoned over.

Job Responsibility

  • Design and implementation of distributed data ingestion and transformation pipelines
  • Building retrieval and indexing systems that support RAG and other LLM-based methods
  • Mining and organizing large unstructured datasets, both in research and production environments
  • Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability
  • Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements

  • Strong software engineering background with fluency in Python
  • Experience designing, building, and maintaining data pipelines in production environments
  • Deep understanding of data structures, storage formats, and distributed data systems
  • Familiarity with indexing and retrieval techniques for large-scale document corpora
  • Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics
  • Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)
  • Excellent debugging, observability, and logging practices to support reliability at scale
  • Strong communication skills and experience collaborating across ML, infra, and product teams

Nice to have

  • Experience building or maintaining LLM-integrated retrieval systems (e.g, RAG pipelines)
  • Academic or industry background in data mining, search, recommendation systems, or IR literature
  • Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar
  • Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval
  • Understanding of data validation and quality assurance in machine learning workflows
  • Experience working on cross-functional infra and MLOps teams
  • Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops
  • Comfort working across raw, unstructured data, structured databases, and model-ready formats

What we offer

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team
  • Thursday Happy Hours

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Machine Learning Data Engineer - Systems & Retrieval

8 matching positions

Staff Applied Machine Learning Engineer - Intelligent Data, Signals & Systems

As a Staff Applied Machine Learning Engineer focused on Intelligent Data, Signal...
Location
Location
United States , Bay Area
Salary
Salary:
276800.00 - 415200.00 USD / Year
cash.app Logo
Cash App
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years building and operating production software and ML systems for business-critical products
  • Deep expertise in intelligent systems such as ranking/retrieval, recommendations, search, personalization, growth and lifecycle ML, customer intelligence, propensity/churn/LTV, next-best-action, or model-derived risk signals
  • Strong production ML judgment across feature pipelines, model serving, experimentation, monitoring, feedback loops, online/offline consistency, and reliable signal interfaces
  • Ability to evaluate impact beyond short-term conversion, including trust, fairness, access, risk, compliance, and long-term engagement
  • Experience using AI-assisted engineering tools with appropriate verification, testing, and review for customer-impacting systems.
Job Responsibility
Job Responsibility
  • Build and operate production ML systems that turn customer and product context into trusted signals, rankings, recommendations, and decision capabilities
  • Design production data and signal contracts that define intended use, freshness, provenance, confidence, eligibility, and calibration for downstream consumers
  • Own ranking, retrieval, recommendation, search, propensity, and next-best-action systems end to end, from feature and candidate generation through serving, experimentation, monitoring, and feedback loops
  • Evaluate customer and business impact beyond short-term conversion, including trust, fairness, access, risk, compliance, long-term engagement, and segment-level performance
  • Partner across product, growth, data, platform, modeling, risk, and compliance to translate ambiguous goals into measurable ML system designs
  • Use AI and agents to accelerate development, analysis, testing, documentation, and operations while exposing reusable capabilities to product services, internal tools, and AI-assisted workflows.
What we offer
What we offer
  • Remote work
  • Medical insurance
  • Flexible time off
  • Retirement savings plans
  • Modern family planning
  • Fulltime
Read More
Arrow Right

Staff Machine Learning Engineer - Data Intelligence

We're building the platform that makes AI possible across Culture Amp. This is a...
Location
Location
Australia , Melbourne
Salary
Salary:
Not provided
cultureamp.com Logo
Culture Amp
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong platform engineering fundamentals, with experience building and operating services that other teams depend on
  • Expertise in Python and experience with ML tooling and infrastructure
  • Deep experience with large-scale data systems (streaming, batch processing, data lakes)
  • Proven experience with ML infrastructure: model serving, vector databases, embedding pipelines
  • Strong understanding of cloud platforms (AWS preferred) and backend architecture
  • Experience building and optimising RAG and retrieval systems
  • The communication skills to work across teams and influence technical direction beyond your own team
Job Responsibility
Job Responsibility
  • Designing and operating the platform services that power AI features across Culture Amp, including inference pipelines, embedding storage, and retrieval systems
  • Building a scalable approach to vector search across diverse categories of unstructured data (survey responses, performance feedback, company documents)
  • Driving MLOps and LLMOps practices across the organisation, including observability, cost management, and reliability
  • Ensuring AI is used responsibly: implementing guardrails, security controls, and data compliance measures
  • Partnering with data scientists on the team to productionise models and evaluate new AI capabilities
What we offer
What we offer
  • Employee Share Options Program
  • Programs, coaching, and budgets to help you thrive personally and professionally
  • Access to external providers for mental wellbeing and coaching support
  • Monthly Camper Life Allowance
  • Team budgets dedicated to team building activities and connection
  • Intentional quarterly wellbeing pauses
  • Extended year-end breaks
  • Excellent parental leave and in work support program available from day 1
  • 5 Social Impact Days a year
  • MacBooks for you to do your best & a work from home office budget
Read More
Arrow Right

Staff Machine Learning Engineer - Data Intelligence

We're building the platform that makes AI possible across Culture Amp. This is a...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
cultureamp.com Logo
Culture Amp
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong platform engineering fundamentals, with experience building and operating services that other teams depend on
  • Expertise in Python and experience with ML tooling and infrastructure
  • Deep experience with large-scale data systems (streaming, batch processing, data lakes)
  • Proven experience with ML infrastructure: model serving, vector databases, embedding pipelines
  • Strong understanding of cloud platforms (AWS preferred) and backend architecture
  • Experience building and optimising RAG and retrieval systems
  • The communication skills to work across teams and influence technical direction beyond your own team
Job Responsibility
Job Responsibility
  • Designing and operating the platform services that power AI features across Culture Amp, including inference pipelines, embedding storage, and retrieval systems
  • Building a scalable approach to vector search across diverse categories of unstructured data (survey responses, performance feedback, company documents)
  • Driving MLOps and LLMOps practices across the organisation, including observability, cost management, and reliability
  • Ensuring AI is used responsibly: implementing guardrails, security controls, and data compliance measures
  • Partnering with data scientists on the team to productionise models and evaluate new AI capabilities
What we offer
What we offer
  • Employee Share Options Program
  • Programs, coaching, and budgets to help you thrive personally and professionally
  • Access to external providers for mental wellbeing and coaching support
  • Monthly Camper Life Allowance
  • Team budgets dedicated to team building activities and connection
  • Intentional quarterly wellbeing pauses
  • Extended year-end breaks
  • Excellent parental leave and in work support program available from day 1
  • 5 Social Impact Days a year
  • MacBooks for you to do your best & a work from home office budget
Read More
Arrow Right

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...
Location
Location
United States , San Francisco
Salary
Salary:
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 1-3 years of experience training, deploying, and scaling ML/AI models in production environments
  • Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
  • Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
  • Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
  • Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes
Job Responsibility
Job Responsibility
  • Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
  • Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
  • Developing new AI applications that enable innovative product experiences across fintech
  • Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
  • Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
  • Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact
What we offer
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
  • Fulltime
Read More
Arrow Right

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...
Location
Location
United States , New York
Salary
Salary:
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 1-3 years of experience training, deploying, and scaling ML/AI models in production environments
  • Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
  • Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
  • Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
  • Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes
Job Responsibility
Job Responsibility
  • Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
  • Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
  • Developing new AI applications that enable innovative product experiences across fintech
  • Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
  • Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
  • Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact
What we offer
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Systems Engineer

Our organization drives AI innovation across Jira products. We deliver seamless ...
Location
Location
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive experience building Machine Learning and AI solutions (4+ years)
  • Proven experience developing, deploying, and maintaining end-to-end ML systems, including data engineering, model serving, and monitoring
  • Expert proficiency with GenAI frameworks and tools, including developing and fine-tuning large language models (LLMs) and building retrieval-augmented generation (RAG) systems
  • Expert proficiency in Python and ML frameworks like PyTorch, TensorFlow, or JAX
  • Experience implementing MLOps, CI/CD pipelines, and automation for continuous training, deployment, and monitoring of ML models
Job Responsibility
Job Responsibility
  • Collaborate with software engineers, data scientists, and product managers to solve complex problems
  • Lead projects from technical design through launch
  • Partner with teams to achieve impactful results
  • Deliver robust ML solutions to build AI features reaching millions
  • This includes curating ML datasets, fine-tuning open-source LLMs, or accessing proprietary LLMs
  • Mentor junior members of the team
What we offer
What we offer
  • Health and wellbeing resources
  • Paid volunteer days
Read More
Arrow Right

Sr Machine Learning Engineer

Let’s do this. Let’s change the world. We are looking for a highly motivated exp...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with AWS, Databricks, Apache Spark, PySpark, SparkSQL, Python, and SQL for large-scale data engineering
  • Strong proficiency in workflow orchestration, Spark performance tuning, and scalable batch and streaming data pipeline development
  • Experience with real-time data processing and integration using Apache Kafka, Debezium, or similar streaming technologies
  • Hands-on experience with MLOps tools and practices, including MLflow, model serving, feature stores, experiment tracking, deployment, and lifecycle management
  • Experience with GenAI engineering practices, including prompt engineering, LLM evaluation, AI observability, agentic workflows, and knowledge graphs
  • Ability to design and develop APIs or service interfaces for data, ML, and GenAI application integration
  • Experience with Agile/SAFe delivery models, DevOps practices, CI/CD concepts, and cross-functional team collaboration
  • Strong analytical, problem-solving, debugging, communication, and teamwork skills
  • Ability to quickly learn, adapt, and apply emerging technologies across data, ML, and AI engineering
  • Doctorate degree / Master's degree / Bachelor's degree and 8 to 13 years of experience years of experience in Computer Science, IT or related field
Job Responsibility
Job Responsibility
  • Design, deploy, monitor, and optimize production-grade ML and Generative AI applications for AI-enabled manufacturing solutions
  • Define technical architecture, engineering standards, and best practices across data engineering, ML, GenAI, analytics, and platform capabilities
  • Partner with business stakeholders, product owners, and cross-functional teams to translate manufacturing challenges into secure, scalable, production-ready AI and data solutions
  • Design, develop, and maintain complex ETL/ELT pipelines in Databricks using PySpark, Scala, and SQL for large-scale structured and unstructured data processing
  • Build efficient ingestion, transformation, migration, and deployment pipelines across databases, APIs, logs, event streams, images, PDFs, documents, and third-party platforms
  • Design and implement GenAI solutions including RAG, embeddings, vector databases, agentic workflows, tool-calling systems, LLM orchestration, serving optimization, knowledge graphs, and metadata-driven retrieval
  • Build GenAI applications using frameworks and platforms such as LangChain, LangGraph, LlamaIndex, DSPy, OpenAI APIs, Amazon Bedrock, or equivalent technologies
  • Develop evaluation and observability frameworks to monitor model quality, hallucination rates, drift, retrieval effectiveness, latency, token usage, cost, reliability, operational health, and business impact
  • Build and maintain MLOps and LLMOps capabilities, including experiment tracking, model registry, prompt management, versioning, CI/CD, automated testing, deployment automation, monitoring, governance, and release controls
  • Design scalable data quality, validation, security, privacy, access control, logging, governance, and interoperability capabilities across hybrid cloud environments
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer, Search Assistant

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...
Location
Location
United States , San Jose
Salary
Salary:
361300.00 - 510000.00 USD / Year
roku.com Logo
Roku
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of industry experience (or PhD with 5+ years) applying ML at scale in search, recommendation, ads, personalization, or related domains
  • Strong expertise in ranking systems, recommendation systems, retrieval, personalization, and multi-objective optimization
  • Experience building large-scale ML systems leveraging deep learning, sequence models, LLMs, reinforcement learning, or bandit frameworks
  • Strong product intuition and experience optimizing user engagement, retention, and monetization simultaneously
  • Proficiency in Python, Java, or Scala
  • Experience with distributed systems and ML infrastructure such as Spark, Airflow, streaming systems, feature stores, and cloud platforms
  • Strong technical leadership, system design, communication, and problem-solving skills
  • MS or PhD in Computer Science, Statistics, or a related field
Job Responsibility
Job Responsibility
  • Lead the technical vision and roadmap for ranking, personalization, and recommendation systems powering Roku’s entertainment assistant
  • Develop and deploy state-of-the-art ML models using deep learning, transformers, LLMs, bandits, reinforcement learning, and causal inference techniques
  • Build multi-objective optimization systems balancing engagement, retention, relevance, and monetization goals
  • Drive innovation in conversational discovery, contextual recommendations, and personalized content experiences across the platform
  • Design, run, and analyze online A/B experiments tied to key product and business KPIs
  • Architect scalable ML systems, feature platforms, and data pipelines supporting rapid experimentation and long-term growth
  • Mentor engineers and provide technical leadership across cross-functional initiatives involving engineering, product, UX, and analytics teams
What we offer
What we offer
  • Health insurance
  • Equity awards
  • Life insurance
  • Disability benefits
  • Parental leave
  • Wellness benefits
  • Paid time off
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Fulltime
Read More
Arrow Right