Machine Learning Data Engineer - Systems & Retrieval

Zyphra

Location:
United States, Palo Alto

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets, ranging from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and in enabling scalable training and inference with clean, accessible, and secure data. Your work will affect both research and product teams by shaping the data foundation that intelligent systems are trained on, retrieve from, and reason over.

Job Responsibility:

  • Designing and implementing distributed data ingestion and transformation pipelines
  • Building retrieval and indexing systems that support RAG and other LLM-based methods
  • Mining and organizing large unstructured datasets, both in research and production environments
  • Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability
  • Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements:

  • Strong software engineering background with fluency in Python
  • Experience designing, building, and maintaining data pipelines in production environments
  • Deep understanding of data structures, storage formats, and distributed data systems
  • Familiarity with indexing and retrieval techniques for large-scale document corpora
  • Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics
  • Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)
  • Excellent debugging, observability, and logging practices to support reliability at scale
  • Strong communication skills and experience collaborating across ML, infra, and product teams

Nice to have:

  • Experience building or maintaining LLM-integrated retrieval systems (e.g., RAG pipelines)
  • Academic or industry background in data mining, search, recommendation systems, or IR literature
  • Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar
  • Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval
  • Understanding of data validation and quality assurance in machine learning workflows
  • Experience working on cross-functional infra and MLOps teams
  • Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops
  • Comfort working across raw, unstructured data, structured databases, and model-ready formats

What we offer:

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team
  • Thursday Happy Hours

Additional Information:

Job Posted:
January 13, 2026

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Machine Learning Data Engineer - Systems & Retrieval

Senior Machine Learning Systems Engineer

Our organization drives AI innovation across Jira products. We deliver seamless ...
Salary:
Not provided
Atlassian
Expiration Date
Until further notice
Requirements
  • Extensive experience building Machine Learning and AI solutions (4+ years)
  • Proven experience developing, deploying, and maintaining end-to-end ML systems, including data engineering, model serving, and monitoring
  • Expert proficiency with GenAI frameworks and tools, including developing and fine-tuning large language models (LLMs) and building retrieval-augmented generation (RAG) systems
  • Expert proficiency in Python and ML frameworks like PyTorch, TensorFlow, or JAX
  • Experience implementing MLOps, CI/CD pipelines, and automation for continuous training, deployment, and monitoring of ML models
Job Responsibility
  • Collaborate with software engineers, data scientists, and product managers to solve complex problems
  • Lead projects from technical design through launch
  • Partner with teams to achieve impactful results
  • Deliver robust ML solutions to build AI features reaching millions, including curating ML datasets, fine-tuning open-source LLMs, or accessing proprietary LLMs
  • Mentor junior members of the team
What we offer
  • Health and wellbeing resources
  • Paid volunteer days

Senior Machine Learning Engineer

We’re seeking a Senior Machine Learning Engineer (P50) to join our new GenAI Mod...
Location
Singapore
Salary:
Not provided
Atlassian
Expiration Date
Until further notice
Requirements
  • Extensive experience (generally 5+ years) in ML systems engineering, backend engineering, or infrastructure roles
  • Strong background in one or more of: LLMs, NLP, search/retrieval, embeddings, or applied ML
  • Hands-on experience with at least one GenAI area: RAG pipelines, fine-tuning, hybrid retrieval, or orchestration frameworks
  • Proficiency with modern ML frameworks (PyTorch, TensorFlow, Hugging Face, LangChain, LlamaIndex)
  • Familiarity with vector databases (Weaviate, Pinecone, FAISS, etc.) and large-scale serving infra
  • Strong coding skills (Python, backend engineering) and ability to move fast from idea to prototype
  • Comfort working in fast-paced, experimental environments with evolving direction
  • Bachelor’s or Master’s in Computer Science, Machine Learning, or related field—or equivalent experience
Job Responsibility
  • Build and apply advanced GenAI models
  • Develop and fine-tune LLMs and embeddings for Atlassian’s unique knowledge and enterprise data
  • Implement retrieval-augmented generation (RAG), hybrid retrieval, and knowledge-grounded modeling approaches
  • Work hands-on with modern frameworks, contributing directly to high-value prototypes and experiments
  • Prototype and experiment quickly
  • Build proof-of-concept systems for GenAI-powered assistants, agentic workflows, and innovative user experiences
  • Run experiments, collect feedback, and iterate fast to validate impact
  • Design and implement evaluation methods for quality, groundedness, and user value
  • Collaborate and contribute
  • Work closely with peers across ML, engineering, and product teams to bring new ideas to life
What we offer
  • Health and wellbeing resources
  • Paid volunteer days

Autonomous Systems Data Mining Engineer

We are seeking a Data Engineer with a systems mindset to own and simplify access...
Location
United States, San Mateo
Salary:
170000.00 - 240000.00 USD / Year
Skydio
Expiration Date
Until further notice
Requirements
  • 3+ years of experience in data engineering, backend engineering, or infrastructure roles
  • Exposure to robotics, autonomy, or real-world sensor data pipelines
  • Strong proficiency in Python (or similar language) and SQL
  • Experience designing scalable data pipelines with tools such as Apache Spark, Airflow, dbt, or equivalent
  • Familiarity with log processing, time-series analysis, or working with large volumes of semi-structured data
  • Ability to work cross-functionally with ML engineers, autonomy engineers, and product stakeholders
  • Systems thinking: you enjoy untangling complexity and designing elegant abstractions that empower others
Job Responsibility
  • Design systems to unify scattered data sources (logs, telemetry, analytics tables, media, etc.) into easily discoverable and queryable formats
  • Enable efficient curation of machine learning datasets by tagging, indexing, and filtering for relevant scenarios (e.g., environmental conditions, sensor behavior, scene attributes)
  • Build tools to automatically surface anomalies, regressions, or key signatures in logs and telemetry data (e.g., CPU usage spikes, sensor noise, degraded conditions)
  • Develop mechanisms to rapidly compare releases and surface regressions in performance metrics, resource usage, and data quality
  • Architect and maintain scalable data pipelines and services to index, enrich, and query multimodal autonomy data (e.g., time series, media, tabular analytics)
  • Collaborate with autonomy and ML teams to understand data usage patterns and build tools that streamline their workflows
  • Develop efficient methods for search, tagging, and filtering over structured and unstructured data
  • Help design systems to label and retrieve rare or complex scenarios, both automatically at ingestion and via manual search
  • Build dashboards and visualizations to support release monitoring and anomaly detection across a variety of system health signals
What we offer
  • Equity in the form of stock options
  • Comprehensive benefits packages
  • Relocation assistance may also be provided for eligible roles
  • Paid vacation time
  • Sick leave
  • Holiday pay
  • 401K savings plan
  • Group health insurance plans
  • Full-time

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...
Location
United States, New York
Salary:
186000.00 - 236400.00 USD / Year
Plaid
Expiration Date
Until further notice
Requirements
  • 1-3 years of experience training, deploying, and scaling ML/AI models in production environments
  • Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
  • Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
  • Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
  • Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes
Job Responsibility
  • Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
  • Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
  • Developing new AI applications that enable innovative product experiences across fintech
  • Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
  • Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
  • Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
  • Full-time

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...
Location
United States, San Francisco
Salary:
186000.00 - 236400.00 USD / Year
Plaid
Expiration Date
Until further notice
Requirements
  • 1-3 years of experience training, deploying, and scaling ML/AI models in production environments
  • Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
  • Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
  • Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
  • Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes
Job Responsibility
  • Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
  • Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
  • Developing new AI applications that enable innovative product experiences across fintech
  • Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
  • Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
  • Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact
What we offer
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
  • Full-time

Machine Learning Research Engineer

You will be part of Kiddom’s Data Science team, building the foundation of our s...
Location
United States, San Francisco; New York
Salary:
175000.00 - 250000.00 USD / Year
Kiddom
Expiration Date
Until further notice
Requirements
  • Have 5+ years of industry experience applying machine learning to solve real-world problems with large, complex datasets, with 1–2 years in a technical leadership role
  • Proven track record designing, evaluating, and deploying ML/AI systems in production environments that drive measurable business impact, ideally in recommendation, personalization, search, or workflow optimization
  • Strong programming skills in Python and fluency in data manipulation (SQL, Pandas) and common ML toolkits (scikit-learn, XGBoost, TensorFlow/PyTorch)
  • Strong analytical skills and ability to break down complex problems into measurable hypotheses and experiments
  • Excellent communication skills with a history of cross-functional collaboration with product, design, and engineering stakeholders
Job Responsibility
  • Architect and scale machine learning systems for search, personalization, and recommendations that power Kiddom’s teacher helper and insight engine
  • Develop evaluation-first development workflows to measure how models improve teaching efficiency, lesson planning, and student learning outcomes
  • Fine-tune machine learning models with feedback signals from teachers and students to align outputs with instructional goals and classroom needs
  • Design intelligent discovery pipelines that combine semantic retrieval, curriculum alignment, and real-time personalization
  • Build agentic assistants that help teachers plan lessons, adapt instruction, and reduce repetitive tasks
  • Collaborate closely with product managers, designers, and curriculum experts to translate high-level educational goals into scalable ML-powered systems
  • Coach and mentor junior ML engineers and data scientists, fostering technical and professional growth
What we offer
  • Competitive salary
  • Meaningful equity
  • Health insurance benefits: medical (various PPO/HMO/HSA plans), dental, vision, disability and life insurance
  • One Medical membership (in participating locations)
  • Flexible vacation time policy (subject to internal approval). Average use 4 weeks off per year
  • 10 paid sick days per year (prorated depending on start date)
  • Paid holidays
  • Paid bereavement leave
  • Paid family leave after birth/adoption. Minimum of 16 paid weeks for birthing parents, 10 weeks for caretaker parents. Meant to supplement benefits offered by State
  • Commuter and FSA plans
  • Full-time

Machine Learning Engineer

Influur is redefining how advertising works — through creators, data, and AI. Ou...
Location
United States, Miami
Salary:
200000.00 USD / Year
Influur
Expiration Date
Until further notice
Requirements
  • Strong experience designing, building, and maintaining end-to-end machine learning systems in production
  • Deep understanding of ML algorithms, embeddings, retrieval systems, and evaluation methodologies
  • Strong experience with large language models (LLMs), fine-tuning, inference optimization, and agent frameworks
  • Expertise in ML infrastructure, including feature stores, vector databases, model serving, and real-time inference pipelines
  • Strong Python skills and experience with PyTorch, TensorFlow, FastAPI, NumPy, scikit-learn, and data processing frameworks
  • Experience with scalable data pipelines (batch + streaming), including tools like Spark, Kafka, or similar
  • Experience implementing ML solutions such as recommendation engines, ranking models, and personalization systems
  • Solid understanding of statistical analysis (A/B testing, experimentation, causal inference)
  • Ability to work closely with engineering teams to productionize ML models with reliability, monitoring, and CI/CD best practices
What we offer
  • Competitive equity in a venture-backed company shaping the future of music influencer marketing
  • A seat at the table as we redefine how the most iconic record labels, artists, and brands go viral
  • Access to elite tools, AI copilots, and a team that builds daily at top speed
  • Hybrid flexibility + top-tier health benefits
  • Full-time

Machine Learning Engineer

Influur is redefining how advertising works — through creators, data, and AI. Ou...
Salary:
200000.00 USD / Year
Influur
Expiration Date
Until further notice
Requirements
  • Strong experience designing, building, and maintaining end-to-end machine learning systems in production
  • Deep understanding of ML algorithms, embeddings, retrieval systems, and evaluation methodologies
  • Strong experience with large language models (LLMs), fine-tuning, inference optimization, and agent frameworks
  • Expertise in ML infrastructure, including feature stores, vector databases, model serving, and real-time inference pipelines
  • Strong Python skills and experience with PyTorch, TensorFlow, FastAPI, NumPy, scikit-learn, and data processing frameworks
  • Experience with scalable data pipelines (batch + streaming), including tools like Spark, Kafka, or similar
  • Experience implementing ML solutions such as recommendation engines, ranking models, and personalization systems
  • Solid understanding of statistical analysis (A/B testing, experimentation, causal inference)
  • Ability to work closely with engineering teams to productionize ML models with reliability, monitoring, and CI/CD best practices
What we offer
  • Competitive equity in a venture-backed company shaping the future of music influencer marketing
  • A seat at the table as we redefine how the most iconic record labels, artists, and brands go viral (think Bad Bunny) — with our tech, support, and strategic guidance
  • Access to elite tools, AI copilots, and a team that builds daily at top speed
  • Hybrid flexibility + top-tier health benefits
  • Full-time