Machine Learning Data Engineer - Systems & Retrieval Job at Zyphra (Palo Alto)

Staff Applied Machine Learning Engineer - Intelligent Data, Signals & Systems

As a Staff Applied Machine Learning Engineer focused on Intelligent Data, Signal...

Location

United States , Bay Area

Salary:

276800.00 - 415200.00 USD / Year

Cash App

Expiration Date

Until further notice

Requirements

12+ years building and operating production software and ML systems for business-critical products
Deep expertise in intelligent systems such as ranking/retrieval, recommendations, search, personalization, growth and lifecycle ML, customer intelligence, propensity/churn/LTV, next-best-action, or model-derived risk signals
Strong production ML judgment across feature pipelines, model serving, experimentation, monitoring, feedback loops, online/offline consistency, and reliable signal interfaces
Ability to evaluate impact beyond short-term conversion, including trust, fairness, access, risk, compliance, and long-term engagement
Experience using AI-assisted engineering tools with appropriate verification, testing, and review for customer-impacting systems.

Job Responsibility

Build and operate production ML systems that turn customer and product context into trusted signals, rankings, recommendations, and decision capabilities
Design production data and signal contracts that define intended use, freshness, provenance, confidence, eligibility, and calibration for downstream consumers
Own ranking, retrieval, recommendation, search, propensity, and next-best-action systems end to end, from feature and candidate generation through serving, experimentation, monitoring, and feedback loops
Evaluate customer and business impact beyond short-term conversion, including trust, fairness, access, risk, compliance, long-term engagement, and segment-level performance
Partner across product, growth, data, platform, modeling, risk, and compliance to translate ambiguous goals into measurable ML system designs
Use AI and agents to accelerate development, analysis, testing, documentation, and operations while exposing reusable capabilities to product services, internal tools, and AI-assisted workflows.

What we offer

Remote work
Medical insurance
Flexible time off
Retirement savings plans
Modern family planning

Fulltime

Staff Machine Learning Engineer - Data Intelligence

We're building the platform that makes AI possible across Culture Amp. This is a...

Location

Australia , Melbourne

Salary:

Not provided

Culture Amp

Expiration Date

Until further notice

Requirements

Strong platform engineering fundamentals, with experience building and operating services that other teams depend on
Expertise in Python and experience with ML tooling and infrastructure
Deep experience with large-scale data systems (streaming, batch processing, data lakes)
Proven experience with ML infrastructure: model serving, vector databases, embedding pipelines
Strong understanding of cloud platforms (AWS preferred) and backend architecture
Experience building and optimising RAG and retrieval systems
The communication skills to work across teams and influence technical direction beyond your own team

Job Responsibility

Designing and operating the platform services that power AI features across Culture Amp, including inference pipelines, embedding storage, and retrieval systems
Building a scalable approach to vector search across diverse categories of unstructured data (survey responses, performance feedback, company documents)
Driving MLOps and LLMOps practices across the organisation, including observability, cost management, and reliability
Ensuring AI is used responsibly: implementing guardrails, security controls, and data compliance measures
Partnering with data scientists on the team to productionise models and evaluate new AI capabilities

What we offer

Employee Share Options Program
Programs, coaching, and budgets to help you thrive personally and professionally
Access to external providers for mental wellbeing and coaching support
Monthly Camper Life Allowance
Team budgets dedicated to team building activities and connection
Intentional quarterly wellbeing pauses
Extended year-end breaks
Excellent parental leave and in work support program available from day 1
5 Social Impact Days a year
MacBooks for you to do your best & a work from home office budget

Staff Machine Learning Engineer - Data Intelligence

We're building the platform that makes AI possible across Culture Amp. This is a...

Location

Australia , Sydney

Salary:

Not provided

Culture Amp

Expiration Date

Until further notice

Requirements

Strong platform engineering fundamentals, with experience building and operating services that other teams depend on
Expertise in Python and experience with ML tooling and infrastructure
Deep experience with large-scale data systems (streaming, batch processing, data lakes)
Proven experience with ML infrastructure: model serving, vector databases, embedding pipelines
Strong understanding of cloud platforms (AWS preferred) and backend architecture
Experience building and optimising RAG and retrieval systems
The communication skills to work across teams and influence technical direction beyond your own team

Job Responsibility

Designing and operating the platform services that power AI features across Culture Amp, including inference pipelines, embedding storage, and retrieval systems
Building a scalable approach to vector search across diverse categories of unstructured data (survey responses, performance feedback, company documents)
Driving MLOps and LLMOps practices across the organisation, including observability, cost management, and reliability
Ensuring AI is used responsibly: implementing guardrails, security controls, and data compliance measures
Partnering with data scientists on the team to productionise models and evaluate new AI capabilities

What we offer

Employee Share Options Program
Programs, coaching, and budgets to help you thrive personally and professionally
Access to external providers for mental wellbeing and coaching support
Monthly Camper Life Allowance
Team budgets dedicated to team building activities and connection
Intentional quarterly wellbeing pauses
Extended year-end breaks
Excellent parental leave and in work support program available from day 1
5 Social Impact Days a year
MacBooks for you to do your best & a work from home office budget

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...

Location

United States , San Francisco

Salary:

186000.00 - 236400.00 USD / Year

Plaid

Expiration Date

Until further notice

Requirements

1-3 years of experience training, deploying, and scaling ML/AI models in production environments
Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes

Job Responsibility

Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
Developing new AI applications that enable innovative product experiences across fintech
Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact

What we offer

medical
dental
vision
401(k)
equity
commission

Fulltime

Machine Learning Engineer - Data Foundation and AI

You’ll be a machine learning engineer on the Data Foundation & AI team. In this ...

Location

United States , New York

Salary:

186000.00 - 236400.00 USD / Year

Plaid

Expiration Date

Until further notice

Requirements

1-3 years of experience training, deploying, and scaling ML/AI models in production environments
Strong experience with distributed systems and ML operations — from large-scale training to low-latency serving and monitoring
Proficiency in Python and modern ML frameworks (e.g., PyTorch), with the ability to implement and optimize complex models
Hands-on experience building or scaling ML/AI infrastructure, pipelines, or reusable platforms that support multiple teams
Curiosity and drive to experiment with advanced AI techniques (e.g., embeddings, retrieval, generative modeling) while staying grounded in production impact
Ability to thrive in a collaborative environment, working with both technical and non-technical partners to drive measurable outcomes

Job Responsibility

Building and scaling advanced ML/AI systems that power core Plaid products and applications used by millions of consumers
Driving impact at scale by improving distributed training, serving, and ML operations to make Plaid’s AI capabilities faster, more reliable, and more widely available
Developing new AI applications that enable innovative product experiences across fintech
Tackling 0 to 1 problems where you explore new approaches, as well as scaling 1 to 10 systems for reliability and efficiency
Collaborating with some of the strongest MLEs at Plaid in a high-ownership, bottom-up driven team
Experimenting with cutting-edge ML and AI techniques while balancing practical productionization and measurable business impact

What we offer

medical
dental
vision
401(k)
equity
commission

Fulltime

Senior Machine Learning Systems Engineer

Our organization drives AI innovation across Jira products. We deliver seamless ...

Location

Salary:

Not provided

Atlassian

Expiration Date

Until further notice

Requirements

Extensive experience building Machine Learning and AI solutions (4+ years)
Proven experience developing, deploying, and maintaining end-to-end ML systems, including data engineering, model serving, and monitoring
Expert proficiency with GenAI frameworks and tools, including developing and fine-tuning large language models (LLMs) and building retrieval-augmented generation (RAG) systems
Expert proficiency in Python and ML frameworks like PyTorch, TensorFlow, or JAX
Experience implementing MLOps, CI/CD pipelines, and automation for continuous training, deployment, and monitoring of ML models

Job Responsibility

Collaborate with software engineers, data scientists, and product managers to solve complex problems
Lead projects from technical design through launch
Partner with teams to achieve impactful results
Deliver robust ML solutions to build AI features reaching millions
This includes curating ML datasets, fine-tuning open-source LLMs, or accessing proprietary LLMs
Mentor junior members of the team

What we offer

Health and wellbeing resources
Paid volunteer days

Sr Machine Learning Engineer

Let’s do this. Let’s change the world. We are looking for a highly motivated exp...

Location

India , Hyderabad

Salary:

Not provided

Amgen

Expiration Date

Until further notice

Requirements

Hands-on experience with AWS, Databricks, Apache Spark, PySpark, SparkSQL, Python, and SQL for large-scale data engineering
Strong proficiency in workflow orchestration, Spark performance tuning, and scalable batch and streaming data pipeline development
Experience with real-time data processing and integration using Apache Kafka, Debezium, or similar streaming technologies
Hands-on experience with MLOps tools and practices, including MLflow, model serving, feature stores, experiment tracking, deployment, and lifecycle management
Experience with GenAI engineering practices, including prompt engineering, LLM evaluation, AI observability, agentic workflows, and knowledge graphs
Ability to design and develop APIs or service interfaces for data, ML, and GenAI application integration
Experience with Agile/SAFe delivery models, DevOps practices, CI/CD concepts, and cross-functional team collaboration
Strong analytical, problem-solving, debugging, communication, and teamwork skills
Ability to quickly learn, adapt, and apply emerging technologies across data, ML, and AI engineering
Doctorate degree / Master's degree / Bachelor's degree and 8 to 13 years of experience years of experience in Computer Science, IT or related field

Job Responsibility

Design, deploy, monitor, and optimize production-grade ML and Generative AI applications for AI-enabled manufacturing solutions
Define technical architecture, engineering standards, and best practices across data engineering, ML, GenAI, analytics, and platform capabilities
Partner with business stakeholders, product owners, and cross-functional teams to translate manufacturing challenges into secure, scalable, production-ready AI and data solutions
Design, develop, and maintain complex ETL/ELT pipelines in Databricks using PySpark, Scala, and SQL for large-scale structured and unstructured data processing
Build efficient ingestion, transformation, migration, and deployment pipelines across databases, APIs, logs, event streams, images, PDFs, documents, and third-party platforms
Design and implement GenAI solutions including RAG, embeddings, vector databases, agentic workflows, tool-calling systems, LLM orchestration, serving optimization, knowledge graphs, and metadata-driven retrieval
Build GenAI applications using frameworks and platforms such as LangChain, LangGraph, LlamaIndex, DSPy, OpenAI APIs, Amazon Bedrock, or equivalent technologies
Develop evaluation and observability frameworks to monitor model quality, hallucination rates, drift, retrieval effectiveness, latency, token usage, cost, reliability, operational health, and business impact
Build and maintain MLOps and LLMOps capabilities, including experiment tracking, model registry, prompt management, versioning, CI/CD, automated testing, deployment automation, monitoring, governance, and release controls
Design scalable data quality, validation, security, privacy, access control, logging, governance, and interoperability capabilities across hybrid cloud environments

Fulltime

Senior Machine Learning Engineer, Search Assistant

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...

Location

United States , San Jose

Salary:

361300.00 - 510000.00 USD / Year

Roku

Expiration Date

Until further notice

Requirements

8+ years of industry experience (or PhD with 5+ years) applying ML at scale in search, recommendation, ads, personalization, or related domains
Strong expertise in ranking systems, recommendation systems, retrieval, personalization, and multi-objective optimization
Experience building large-scale ML systems leveraging deep learning, sequence models, LLMs, reinforcement learning, or bandit frameworks
Strong product intuition and experience optimizing user engagement, retention, and monetization simultaneously
Proficiency in Python, Java, or Scala
Experience with distributed systems and ML infrastructure such as Spark, Airflow, streaming systems, feature stores, and cloud platforms
Strong technical leadership, system design, communication, and problem-solving skills
MS or PhD in Computer Science, Statistics, or a related field

Job Responsibility

Lead the technical vision and roadmap for ranking, personalization, and recommendation systems powering Roku’s entertainment assistant
Develop and deploy state-of-the-art ML models using deep learning, transformers, LLMs, bandits, reinforcement learning, and causal inference techniques
Build multi-objective optimization systems balancing engagement, retention, relevance, and monetization goals
Drive innovation in conversational discovery, contextual recommendations, and personalized content experiences across the platform
Design, run, and analyze online A/B experiments tied to key product and business KPIs
Architect scalable ML systems, feature platforms, and data pipelines supporting rapid experimentation and long-term growth
Mentor engineers and provide technical leadership across cross-functional initiatives involving engineering, product, UX, and analytics teams

What we offer

Health insurance
Equity awards
Life insurance
Disability benefits
Parental leave
Wellness benefits
Paid time off
Global access to mental health and financial wellness support and resources
Healthcare (medical, dental, and vision)
Life, accident, disability, commuter, and retirement options (401(k)/pension)

Fulltime

Select Country

Machine Learning Data Engineer - Systems & Retrieval

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?