Machine Learning Infra Engineer Job at Reducto (San Francisco)

Senior Machine Learning System Engineer

As a Senior ML System Engineer on the AI & ML Platform team, you will play a piv...

Location

United States, Seattle; San Francisco; New York; Austin

Salary:

165500.00 - 265800.00 USD / Year

Atlassian

Expiration Date

Until further notice

Requirements

Experience in building machine learning systems or ML infra / MLOps platform
Fluency in at least one modern object-oriented programming language (preferably Java/Kotlin and Python)
Experience with RESTful microservices
Experience using cloud tools such as Amazon Web Services (S3, Kinesis, CloudFormation, EKS, AWS Security and Networking)
Experience with Continuous Delivery and Continuous Integration

Job Responsibility

Collaborate with your teammates to solve complex problems, from technical design to launch
Deliver cutting-edge solutions that are used by other Atlassian teams and products to build AI features that reach millions of customers
Deliver code reviews, documentation & bug fixes within a strong engineering culture
Partner across engineering teams to take on company-wide initiatives spanning multiple projects
Mentor junior members of the team

What we offer

health and wellbeing resources
paid volunteer days

Fulltime

Engineering Manager - Machine Learning Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...

Location

United States, San Francisco

Salary:

241200.00 - 400000.00 USD / Year

Plaid

Expiration Date

Until further notice

Requirements

8–10 years of experience in ML infrastructure, including direct hands-on experience as an engineer (IC/TL)
2+ years of experience managing infrastructure or ML platform engineers
Proven experience delivering and operating ML or AI infrastructure at scale
Solid technical depth across ML/AI infrastructure domains (e.g., feature stores, pipelines, deployment, inference, observability)
Demonstrated ability to drive execution on complex technical projects with cross-team stakeholders
Strong communication and stakeholder management skills

Job Responsibility

Lead and support the ML Infra team, driving project execution and ensuring delivery on key commitments
Build and launch Plaid’s next-generation feature store to improve reliability and velocity of model development
Define and drive adoption of an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
Ensure operational excellence of ML pipelines, deployment tooling, and inference systems
Partner with ML product teams to understand requirements and deliver solutions that accelerate model development and iteration
Recruit, mentor, and develop engineers, fostering a collaborative and high-performing team culture

What we offer

medical
dental
vision
401(k)
equity
commission

Fulltime

Senior Machine Learning Engineer

Start.io, a leading mobile marketing and audience platform, empowers the app eco...

Location

Salary:

Not provided

Start.io

Expiration Date

Until further notice

Requirements

B.Sc. or M.Sc. in Computer Science, Software Engineering, or a related technical discipline
5+ years of experience building high-performance backend or ML inference systems
Deep expertise in Python and experience with low-latency APIs and real-time serving frameworks (e.g., FastAPI, Triton Inference Server, TorchServe, BentoML)
Experience with scalable service architecture, message queues (Kafka, Pub/Sub), and async processing
Strong understanding of model deployment practices, online/offline feature parity, and real-time monitoring
Experience in cloud environments (AWS, GCP, or OCI) and container orchestration (Kubernetes)
Experience working with in-memory and NoSQL databases (e.g. Aerospike, Redis, Bigtable) to support ultra-fast data access in production-grade ML services
Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and best practices for alerting and diagnostics
A strong sense of ownership and the ability to drive solutions end-to-end
Passion for performance, clean architecture, and impactful systems

Job Responsibility

Own and lead the design and development of low-latency Algo inference services handling billions of requests per day
Build and scale robust real-time decision-making engines, integrating ML models with business logic under strict SLAs
Collaborate closely with DS to deploy models seamlessly and reliably in production
Design systems for model versioning, shadowing, and A/B testing at runtime
Ensure high availability, scalability, and observability of production systems
Continuously optimize latency, throughput, and cost-efficiency using modern tooling and techniques
Work independently while interfacing with cross-functional stakeholders from Algo, Infra, Product, Engineering, BA & Business

What we offer

Lead the mission-critical inference engine that drives our core product
Join a high-caliber Algo group solving real-time, large-scale, high-stakes problems
Work on systems where every millisecond matters, and every decision drives real value
Enjoy a fast-paced, collaborative, and empowered culture with full ownership of your domain

Senior Machine Learning Engineer

We’re seeking a Senior Machine Learning Engineer (P50) to join our new GenAI Mod...

Location

Singapore

Salary:

Not provided

Atlassian

Expiration Date

Until further notice

Requirements

Extensive experience (generally 5+ years) in ML systems engineering, backend engineering, or infrastructure roles
Strong background in one or more of: LLMs, NLP, search/retrieval, embeddings, or applied ML
Hands-on experience with at least one GenAI area: RAG pipelines, fine-tuning, hybrid retrieval, or orchestration frameworks
Proficiency with modern ML frameworks (PyTorch, TensorFlow, Hugging Face, LangChain, LlamaIndex)
Familiarity with vector databases (Weaviate, Pinecone, FAISS, etc.) and large-scale serving infra
Strong coding skills (Python, backend engineering) and ability to move fast from idea to prototype
Comfort working in fast-paced, experimental environments with evolving direction
Bachelor’s or Master’s in Computer Science, Machine Learning, or related field—or equivalent experience

Job Responsibility

Build and apply advanced GenAI models
Develop and fine-tune LLMs and embeddings for Atlassian’s unique knowledge and enterprise data
Implement retrieval-augmented generation (RAG), hybrid retrieval, and knowledge-grounded modeling approaches
Work hands-on with modern frameworks, contributing directly to high-value prototypes and experiments
Prototype and experiment quickly
Build proof-of-concept systems for GenAI-powered assistants, agentic workflows, and innovative user experiences
Run experiments, collect feedback, and iterate fast to validate impact
Design and implement evaluation methods for quality, groundedness, and user value
Collaborate and contribute
Work closely with peers across ML, engineering, and product teams to bring new ideas to life

What we offer

Health and wellbeing resources
Paid volunteer days

Senior Machine Learning Engineering Manager, Gen AI

We're seeking a Senior Machine Learning Manager (M60) to lead a cross-functional...

Location

United States

Salary:

193500.00 - 303150.00 USD / Year

Atlassian

Expiration Date

Until further notice

Requirements

8+ years in ML, search, or backend engineering roles, with 3+ years leading teams
Strong track record of shipping ML-powered or LLM-integrated user-facing products
Experience with RAG systems (vector search, hybrid retrieval, LLM orchestration)
Deep experience in either modeling (e.g., LLMs, search, NLP) or engineering (e.g., backend infra, full-stack), with the ability to lead end-to-end
Deep understanding of LLM ecosystems (OpenAI, Claude, Mistral, OSS), orchestration frameworks (LangChain, LlamaIndex), and vector databases (Weaviate, Pinecone, FAISS, etc.)
Strong product intuition and ability to translate complex tech into valuable user features
Familiarity with GenAI evaluation methods: hallucination detection, groundedness scoring, and human-in-the-loop feedback loops
Master’s or PhD in Computer Science, Machine Learning, or related field preferred—or equivalent practical experience

Job Responsibility

Lead the vision, design, and execution of LLM-powered AI products, leveraging advanced AI modeling (e.g. SLM post-training/fine-tuning), RAG architectures, and hybrid ranking systems
Define system architecture across retrievers, rankers, orchestration layers, prompt templates, and feedback mechanisms
Work closely with product and design teams to ensure delightful, fast, and grounded user experiences
Build and manage a cross-disciplinary team including ML engineers, backend/frontend engineers, and applied scientists
Foster a culture of E2E ownership — empowering the team to move from prototype to production quickly and iteratively
Mentor individuals to grow in both technical depth and product acumen
Shape the technical roadmap and long-term strategy for GenAI search across Atlassian’s product suite
Partner with platform and infra teams to scale inference, evaluate performance, and integrate usage signals for continuous improvement
Champion data quality, grounding, and responsible AI practices in all deployed features

What we offer

health and wellbeing resources
paid volunteer days

Fulltime

Senior Software Engineer – ML Model Compliance & Automation

We are seeking a highly skilled and motivated Senior Software Engineer to lead t...

Location

India, Jaipur

Salary:

Not provided

InfoObjects

Expiration Date

Until further notice

Requirements

Experience required: 3–7 years
GoLang (preferred)
Python (preferred)
Bash
MLOps Tools: KitOps, MLModelCI, MLflow, ONNX, TensorFlow, PyTorch, Docker
SBOM & Security: Syft, Grype, Trivy, CycloneDX, SPDX
CI/CD: GitHub Actions, GitLab CI, Jenkins, ArgoCD
Infra: Kubernetes, Docker, Helm, Terraform
Cloud: AWS, GCP, Azure (EKS/GKE/ECS preferred)
Version Control: Git, GitOps

Job Responsibility

Model Packaging & Artifact Management: Design and implement workflows for packaging ML models using KitOps, ONNX, MLflow, or TensorFlow SavedModel
Manage model artifact versioning, registries, and reproducibility
Ensure artifact integrity, consistency, and traceability across CI/CD pipelines
Model Profiling & Optimization: Automate model profiling (latency, size, ops) using MLModelCI, TorchServe, or ONNX Runtime
Apply quantization, pruning, and format conversions (e.g., FP32→INT8) for optimization
Embed profiling and optimization checks into CI/CD pipelines to assess deployment readiness
Compliance & SBOM Generation: Develop pipelines to generate and validate SBOMs for ML models
Implement compliance checks for licensing, vulnerabilities, and security using CycloneDX, SPDX, Syft, or Trivy
Validate schema, dependencies, and runtime environments for production readiness
Cloud Integration & Deployment: Automate model registration, endpoint creation, and monitoring setup in AWS/GCP/Azure

Fulltime

Research Engineering Manager, Post-Training

Meta is seeking a Research Engineering Manager to lead the Post-Training team wi...

Location

United States, Menlo Park

Salary:

219000.00 - 301000.00 USD / Year

Research Engineering Manager, Evaluations, Meta Superintelligence Labs

Meta is seeking a Research Engineering Manager to lead the Evaluations team with...

Location

United States, Menlo Park

Salary:

219000.00 - 301000.00 USD / Year

Machine Learning Infra Engineer

Reducto

Location:
United States, San Francisco

Category:
IT - Software Development

Contract Type:
Employment contract

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
May 17, 2026
