CrawlJobs Logo

Staff / Principal Machine Learning Engineer, Serving

inworld.ai Logo

Inworld AI

Location Icon

Location:
Switzerland

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

Not provided

Job Description:

Inworld is a product-oriented research lab of top AI researchers and engineers, developing best-in-class realtime multimodal models and the only realtime orchestration platform optimized for thousands of queries per second. We’ve raised more than $125M from Lightspeed, Section 32, Kleiner Perkins, Microsoft’s M12 venture fund, Founders Fund, Meta and Stanford, among others. Our technology has powered experiences from companies such as NVIDIA, Microsoft Xbox, Niantic, Logitech Streamlabs, Wishroll, Little Umbrella and Bible Chat. We’ve also been recognized by CB Insights as one of the 100 most promising AI companies globally and have been named one of LinkedIn's Top 10 Startups in the USA. A year ago, reliably working agentic systems and sub-second multimodal inference at scale barely existed. Nobody has a decade of experience here. So we're not screening for a resume template — we're looking for strong people from varied backgrounds who learn fast, thrive in ambiguity, and can show us what they've built, broken, and understood.

Requirements:

  • Inference Optimization. Deep understanding of modern serving frameworks and techniques like vLLM or TRT-LLM
  • Model Acceleration. Hands-on experience with quantization, distillation, caching strategies , continuous batching, paged attention, and speculative decoding
  • High-Performance Systems. Proficiency in C++, CUDA, Rust, or highly optimized Python. You know how to profile code and squeeze every ounce of performance out of NVIDIA GPUs
  • Distributed Systems & Scaling. Experience with Kubernetes, Ray, custom load balancing, multi-GPU/multi-node inference, and reliably handling thousands of concurrent connections
  • Public work. Non-trivial systems programming projects, open-source contributions to major inference engines, or deep-dive technical write-ups
  • Full-cycle ownership. You can take a model from the research team, containerize it, optimize its serving, and ensure it runs reliably in production
  • Background. PhD in CS, Physics, Math, or equivalent practical experience building backend or ML systems
  • Professional fluency in English (written and spoken) is required, as you will be collaborating daily with our US-based leadership and engineering teams

Additional Information:

Job Posted:
April 23, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff / Principal Machine Learning Engineer, Serving

New

Staff / Principal Machine Learning Engineer, Serving

A year ago, reliably working agentic systems and sub-second multimodal inference...
Location
Location
United States , Mountain View
Salary
Salary:
270000.00 - 500000.00 USD / Year
inworld.ai Logo
Inworld AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Inference Optimization. Deep understanding of modern serving frameworks and techniques like vLLM or TRT-LLM
  • Model Acceleration. Hands-on experience with quantization, distillation, caching strategies , continuous batching, paged attention, and speculative decoding
  • High-Performance Systems. Proficiency in C++, CUDA, Rust, or highly optimized Python. You know how to profile code and squeeze every ounce of performance out of NVIDIA GPUs
  • Distributed Systems & Scaling. Experience with Kubernetes, Ray, custom load balancing, multi-GPU/multi-node inference, and reliably handling thousands of concurrent connections
  • Public work. Non-trivial systems programming projects, open-source contributions to major inference engines, or deep-dive technical write-ups
  • Full-cycle ownership. You can take a model from the research team, containerize it, optimize its serving, and ensure it runs reliably in production
  • Background. PhD in CS, Physics, Math, or equivalent practical experience building backend or ML systems
Job Responsibility
Job Responsibility
  • We hand you unclear problems and expect you to make them clear
  • We value engineers who say 'I don't know yet' and then design the benchmark or prototype that finds out
  • We treat performance, latency, and reliability as first-class product features, not a box to check before launch
  • Impact comes before everything else, though we support sharing work and open-source contributions that move the field forward
  • Your work should be visible
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • relocation assistance
  • Fulltime
Read More
Arrow Right

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...
Location
Location
United States , Chevy Chase, MD; Palo Alto, CA
Salary
Salary:
130000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8 or more years of professional software engineering or applied machine learning experience
  • 2 or more years working with Generative AI or LLM-based systems in production
  • Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
  • Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
  • Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
  • Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
  • Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
  • Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
  • Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
  • Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering
Job Responsibility
Job Responsibility
  • Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
  • Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
  • Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
  • Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
  • Question status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
  • Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
  • Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
  • Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
  • Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
  • Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right
New

Staff / Principal Machine Learning Engineer, Serving

Staff / Principal Machine Learning Engineer, Serving - UK. About Inworld: Inworl...
Location
Location
United Kingdom
Salary
Salary:
140000.00 - 200000.00 GBP / Year
inworld.ai Logo
Inworld AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Inference Optimization
  • Model Acceleration
  • High-Performance Systems
  • Distributed Systems & Scaling
  • Public work
  • Full-cycle ownership
  • Background
Job Responsibility
Job Responsibility
  • Take a model from the research team, containerize it, optimize its serving, and ensure it runs reliably in production
What we offer
What we offer
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Distinguished Engineer

As a Distinguished Engineer at Capital One, you will be a part of a community of...
Location
Location
United States , McLean; New York
Salary
Salary:
269100.00 - 335100.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree
  • At least 9 years of experience in software engineering (including solution architecture and enterprise design patterns)
  • At least 7 years of experience in Data Engineering or Machine Learning Engineering, specifically building and maintaining large-scale production data pipelines (PySpark, SQL, Airflow)
  • At least 5 years of experience in Cloud Native Architecture, including container orchestration (Kubernetes or KubeFlow) and serverless design
  • At least 3 years of experience in a Technical Leadership role (Principal, Staff, or Distinguished Engineer), leading cross-functional initiatives without direct reporting lines
Job Responsibility
Job Responsibility
  • Articulate and evangelize a bold technical vision for your domain
  • Decompose complex problems into practical and operational solutions
  • Ensure the quality of technical design and implementation
  • Serve as an authoritative expert on non-functional system characteristics, such as performance, scalability and operability
  • Continue learning and injecting advanced technical knowledge into our community
  • Handle several projects simultaneously, balancing your time to maximize impact
  • Act as a role model and mentor within the tech community, helping to coach and strengthen the technical expertise and know-how of our engineering and product community
  • Partner with the Capital One Travel Data & Personalization team to architect the next generation of our Travel Intelligence Platform
  • Serve as the primary architect for the foundational data schemas and orchestration layers
  • Establish a holistic Agentic AI framework
What we offer
What we offer
  • Comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer, Infrastructure

At Docker, we make app development easier so developers can focus on what matter...
Location
Location
United States , Seattle
Salary
Salary:
251000.00 - 352000.00 USD / Year
docker.com Logo
Docker
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of software engineering experience with demonstrated expertise across multiple platform domains (identity, billing, data, infrastructure)
  • Proven track record architecting and delivering large-scale distributed systems serving millions of users and thousands of enterprise customers
  • Deep expertise in at least two of: identity/access management systems, billing/monetization platforms, data platforms, or cloud infrastructure
  • Broad working knowledge across all platform domains with ability to make sound architectural decisions spanning multiple areas
  • Expert-level understanding of API design, service architecture, and system integration patterns at scale
  • Experience with cloud platforms (AWS, GCP, or Azure) and modern infrastructure patterns (Kubernetes, service mesh, infrastructure-as-code)
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • Track record of establishing strategic technical plans that directly enabled business outcomes (revenue growth, cost reduction, market expansion)
  • Experience translating business strategy into technical architecture and roadmaps
  • Demonstrated ability to identify and prioritize investments that provide maximum platform leverage
Job Responsibility
Job Responsibility
  • Define and own the multi-year technical vision for Docker's foundational platform, encompassing accounts, billing, data, enterprise governance, and infrastructure
  • Establish strategic plans and objectives for major platform initiatives, making architectural decisions that ensure effective achievement of Docker's business objectives
  • Contribute to and drive the strategic vision in collaboration with the VP of Engineering, translating organizational strategy into technical roadmaps that span multiple teams and years
  • Identify and prioritize platform investments that provide maximum leverage—capabilities built once that enable rapid iteration across all Docker products
  • Develop architectural principles and standards that guide technical decisions across the Bridge organization and influence product engineering teams
  • Anticipate future business needs and ensure platform architecture provides the flexibility to support Docker's evolving commercial models
  • Lead large cross-company programs that require coordination across Desktop, Hub, AI, Security, Cloud, and Platform teams
  • Architect the unified platform interfaces ("Control Planes") that enable product teams to answer canonical questions like "Can this user access this feature?" or "How much has this organization consumed?" without understanding underlying complexity
  • Drive convergence of fragmented systems across Docker—replacing product-specific implementations with shared platform capabilities for authentication, authorization, billing, and observability
  • Establish technical contracts between platform and product teams that enable independent velocity while ensuring consistency and reliability
What we offer
What we offer
  • Freedom & flexibility
  • fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup
  • we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
  • Fulltime
Read More
Arrow Right

Principal Engineer, Model Dev Platform

As the Principal Engineer for the Model Development Platform at Wayve, you will ...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Technical Leadership at Scale – 10+ years of experience designing and building large-scale distributed systems, ML/AI infrastructure, full stack web application, or developer platforms, including at least 3 years as a staff or principal-level engineer
  • Architectural Depth & Breadth – Proven ability to design systems spanning web platforms, ML pipelines, and large-scale compute orchestration (e.g., Spark, Ray, Kubernetes, Airflow, MLflow)
  • Reliability & Performance Mindset – Experience driving platform reliability improvements, defining SLAs/SLOs, and building self-healing and observable systems that operate at “four nines” availability or better
  • Hands-On Systems Design – Deep understanding of distributed computing, workflow orchestration, data modeling, and API design, with the ability to write and review production-quality code
  • Collaborative Influence – Excellent communication and cross-functional collaboration skills
  • ability to guide engineers, managers, and researchers toward unified technical direction
  • Mentorship & Culture – Demonstrated success in mentoring engineers across levels and cultivating a culture of engineering excellence
  • Education – Bachelor’s degree in Computer Science, Software Engineering, or related field (advanced degree preferred, or equivalent experience)
Job Responsibility
Job Responsibility
  • Design and evolve the overarching architecture of the model development platform, ensuring system-wide reliability, observability, and scalability
  • Work across disciplines—from front-end web UIs to large-scale distributed training, from Spark-based data pipelines to experiment scheduling algorithms using linear optimization—to unify the platform’s architecture and ensure smooth interoperability between systems
  • Dive deep into the thorniest technical challenges faced by individual subteams, bringing your expertise in distributed systems, large-scale compute, and system design to bear
  • Develop and refine systems that optimize how models are tested—whether in simulation or on-road—balancing constraints like hardware availability, safety requirements, and research priorities
  • Architect data processing pipelines capable of ingesting, transforming, and enriching petabytes of sensor data from the global fleet
  • Serve as a mentor and coach for engineers across the organization—developing technical talent, improving design practices, and fostering a culture of learning and technical excellence
  • Partner with Product Management, Research, and Operations to align technical architecture with user needs and product vision
Read More
Arrow Right

Staff Software Engineer, Social Graph

We are seeking a Staff Software Engineer with deep expertise in graph theory, gr...
Location
Location
United States , San Francisco
Salary
Salary:
181000.00 - 271000.00 USD / Year
gofundme.com Logo
GoFundMe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of industry experience, including significant experience at senior / staff / principal levels
  • Demonstrated expertise launching and scaling graph-based applications in production
  • Deep understanding of graph theory, graph algorithms (e.g., traversal, clustering, centrality), and modern graph data structures
  • Expert-level experience with graph databases (Neo4j, TigerGraph, JanusGraph, DGL-backed systems, etc.) and efficient graph querying
  • Proven ability to design high-scale pipelines for ingesting and transforming social or behavioral data
  • Experience with distributed streaming frameworks (Kafka, Flink, Spark Streaming)
  • Hands-on experience incorporating graph-derived features into recommendation, ranking, trust, or safety models
  • Familiarity with Graph Neural Networks (GNNs), graph embeddings, or graph-based ranking systems
  • Strong product intuition and ability to articulate how graph systems drive business outcomes
  • Ability to influence architectural direction and mentor teams
Job Responsibility
Job Responsibility
  • Serve as the technical lead for initiatives related to social graph modeling, storage, retrieval, and computation
  • Architect and scale graph databases and graph query systems capable of supporting billions of nodes and edges with low-latency performance
  • Design and ship pipelines for ingesting, cleaning, and transforming social and behavioral data into graph structures
  • Partner with ML teams to productionize graph-based features, including embeddings, similarity signals, trust metrics, and GNN-powered ranking features
  • Lead the development of graph-informed recommendation, trust, and safety systems, ensuring models reflect real-world connectivity patterns
  • Define and implement feature engineering strategies leveraging graph topology (e.g., mutual connections, influence scoring, community structure)
  • Contribute to architecture decisions related to streaming systems (Kafka, Flink, Spark Streaming) and real-time graph updates
  • Mentor engineers and guide best practices on graph design, distributed systems, feature computation, and ML integration
  • Collaborate with Product to translate graph capabilities into business-impacting features that drive trust, engagement, and discovery
  • Ensure reliability, scalability, observability, and data quality in all graph-related systems
What we offer
What we offer
  • Competitive pay and comprehensive healthcare benefits
  • Financial assistance for things like hybrid work, family planning
  • Generous parental leave
  • Flexible time-off policies
  • Mental health and wellness resources
  • Learning, development, and recognition programs
  • Volunteering program
  • Equity
  • Fulltime
Read More
Arrow Right

Principal Geophysicist

This is a senior-level position for an expert Geophysicist with advanced experie...
Location
Location
Saudi Arabia , Dammam
Salary
Salary:
Not provided
fugro.com Logo
Fugro
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree (MSc or PhD) in Geophysics or related field
  • Minimum 15 years of relevant experience in mineral exploration geophysics, with at least 5 years in a senior or principal consulting role
  • Expert proficiency with industry-standard geophysical software (e.g., Oasis montaj, Geosoft, Leapfrog Geo, GOCAD)
  • Expertise in deep geophysical investigation techniques (e.g., borehole EM, magnetotellurics), data integration, and modeling
  • Strong background in data processing, inversion, and 3D modeling
  • Strong leadership, analytical, and communication skills
  • Proven ability to deliver innovative solutions and support strategic business objectives
Job Responsibility
Job Responsibility
  • Drive business growth by identifying opportunities, shaping strategic initiatives, and fostering an entrepreneurial approach to execution
  • Lead complex geophysical investigations, including advanced imaging, mineral systems modeling, and integration of geophysical data for mining and infrastructure projects
  • Design, implement, and manage multi-platform geophysical programs (airborne, ground, borehole, and drone), ensuring the integrity and accuracy of acquired data
  • Provide expert technical input on exploration strategy, target generation, and regional development programs. Contribute technical recommendations for potential land acquisitions and mine development projects
  • Conduct advanced analysis and 2D/3D modeling of geophysical data (magnetic, gravity, electromagnetic (EM), induced polarization (IP), seismic, radiometric, etc.) to produce accurate subsurface interpretations
  • Integrate geophysical data seamlessly with geological, structural, geochemical, and remote sensing datasets. Oversee all QA/QC procedures for geophysical data processing and inversion
  • Prepare and defend technical proposals, client presentations, and bid documents, serving as a named expert in client engagements
  • Act as the internal technical authority and provide expert guidance to geologists and engineers on geophysical methods. Mentor and train junior staff, fostering a culture of technical excellence
  • Collaborate with multidisciplinary teams to deliver integrated solutions and support cross-border project initiatives
  • Engage with clients through workshops, technical presentations, and bid defenses, building trust and credibility
  • Fulltime
Read More
Arrow Right