CrawlJobs Logo

Staff Product Manager, Managed Inference

United States, San Francisco 204000.00 - 247000.00 USD / Year · Job Posted January 19, 2026
Apply Position
Job Link Share

Job Description

As a core member of the Crusoe Managed AI Services team, you will own the complete product lifecycle for our Managed Inference offerings, from initial concept and strategic roadmap to successful execution and market adoption. You will be the champion for our inference service offerings, translating market needs and technical complexities into clear product specifications, compelling narratives, and product decisions that drive business growth for Crusoe Cloud.

Job Responsibility

  • Own the end-to-end product lifecycle for Crusoe’s Managed Inference services, including roadmap definition, execution, and iteration
  • Translate customer needs, market signals, and technical constraints into clear product requirements and prioritization
  • Partner closely with Engineering, Infrastructure, and Platform teams to deliver scalable, reliable inference services
  • Drive product decisions across performance, reliability, cost efficiency, and developer experience
  • Define and track success metrics for inference services in production environments
  • Collaborate with go-to-market teams to support product launches, positioning, and customer adoption
  • Communicate product strategy and tradeoffs clearly to cross-functional partners and leadership

Requirements

  • 6+ years of experience in technical product management or engineering roles with product responsibilities
  • Experience building and launching cloud infrastructure, platform, or AI/ML services used in production
  • Strong understanding of cloud infrastructure (e.g., AWS, GCP, Azure) and modern compute architectures
  • Familiarity with the machine learning lifecycle, particularly model deployment, inference, and monitoring
  • Strong communication and collaboration skills, with experience working across engineering, product, and business teams
  • Demonstrated ability to operate independently with strong product judgment and a bias for action
  • Bachelor’s degree in Computer Science or a related technical field (or equivalent experience)

Nice to have

  • Experience building developer-facing platforms or services
  • Exposure to inference-as-a-service, model serving frameworks, or ML infrastructure tooling
  • Participation in developer communities or open-source projects
  • Strong interest in trends across AI infrastructure and inference at scale

What we offer

  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid commuter benefit
  • $300/month

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Product Manager, Managed Inference

8 matching positions

Staff Product Manager, Managed Inference

As a core member of the Crusoe Managed AI Services team, you will own the comple...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
204000.00 - 247000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience in technical product management or engineering roles with product responsibilities
  • Experience building and launching cloud infrastructure, platform, or AI/ML services used in production
  • Strong understanding of cloud infrastructure (e.g., AWS, GCP, Azure) and modern compute architectures
  • Familiarity with the machine learning lifecycle, particularly model deployment, inference, and monitoring
  • Strong communication and collaboration skills, with experience working across engineering, product, and business teams
  • Demonstrated ability to operate independently with strong product judgment and a bias for action
  • Bachelor’s degree in Computer Science or a related technical field (or equivalent experience)
Job Responsibility
Job Responsibility
  • Own the end-to-end product lifecycle for Crusoe’s Managed Inference services, including roadmap definition, execution, and iteration
  • Translate customer needs, market signals, and technical constraints into clear product requirements and prioritization
  • Partner closely with Engineering, Infrastructure, and Platform teams to deliver scalable, reliable inference services
  • Drive product decisions across performance, reliability, cost efficiency, and developer experience
  • Define and track success metrics for inference services in production environments
  • Collaborate with go-to-market teams to support product launches, positioning, and customer adoption
  • Communicate product strategy and tradeoffs clearly to cross-functional partners and leadership
What we offer
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Senior Manager, Staff Software Engineering

We are looking to recruit Senior Engineering Manager to lead a newly formed Appl...
Location
Location
United States , Chevy Chase
Salary
Salary:
150000.00 - 300000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years hands-on software or solutions engineering
  • recent experience architecting/building production systems
  • 5+ years leading Agile teams delivering customer-impacting products
  • proven track record shipping AI/ML-enabled features to production with measurable impact
  • strong cloud experience (AWS or Azure)
  • CI/CD, SDLC, and modern engineering practices
  • practical MLOps/LLMOps experience (model lifecycle, telemetry, A/B testing, online and batch inference)
  • excellent stakeholder management and communication
  • Bachelor’s in CS/IS or related field (or equivalent experience)
Job Responsibility
Job Responsibility
  • Partner with product, data science, and AI Platform teams to define and deliver an Applied AI roadmap for claims tied to measurable business outcomes
  • Lead end-to-end delivery of AI features in the claims domain (e.g., claims triage/routing, adjuster assistance, subrogation/recovery opportunities, workload optimization)
  • Establish robust MLOps/LLMOps (model/prompt/version management, evaluation, feature/vector stores, CI/CD for ML, online/batch inference, cost/performance tuning)
  • Rapidly prototype using AI-assisted development tools (e.g., Cursor, Claude Code) to validate concepts and accelerate delivery
  • Ensure reliability, security, and Responsible AI (privacy-by-design, guardrails, human-in-the-loop, auditability)
  • Drive engineering excellence (clean architecture, scalable APIs, event-driven integration, observability, automated testing, SLOs/runbooks/on-call)
  • Recruit, coach, and grow a high-performing team
  • champion Agile practices and operational excellence
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Inference

We're looking for an ML infrastructure engineer to bridge the gap between resear...
Location
Location
United States
Salary
Salary:
240000.00 - 290000.00 USD / Year
runwayml.com Logo
Runway
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience running ML model inference at scale in production environments
  • Strong experience with PyTorch and multi-GPU inference for large models
  • Experience with Kubernetes for ML workloads—deploying, scaling, and debugging GPU-based services
  • Comfortable working across multiple cloud providers and managing GPU driver compatibility
  • Experience with monitoring and observability for ML systems (errors, throughput, GPU utilization)
  • Self-starter who can work embedded with research teams and move fast
  • Strong systems thinking and pragmatic approach to production reliability
  • Humility and open mindedness
Job Responsibility
Job Responsibility
  • Productionize model checkpoints end-to-end: from research completion to internal testing to production deployment to post-release support
  • Build and optimize inference systems for large-scale generative models running on multi-GPU environments
  • Design and implement model serving infrastructure specialized for diffusion models and real-time diffusion workflows
  • Add monitoring and observability for new model releases—track errors, throughput, GPU utilization, and latency
  • Embed with research teams to gather training data, run preprocessing scripts, and support the model development process
  • Explore and integrate with GPU inference providers (Modal, E2E, Baseten, etc.)
  • Fulltime
Read More
Arrow Right

Senior Staff Engineering Manager - Applied AI

GEICO is in the midst of an exciting transformation as a product and tech powere...
Location
Location
United States , Chevy Chase; Palo Alto; Dallas
Salary
Salary:
140000.00 - 300000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years hands-on software or solutions engineering
  • recent experience architecting/building production systems
  • 5+ years leading Agile teams delivering customer-impacting products
  • proven track record shipping AI/ML-enabled features to production with measurable impact
  • strong cloud experience (AWS or Azure)
  • CI/CD, SDLC, and modern engineering practices
  • practical MLOps/LLMOps experience (model lifecycle, telemetry, A/B testing, online and batch inference)
  • excellent stakeholder management and communication
  • Bachelor’s in CS/IS or related field (or equivalent experience)
Job Responsibility
Job Responsibility
  • Partner with product, data science, and AI Platform teams to define and deliver an Applied AI roadmap for claims tied to measurable business outcomes
  • Lead end-to-end delivery of AI features in the claims domain (e.g., claims triage/routing, adjuster assistance, subrogation/recovery opportunities, workload optimization)
  • Establish robust MLOps/LLMOps (model/prompt/version management, evaluation, feature/vector stores, CI/CD for ML, online/batch inference, cost/performance tuning)
  • Rapidly prototype using AI-assisted development tools (e.g., Cursor, Claude Code) to validate concepts and accelerate delivery
  • Ensure reliability, security, and Responsible AI (privacy-by-design, guardrails, human-in-the-loop, auditability)
  • Drive engineering excellence (clean architecture, scalable APIs, event-driven integration, observability, automated testing, SLOs/runbooks/on-call)
  • Recruit, coach, and grow a high-performing team
  • champion Agile practices and operational excellence
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Managed AI - AI Platform

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'...
Location
Location
United States , San Francisco, CA; Sunnyvale, CA
Salary
Salary:
208725.00 - 253000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree in Computer Science/Engineering
  • 8-10+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
  • Experience with distributed systems, cloud services (compute, storage, networking, database), and delivering early-stage projects quickly
  • Experience with Generative AI (LLMs, Multimodal) and familiar with AI infrastructure (training, inference, ETL pipelines)
  • Proficient with container runtimes (e.g., Kubernetes), microservices, REST APIs, gRPC, and the full software development lifecycle including CI/CD
Job Responsibility
Job Responsibility
  • Lead the design and implementation of core AI services, including: Resilient fault-tolerant queues for efficient task distribution
  • Model catalogs for managing and versioning AI models
  • Scheduling mechanisms optimized for cost and performance
  • Architect and scale infrastructure to handle millions of API requests per second
  • Implement robust monitoring and alerting to ensure system health and 24/7 availability
  • Collaborate closely with product management, business strategy, and other engineering teams to define the AI platform roadmap
  • Influence the long-term vision and architectural decisions of the platform
  • Contribute to open-source AI frameworks and actively participate in the AI community
  • Prototype and rapidly iterate on emerging technologies and new features
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Inference Infrastructure

Our mission is to scale intelligence to serve humanity. We’re training and deplo...
Location
Location
San Francisco, Toronto, London, New York, Montreal
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of engineering experience running production infrastructure at a large scale
  • Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters
  • Experience with Kubernetes dev and production coding and support
  • Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving
  • Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based computing environments
  • Experience in compute/storage/network resource and cost management
  • Excellent collaboration and troubleshooting skills to build mission-critical systems, and ensure smooth operations and efficient teamwork
  • The grit and adaptability to solve complex technical challenges that evolve day to day
  • Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), especially how they influence latency and throughput of inference
  • Strong understanding or working experience with distributed systems
Job Responsibility
Job Responsibility
  • Developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints
  • Working closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments
  • Interfacing with customers and creating customized deployments to meet their specific needs
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Staff Site Reliability Engineer, Managed AI

At Crusoe, our Site Reliability Engineering team ensures the reliability and sca...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
204000.00 - 247000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering background — experience building production-grade systems beyond scripting or Bash
  • Demonstrated experience in distributed systems design and implementation
  • Hands-on work with large language models (LLMs) or AI/ML infrastructure
  • SRE mindset and experience (whether or not under the SRE title) including: Defining and measuring SLIs/SLOs
  • Building monitoring and observability systems
  • Driving performance and reliability improvements
  • Designing fault-tolerant systems and automated testing strategies
  • Proficiency in at least one modern programming language (Python, Go, Java, C++)
  • Familiarity with Kubernetes or container orchestration platforms
  • Strong collaboration and communication skills
Job Responsibility
Job Responsibility
  • Design and operate reliable managed AI services with a focus on serving and scaling LLM workloads
  • Build automation and reliability tooling to support distributed AI pipelines and inference services
  • Define, measure, and improve SLIs/SLOs across AI workloads to ensure performance and reliability targets are met
  • Collaborate with AI, platform, and infrastructure teams to optimize large-scale training and inference clusters
  • Automate observability by building telemetry and performance tuning strategies for latency-sensitive AI services
  • Investigate and resolve reliability issues in distributed AI systems using telemetry, logs, and profiling
  • Contribute to the architecture of next-generation distributed systems purpose-built for AI-first environments
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right
New

Senior Project Engineer

CEC is a privately held engineering firm, serving both public entities and priva...
Location
Location
United States , Norman; Oklahoma City; Tulsa
Salary
Salary:
Not provided
connectcec.com Logo
CEC
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree (B.A. or B.S.) in Structural Engineering, Architectural Engineering (with Structural emphasis), or Civil Engineering (with Structural emphasis) from ABET accredited university
  • Eight or more years of progressive experience in structural engineering
  • Professional Engineer (P.E.) or Structural Engineer (S.E.) license (preferred) required
  • If not licensed in Oklahoma, Texas, and/or Arkansas, must be able to obtain licenses within six months of start date
  • Previous experience with designing and detailing structural systems for building-type structures with demonstrable familiarity the following Structural systems: Reinforced Concrete
  • Structural Steel
  • Concrete Masonry Units (CMU)
  • Demonstrable familiarity with Autodesk Revit
  • Valid driver's license, clean driving record, and ability to drive to meetings and project sites
  • English language skills: ability to read, analyze, and interpret general business periodicals, professional journals, technical procedures, or governmental regulations
Job Responsibility
Job Responsibility
  • Serves as project engineer with autonomy for project-related tasks and as a designated client contact
  • May have project management responsibilities on multiple projects, particularly those with high complexity and/or risk
  • Responsible for successful completion of projects, including schedule, budget, and profitability
  • Develops estimates, pricing, scoping, and marketing strategies for proposed projects
  • Develops feasible engineering design alternatives to meet project requirements
  • Prepares or directs preparation of structural calculations, modeling, analysis, and design
  • Utilizes or oversees use of Revit and other structural-specific design and calculation software to create or assist in the creation of detailed designs for construction documents
  • Prepares or directs preparation of contract documents, including technical reports, studies, plan specifications, construction cost estimates, and permitting for projects
  • Selects and utilizes structural materials appropriate for the project based on coordination with architects, engineers, plans, specifications, and code requirements
  • Reviews construction shop drawings, responds to RFIs, performs site visits, and other Construction Administration activities
  • Fulltime
Read More
Arrow Right