Staff Product Manager, Managed Inference Job at Crusoe (San Francisco)

Staff Product Manager, Managed Inference

As a core member of the Crusoe Managed AI Services team, you will own the comple...

Location

United States , San Francisco; Sunnyvale

Salary:

204000.00 - 247000.00 USD / Year

Crusoe

Expiration Date

Until further notice

Requirements

6+ years of experience in technical product management or engineering roles with product responsibilities
Experience building and launching cloud infrastructure, platform, or AI/ML services used in production
Strong understanding of cloud infrastructure (e.g., AWS, GCP, Azure) and modern compute architectures
Familiarity with the machine learning lifecycle, particularly model deployment, inference, and monitoring
Strong communication and collaboration skills, with experience working across engineering, product, and business teams
Demonstrated ability to operate independently with strong product judgment and a bias for action
Bachelor’s degree in Computer Science or a related technical field (or equivalent experience)

Job Responsibility

Own the end-to-end product lifecycle for Crusoe’s Managed Inference services, including roadmap definition, execution, and iteration
Translate customer needs, market signals, and technical constraints into clear product requirements and prioritization
Partner closely with Engineering, Infrastructure, and Platform teams to deliver scalable, reliable inference services
Drive product decisions across performance, reliability, cost efficiency, and developer experience
Define and track success metrics for inference services in production environments
Collaborate with go-to-market teams to support product launches, positioning, and customer adoption
Communicate product strategy and tradeoffs clearly to cross-functional partners and leadership

What we offer

Restricted Stock Units
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement

Fulltime

Senior Manager, Staff Software Engineering

We are looking to recruit Senior Engineering Manager to lead a newly formed Appl...

Location

United States , Chevy Chase

Salary:

150000.00 - 300000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

10+ years hands-on software or solutions engineering
recent experience architecting/building production systems
5+ years leading Agile teams delivering customer-impacting products
proven track record shipping AI/ML-enabled features to production with measurable impact
strong cloud experience (AWS or Azure)
CI/CD, SDLC, and modern engineering practices
practical MLOps/LLMOps experience (model lifecycle, telemetry, A/B testing, online and batch inference)
excellent stakeholder management and communication
Bachelor’s in CS/IS or related field (or equivalent experience)

Job Responsibility

Partner with product, data science, and AI Platform teams to define and deliver an Applied AI roadmap for claims tied to measurable business outcomes
Lead end-to-end delivery of AI features in the claims domain (e.g., claims triage/routing, adjuster assistance, subrogation/recovery opportunities, workload optimization)
Establish robust MLOps/LLMOps (model/prompt/version management, evaluation, feature/vector stores, CI/CD for ML, online/batch inference, cost/performance tuning)
Rapidly prototype using AI-assisted development tools (e.g., Cursor, Claude Code) to validate concepts and accelerate delivery
Ensure reliability, security, and Responsible AI (privacy-by-design, guardrails, human-in-the-loop, auditability)
Drive engineering excellence (clean architecture, scalable APIs, event-driven integration, observability, automated testing, SLOs/runbooks/on-call)
Recruit, coach, and grow a high-performing team
champion Agile practices and operational excellence

What we offer

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
Financial benefits including market-competitive compensation
a 401K savings plan vested from day one that offers a 6% match
performance and recognition-based incentives
and tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Fulltime

Member of Technical Staff, Inference

We're looking for an ML infrastructure engineer to bridge the gap between resear...

Location

United States

Salary:

240000.00 - 290000.00 USD / Year

Runway

Expiration Date

Until further notice

Requirements

4+ years of experience running ML model inference at scale in production environments
Strong experience with PyTorch and multi-GPU inference for large models
Experience with Kubernetes for ML workloads—deploying, scaling, and debugging GPU-based services
Comfortable working across multiple cloud providers and managing GPU driver compatibility
Experience with monitoring and observability for ML systems (errors, throughput, GPU utilization)
Self-starter who can work embedded with research teams and move fast
Strong systems thinking and pragmatic approach to production reliability
Humility and open mindedness

Job Responsibility

Productionize model checkpoints end-to-end: from research completion to internal testing to production deployment to post-release support
Build and optimize inference systems for large-scale generative models running on multi-GPU environments
Design and implement model serving infrastructure specialized for diffusion models and real-time diffusion workflows
Add monitoring and observability for new model releases—track errors, throughput, GPU utilization, and latency
Embed with research teams to gather training data, run preprocessing scripts, and support the model development process
Explore and integrate with GPU inference providers (Modal, E2E, Baseten, etc.)

Fulltime

Senior Staff Engineering Manager - Applied AI

GEICO is in the midst of an exciting transformation as a product and tech powere...

Location

United States , Chevy Chase; Palo Alto; Dallas

Salary:

140000.00 - 300000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

10+ years hands-on software or solutions engineering
recent experience architecting/building production systems
5+ years leading Agile teams delivering customer-impacting products
proven track record shipping AI/ML-enabled features to production with measurable impact
strong cloud experience (AWS or Azure)
CI/CD, SDLC, and modern engineering practices
practical MLOps/LLMOps experience (model lifecycle, telemetry, A/B testing, online and batch inference)
excellent stakeholder management and communication
Bachelor’s in CS/IS or related field (or equivalent experience)

Job Responsibility

Partner with product, data science, and AI Platform teams to define and deliver an Applied AI roadmap for claims tied to measurable business outcomes
Lead end-to-end delivery of AI features in the claims domain (e.g., claims triage/routing, adjuster assistance, subrogation/recovery opportunities, workload optimization)
Establish robust MLOps/LLMOps (model/prompt/version management, evaluation, feature/vector stores, CI/CD for ML, online/batch inference, cost/performance tuning)
Rapidly prototype using AI-assisted development tools (e.g., Cursor, Claude Code) to validate concepts and accelerate delivery
Ensure reliability, security, and Responsible AI (privacy-by-design, guardrails, human-in-the-loop, auditability)
Drive engineering excellence (clean architecture, scalable APIs, event-driven integration, observability, automated testing, SLOs/runbooks/on-call)
Recruit, coach, and grow a high-performing team
champion Agile practices and operational excellence

What we offer

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
Financial benefits including market-competitive compensation
a 401K savings plan vested from day one that offers a 6% match
performance and recognition-based incentives
and tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Fulltime

Staff Software Engineer, Managed AI - AI Platform

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'...

Location

United States , San Francisco, CA; Sunnyvale, CA

Salary:

208725.00 - 253000.00 USD / Year

Crusoe

Expiration Date

Until further notice

Requirements

Advanced degree in Computer Science/Engineering
8-10+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
Experience with distributed systems, cloud services (compute, storage, networking, database), and delivering early-stage projects quickly
Experience with Generative AI (LLMs, Multimodal) and familiar with AI infrastructure (training, inference, ETL pipelines)
Proficient with container runtimes (e.g., Kubernetes), microservices, REST APIs, gRPC, and the full software development lifecycle including CI/CD

Job Responsibility

Lead the design and implementation of core AI services, including: Resilient fault-tolerant queues for efficient task distribution
Model catalogs for managing and versioning AI models
Scheduling mechanisms optimized for cost and performance
Architect and scale infrastructure to handle millions of API requests per second
Implement robust monitoring and alerting to ensure system health and 24/7 availability
Collaborate closely with product management, business strategy, and other engineering teams to define the AI platform roadmap
Influence the long-term vision and architectural decisions of the platform
Contribute to open-source AI frameworks and actively participate in the AI community
Prototype and rapidly iterate on emerging technologies and new features

What we offer

Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement

Fulltime

Staff Software Engineer, Inference Infrastructure

Our mission is to scale intelligence to serve humanity. We’re training and deplo...

Location

San Francisco, Toronto, London, New York, Montreal

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

5+ years of engineering experience running production infrastructure at a large scale
Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters
Experience with Kubernetes dev and production coding and support
Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving
Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based computing environments
Experience in compute/storage/network resource and cost management
Excellent collaboration and troubleshooting skills to build mission-critical systems, and ensure smooth operations and efficient teamwork
The grit and adaptability to solve complex technical challenges that evolve day to day
Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), especially how they influence latency and throughput of inference
Strong understanding or working experience with distributed systems

Job Responsibility

Developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints
Working closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments
Interfacing with customers and creating customized deployments to meet their specific needs

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Staff Site Reliability Engineer, Managed AI

At Crusoe, our Site Reliability Engineering team ensures the reliability and sca...

Location

United States , San Francisco; Sunnyvale

Salary:

204000.00 - 247000.00 USD / Year

Crusoe

Expiration Date

Until further notice

Requirements

Strong software engineering background — experience building production-grade systems beyond scripting or Bash
Demonstrated experience in distributed systems design and implementation
Hands-on work with large language models (LLMs) or AI/ML infrastructure
SRE mindset and experience (whether or not under the SRE title) including: Defining and measuring SLIs/SLOs
Building monitoring and observability systems
Driving performance and reliability improvements
Designing fault-tolerant systems and automated testing strategies
Proficiency in at least one modern programming language (Python, Go, Java, C++)
Familiarity with Kubernetes or container orchestration platforms
Strong collaboration and communication skills

Job Responsibility

Design and operate reliable managed AI services with a focus on serving and scaling LLM workloads
Build automation and reliability tooling to support distributed AI pipelines and inference services
Define, measure, and improve SLIs/SLOs across AI workloads to ensure performance and reliability targets are met
Collaborate with AI, platform, and infrastructure teams to optimize large-scale training and inference clusters
Automate observability by building telemetry and performance tuning strategies for latency-sensitive AI services
Investigate and resolve reliability issues in distributed AI systems using telemetry, logs, and profiling
Contribute to the architecture of next-generation distributed systems purpose-built for AI-first environments

What we offer

Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement

Fulltime

New

Senior Project Engineer

CEC is a privately held engineering firm, serving both public entities and priva...

Location

United States , Norman; Oklahoma City; Tulsa

Salary:

Not provided

CEC

Expiration Date

Until further notice

Requirements

Bachelor's degree (B.A. or B.S.) in Structural Engineering, Architectural Engineering (with Structural emphasis), or Civil Engineering (with Structural emphasis) from ABET accredited university
Eight or more years of progressive experience in structural engineering
Professional Engineer (P.E.) or Structural Engineer (S.E.) license (preferred) required
If not licensed in Oklahoma, Texas, and/or Arkansas, must be able to obtain licenses within six months of start date
Previous experience with designing and detailing structural systems for building-type structures with demonstrable familiarity the following Structural systems: Reinforced Concrete
Structural Steel
Concrete Masonry Units (CMU)
Demonstrable familiarity with Autodesk Revit
Valid driver's license, clean driving record, and ability to drive to meetings and project sites
English language skills: ability to read, analyze, and interpret general business periodicals, professional journals, technical procedures, or governmental regulations

Job Responsibility

Serves as project engineer with autonomy for project-related tasks and as a designated client contact
May have project management responsibilities on multiple projects, particularly those with high complexity and/or risk
Responsible for successful completion of projects, including schedule, budget, and profitability
Develops estimates, pricing, scoping, and marketing strategies for proposed projects
Develops feasible engineering design alternatives to meet project requirements
Prepares or directs preparation of structural calculations, modeling, analysis, and design
Utilizes or oversees use of Revit and other structural-specific design and calculation software to create or assist in the creation of detailed designs for construction documents
Prepares or directs preparation of contract documents, including technical reports, studies, plan specifications, construction cost estimates, and permitting for projects
Selects and utilizes structural materials appropriate for the project based on coordination with architects, engineers, plans, specifications, and code requirements
Reviews construction shop drawings, responds to RFIs, performs site visits, and other Construction Administration activities

Fulltime

Select Country

Staff Product Manager, Managed Inference

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Staff Product Manager, Managed Inference

Staff Product Manager, Managed Inference

Senior Manager, Staff Software Engineering

Member of Technical Staff, Inference

Senior Staff Engineering Manager - Applied AI

Staff Software Engineer, Managed AI - AI Platform

Staff Software Engineer, Inference Infrastructure

Staff Site Reliability Engineer, Managed AI

Senior Project Engineer

Our AI answers in your language