CrawlJobs Logo

Technical Program Manager, AI Infrastructure

cerebras.net Logo

Cerebras Systems

Location Icon

Location:
United States , Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Be part of the team that builds and operates the world's fastest AI infrastructure for training and inference. Your role as a TPM will help accelerate data center buildouts to meet the explosive demand for our inference service platform.

Job Responsibility:

  • Own end-to-end technical programs for multiple data center buildouts, coordinating with partners, contractors, and internal teams
  • Drive facility site readiness for power and cooling for Cerebras Wafer-Scale Engine systems
  • Coordinate equipment delivery and manage vendor accountability for schedules and quality related to rack integration and inter-rack cabling
  • Act as the single-threaded owner across internal partners: Hardware & Systems Engineering, Network & Storage Engineering, AI Cloud Infrastructure & Operations
  • Enforce handover criteria between site completion, equipment deployment, and operations
  • Own overall schedule tracking, risk identification, and mitigation, creating clear visibility for leadership
  • Establish program governance, risk tracking, and RACI clarity
  • Present program status, metrics, and operational risks to senior leadership
  • Drive partner accountability on contractual milestones and commercial commitments
  • Document repeatable processes and implement them to scale across future data centers
  • Partner on installation, commissioning, change management, and break/fix workflows
  • Lead incident reviews and postmortems, ensuring corrective actions are completed

Requirements:

  • Experience leading large, cross-functional infrastructure programs
  • Experience with AI/ML, HPC, or accelerator-based infrastructure
  • Strong understanding of data center power and cooling fundamentals
  • Experience installing and managing network, storage, and compute devices
  • Proven ability to define and operationalize metrics
  • Strong written and executive-level communication skills
  • Experience working with colocation providers and facilities teams
  • Background in incident management, reliability, or service operations

Nice to have:

Experience running network operations teams is a plus

What we offer:
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs

Additional Information:

Job Posted:
February 17, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Technical Program Manager, AI Infrastructure

Principal Technical Program Manager - AI and Search Infrastructure

Atlassians can choose where they work – whether in an office, from home, or a co...
Location
Location
United States , San Francisco; Mountain View; Seattle
Salary
Salary:
166100.00 - 266800.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 5+ years of experience with deep domain expertise in the areas of Search Infrastructure (i.e. Search Quality, Search Relevance, Vector Search) or Artificial Intelligence (i.e. LLMs, Inference, Fine-tuning, Content Generation)
  • Minimum 10+ years of experience in Technical Product / Program Management or Technical Leadership role
  • Experience in 0-1 product / platform development and scaling
  • Excellent collaborator who can work across teams effectively and represent domain area to senior leadership (VP+ levels)
  • Experience in gathering product needs, setting strategy & plans, creating clear roadmap with aligned success metrics
  • Experience driving platform evangelization and adoption internally and with external customers (i.e. Enterprise / SMB)
  • Experience building platform products for Enterprise is a plus
  • Master degree or higher education in Computer Science is a plus
  • Excellent verbal, written, and facilitation skills (including experience with facilitating meetings and engaging with an executive audience)
  • Demonstrated experience and success leading high-impact, cross-functional programs or products
Job Responsibility
Job Responsibility
  • Own the strategic vision for Atlassian platform area (i.e. AI, Search Infra) with focus of driving best outcomes for customers (Atlassians)
  • Provide technical expertise for Atlassian platform area (AI or Search Infra)
  • Define product / platform strategy, plans, priorities and roadmap
  • Partner with engineering team and other disciplines on strategy, plans, roadmaps and execution
  • Lead communication with senior leadership, customers and stakeholders on regular cadence
  • Establish key performance indicators (KPIs) and metrics to measure the effectiveness of key initiatives / projects
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Technical Program Manager, AI Platform

Figma is growing our team of passionate creatives and builders on a mission to m...
Location
Location
United States , San Francisco; New York
Salary
Salary:
180000.00 - 308000.00 USD / Year
figma.com Logo
Figma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of technical program management experience (or equivalent) in AI platform, AI research, and AI infrastructure
  • Understand how AI gets built and scaled: model evaluation loops, annotation pipelines, quota limits, and data versioning
  • Have hands-on experience running AI cost/capacity reviews, forecast planning, and vendor oversight. Deep understanding of model cost mechanics, including token burn, cache hit rates, latency, and quota limits
  • Comfort operating in high-ambiguity, high-velocity environments with exec visibility
  • Strong writing and communication skills — you bring structure, clarity, and momentum to complex technical programs
  • Bring a systems-thinking mindset to the AI delivery pipeline, and know where to tighten loops or increase speed
Job Responsibility
Job Responsibility
  • Own and drive programs supporting Figma’s AI platform — including annotation velocity, evaluation pipelines, and cost/capacity readiness
  • Partner with Infra and Finance to plan model scaling across providers: track token usage, forecast traffic, manage regional limits, optimize caching strategies, and reduce latency
  • Lead our internal AI Annotation Program: manage vendors and design annotators. Define task priorities, improve quality standards and increase annotator throughput
  • Support internal AIOps initiatives — model go/no-go decision making, monitor model behavior, prevent regressions, and ensure readiness across quality gates
  • Drive cross-functional execution of key AI-powered product features — coordinate scope, risks, comms, and launch checklists
  • Partner with Data Science to maintain and improve internal visibility: annotation metrics, token quotas, reliability dashboards, and evaluation timelines
What we offer
What we offer
  • equity to employees
  • health, dental & vision
  • retirement with company contribution
  • parental leave & reproductive or family planning support
  • mental health & wellness benefits
  • generous PTO
  • company recharge days
  • a learning & development stipend
  • a work from home stipend
  • cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Technical Account Manager

Our Technical Account Managers play a crucial client-facing role to drive adopti...
Location
Location
Spain
Salary
Salary:
Not provided
maisa.ai Logo
Maisa
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong demonstrable experience managing enterprise technology implementations
  • Proven experience leading customer onboarding, training programs, and change management initiatives at scale
  • Strong program management capabilities, including coordinating parallel workstreams, dependencies, and stakeholder management from technical teams to C-suite
  • Solid technical knowledge across modern cloud infrastructure and software technologies (AWS, Azure, Kubernetes, CI/CD, REST APIs) from either commercial experience or a Bachelor's degree in Computer Science, Information Systems, or related field
  • Demonstrable experience leading discovery processes and process mapping using various frameworks and methodologies
  • Excellent communication and documentation skills with the ability to present complex technical ideas and AI automation solutions to both technical and non-technical stakeholders
  • Business proficiency in Spanish and English
  • Ability to travel up to 25% of the time
Job Responsibility
Job Responsibility
  • Lead enterprise-wide platform rollouts across multiple customer departments, managing parallel implementation streams, comprehensive project plans, milestones, and resource requirements
  • Coordinate customer onboarding activities, including technical setup, process discovery and mapping, user training, and knowledge transfer
  • Establish and track KPIs and success metrics for each deployment phase, providing regular status updates to internal and customer executive stakeholders
  • Identify and mitigate implementation risks through proactive program management
  • Develop scalable training programs and materials to drive customer self-sufficiency and adoption through change management best practices
  • Build and maintain a use case repository to track implementation progress, measure success, and identify expansion opportunities
  • Serve as the primary point of contact for technical and operational issues during implementation
  • Facilitate cross-functional collaboration between customer teams and Maisa AI resources, driving stakeholder engagement
  • Fulltime
Read More
Arrow Right

Product Manager, AI Data Pipelines & Developer Community

As a Product Manager for AI Data Pipelines & Developer Community, you will be in...
Location
Location
United States , Campbell
Salary
Salary:
Not provided
komprise.com Logo
Komprise, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in product management, particularly with a focus on data and AI-driven products
  • Strong understanding of technical data architectures, machine learning models, and AI applications in a business context
  • Demonstrated success in growing technical product adoption and developer communities
  • Experience working with large datasets, data pipelines, and data platforms
  • Excellent communication and interpersonal skills, with the ability to convey complex ideas to both technical and non-technical audiences
  • Proven ability to think strategically and execute methodically, translating complex problems into actionable solutions
  • A Bachelor’s degree in Computer Science, Data Science, Business, or a related field is preferred
Job Responsibility
Job Responsibility
  • Define and Drive Product Strategy: Develop and communicate a clear vision and roadmap for the AI data pipeline platform and associated developer tools, ensuring alignment with business goals and market needs
  • Build and Optimize AI Data Pipelines: Work closely with our product team and IT, end-users, data engineers and data scientists from our customers to build, optimize, and maintain scalable data pipelines that allow them to enrich, classify, curate data for AI agents for augmentation and inferencing
  • Translate User Needs into Product Requirements: Gather and prioritize user requirements, translating them into detailed product specifications and coordinating development efforts with engineering and data science teams
  • Champion Developer Experience: Understand the needs of the developer community, both internal and external, and develop tools, resources, and programs that enhance their experience building with our AI data platform. Investigate participation in partner developer communities and open-source communities
  • Grow and Engage the Developer Community: Lead efforts to foster a vibrant developer community, potentially through events, forums, technical content, and strategic outreach
  • Measure and Analyze Product Performance: Define and track key metrics to measure the success and impact of both the AI data platform and community engagement efforts, using data-driven insights to iterate and improve
  • Cross-Functional Collaboration: Partner effectively with cross-functional teams, including engineering, marketing, sales, and customer success, to ensure product vision alignment and drive successful execution
  • Stay Ahead of the Curve: Monitor industry trends, emerging technologies, and competitive offerings related to AI, data infrastructure, and developer ecosystems
Read More
Arrow Right

AI Product Manager

We’re scaling AI and machine learning across our products, devices, and operatio...
Location
Location
United States , Boston
Salary
Salary:
121300.00 - 177900.00 USD / Year
simplisafe.com Logo
SimpliSafe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of product management experience, including significant ownership of AI/ML or data-intensive products
  • Clear track record of shipping production ML systems (not just integrating third-party AI APIs), in close partnership with data science, ML engineering, and MLOps
  • Principal-level impact: leading cross-team initiatives, shaping strategy, and influencing senior stakeholders
  • Strong understanding of core ML concepts and lifecycle: data, labeling, training/validation, evaluation metrics, deployment, monitoring, and retraining
  • ML experience with at least one of following: computer vision or sensor data, LLM-powered applications (prompting, RAG, fine-tuning, evaluation) and/or hardware or edge products (e.g., on-device models, connectivity/latency trade-offs)
  • Familiarity with modern ML infrastructure (cloud platforms, model serving, CI/CD for ML, monitoring/alerting)
  • Comfortable going deep into data, metrics, and model behavior—not just the UX layer
  • Excellent communicator who can make complex AI topics clear to diverse audiences
  • Strong alignment with our values: customer-obsessed, low ego, highly collaborative, comfortable with ambiguity, and biased toward learning and iteration.
Job Responsibility
Job Responsibility
  • Define and communicate the multi-year roadmap for key AI/ML capabilities across SimpliSafe
  • Identify and prioritize AI opportunities where models and data can materially improve safety, customer experience, or efficiency—on both devices and cloud services
  • Make build-vs-buy decisions for AI capabilities in partnership with data science and engineering
  • Partner with data scientists, ML engineers, and MLOps to design and deliver end-to-end ML solutions—from problem framing through data, training, evaluation, deployment, and monitoring
  • Work with hardware and embedded teams to shape edge AI/ML experiences (e.g., on-device detection, low-latency decisions, bandwidth-aware designs)
  • Define model-level requirements (metrics, latency, cost, guardrails) and connect them to business outcomes (e.g., false alarm reduction, detection accuracy, handle time, CSAT)
  • Translate product needs into requirements for ML platform capabilities (model serving, observability, experiment tracking, human-in-the-loop tools)
  • Lead product direction for LLM and multimodal use cases (e.g., text, vision, sensor data)
  • Decide when to use prompt engineering, RAG, fine-tuning, or traditional ML—and how to evaluate quality, safety, and hallucinations
  • Design workflows that incorporate human review and escalation where needed
What we offer
What we offer
  • A mission- and values-driven culture and a safe, inclusive environment where you can build, grow, and thrive
  • A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families
  • Free SimpliSafe system and professional monitoring for your home
  • Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change
  • Participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits.
  • Fulltime
Read More
Arrow Right

Senior Technical Product Manager

As a Senior Technical Product Manager, Clinical Data Platform, you will be a key...
Location
Location
United States
Salary
Salary:
Not provided
aledade.com Logo
Aledade, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of product or technical program management experience in healthcare data platforms, interoperability, or machine learning infrastructure, with a focus on clinical data ingestion and transformation, technology-enabled services industry, or a SaaS product
  • Experience using and writing queries against data for the purposes of performing preliminary research to inform solution design and build internal business understanding
  • Strong understanding of the software development lifecycle, Agile methodologies, and cross functional collaboration across engineering, informatics and data science teams
  • Product development experience supporting LLM pipelines or retrieval-augmented generation workflows using structured and unstructured healthcare data
  • Proven ability to bridge business objectives and platform capabilities in environments requiring data standardization and semantic normalization
Job Responsibility
Job Responsibility
  • Define and drive both short and long term technical roadmaps for data pipeline infrastructure, ensuring scalable, reliable ingestion and transformation of structured and unstructured data across diverse upstream sources and downstream consumers to deliver maximum value with minimum risk
  • Partner cross functionally with engineering, analytics and key business stakeholders to identify data requirements, translate them into technical specifications and support implementation through backlog grooming, solution design and adoption oversight
  • Monitor pipeline performance and data quality metrics, proactively investigate anomalies with SQL or equivalent query tools to drive root cause analysis and implement improvements to support data completeness, timeliness, analytics and generative AI initiatives
  • Work with internal teams and end users to develop a deep understanding of requirements, perform thoughtful technical solution designs, use data to test hypotheses, and support teams throughout execution
  • Write detailed user stories for new features, capturing detailed descriptions of business rationale, requirements, and success criteria that are defined by measurable outcomes
  • Ongoing optimization of live user workflows and capabilities including monitoring of key metrics & internal user feedback
What we offer
What we offer
  • Flexible work schedules and the ability to work remotely are available for many roles
  • Health, dental and vision insurance paid up to 80% for employees, dependents and domestic partners
  • Robust time-off plan (21 days of PTO in your first year)
  • Two paid volunteer days and 11 paid holidays
  • 12 weeks paid parental leave for all new parents
  • Six weeks paid sabbatical after six years of service
  • Educational Assistant Program and Clinical Employee Reimbursement Program
  • 401(k) with up to 4% match
  • Stock options
  • Fulltime
Read More
Arrow Right

Technical Program Manager-AI Delivery

The CO+I AI Delivery team is focused on delivering various platform services to ...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Engineering, or a related technical field (or equivalent experience)
  • Experience as a Technical Program Manager, Program Manager, or similar role in highly technical environments
  • Proven experience leading complex, cross‑team technical programs with significant infrastructure or platform components
  • Strong technical foundation in one or more of the following: Cloud infrastructure and distributed systems
  • Large‑scale datacentre delivery projects
  • Hardware‑software integrations (compute, networking, storage, power, cooling)
  • Demonstrated ability to manage execution in ambiguous, fast‑moving environments
  • Excellent written and verbal communication skills, with experience presenting to senior leadership
Job Responsibility
Job Responsibility
  • Own end‑to‑end technical programs focused on accelerating AI deployment timelines, from requirements through live production
  • Drive execution across multiple parallel workstreams, ensuring alignment on scope, milestones, dependencies, risks, and outcomes
  • Establish clear success metrics and mechanisms to track delivery, quality, and velocity
  • Document appropriately all artifacts during deliberative processes and established consensus
  • Partner deeply with hardware engineering, software engineering, infrastructure, networking, data center operations, and supply chain teams to unblock execution
  • Act as the central point of coordination across highly interdependent teams and external partners
  • Influence decision‑making with data, technical insight, and strong executive communication
  • Develop deep working knowledge of AI deployment architectures, including compute (GPU/accelerators), networking, storage, racks, power, cooling, and platform readiness
  • Identify technical risks early and drive mitigation strategies across hardware, firmware, software, and operational domains
  • Translate complex technical concepts into clear, actionable plans for both technical and non‑technical stakeholders
  • Fulltime
Read More
Arrow Right
New

Senior Technical Program Manager – AI Infrastructure, Site Operations

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. ...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years in Technical Program Management, Infrastructure Ops, or Data Center Ops
  • Experience leading large, cross-functional infrastructure programs
  • Strong understanding of: Data center power and cooling fundamentals
  • Network and storage basics
  • Hardware-centric platforms
  • Proven ability to define and operationalize metrics
  • Strong written and executive-level communication skills
Job Responsibility
Job Responsibility
  • Own end-to-end technical programs for data center and site operations
  • Act as single-threaded owner across: Hardware & Systems Engineering
  • AI Cloud Infrastructure & Operations
  • Network & Storage Engineering
  • Facilities, power, cooling, and colo partners
  • Drive site readiness for Cerebras Wafer-Scale Engine systems
  • Partner on installation, commissioning, change management, and break/fix workflows
  • Lead incident reviews and postmortems
  • ensure corrective actions are closed
  • Define and own operational metrics and KPIs, including: Availability and reliability
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right