Technical Program Manager, Platform Job at Scale (San Francisco, CA)

Job Description

As a Technical Program Manager for the Platform team, you will partner with engineering teams to directly accelerate the development and maturity of the Scale Generative AI Platform (SGP). We are looking for a TPM who has actively built and shipped products in the past and understands how to deliver robust, scalable developer tooling and distributed systems. In this role, you will own the strategic alignment and end-to-end execution of our most critical infrastructure initiatives—from initial scoping to measurable, company-wide and customer-ready adoption. You will serve as the core communication backbone and connective tissue between platform engineering, product teams, and executive leadership. Operating in a hyper-growth, demanding AI environment, you will translate SGP's architectural complexities into clear execution strategies, unblock engineering bottlenecks, proactively mitigate deployment risks, and ensure our foundational platforms deliver reliable, performant, and secure systems capable of global-scale deployment.

Job Responsibility

Lifecycle & Platform Delivery: Lead strategic planning and high-velocity execution for SGP core capabilities (orchestration layers, model serving, APIs). Manage features from technical scoping and architecture design through production launch
Cross-Functional GenAI Alignment: Drive execution and manage complex technical dependencies across systems engineering, Core ML, Research, and Product teams to deliver unified SGP capabilities with architectural consistency
Technical Translation & Requirements: Translate complex infrastructure metrics (LLM inference optimization, GPU utilization, compute orchestration) into actionable roadmaps. Map demands like multi-tenancy, data privacy, and isolation into platform features
Risk & Dependency Mitigation: Proactively identify, track, and mitigate technical risks unique to massive-scale GenAI infrastructure and global SGP deployments, maintaining momentum despite fast-evolving AI frameworks
Developer Velocity & Operational Excellence: Establish lightweight agile processes that empower engineers to ship fast without breaking core systems. Define and enforce clear SLOs and performance benchmarks to guarantee production-grade reliability for clients
Metrics-Driven Adoption: Track and report on SGP adoption metrics, system reliability, delivery forecasts, and engineering bottlenecks directly to executive leadership to ensure the platform scales responsibly.

Requirements

5+ years of experience as a Technical Program Manager, Product Manager, or Software Engineer, with a proven track record of having built and shipped technical products or platforms from scratch (e.g., internal cloud infrastructure, developer APIs, distributed systems, or ML platforms)
Platform Domain Expertise: 3+ years of dedicated experience managing programs focused directly on core engineering infrastructure, cloud-native ecosystems (AWS/GCP), container orchestration (Kubernetes), or distributed systems
AI/ML Infrastructure Literacy: Foundational understanding of the infrastructure required for the Generative AI lifecycle, including high-throughput data pipelines, GPU/CPU cluster utilization, or model training/evaluation setups
Masterful Communication: Proven track record of presenting to and influencing executive-level stakeholders, with the ability to translate complex technical/architectural challenges into clear business impacts
Execution Excellence: Advanced proficiency with iterative development methodologies and modern project management tooling (Linear, Jira, etc.) applied to foundational infrastructure environments.

Nice to have

Engineering Roots: Strong software engineering fundamentals, with prior professional experience as a Software Engineer, DevOps Engineer, or Data Developer before transitioning into program management
Platform Adoption Track Record: Proven success driving the internal adoption of technical platforms, SDKs, or APIs across disparate, fast-moving product lines
Data-Centric AI Familiarity: Direct experience working with large-scale data quality pipelines, distributed vector databases, or specialized AI inference engines (e.g., Triton, Ray).

What we offer

comprehensive health, dental and vision coverage
retirement benefits
a learning and development stipend
generous PTO
commuter stipend

Scale - All Job Offers

Select Country

Technical Program Manager, Platform

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Technical Program Manager, Platform

Principal Platform Technical Program Manager

Platform Technical Program Manager - NPI

Technical Program Manager, AI Platform

Senior Principal Technical Program Manager - ML Platform

Lead Principal Technical Program Manager - ML Platform

Technical Program Manager – Device & Platform Programs

Technical Program Manager - ADAS Software Platform

Technical Program Manager - Cloud & AI Platform

Our AI answers in your language