This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At Sesame, we are building the next generation of voice-based personal agents. We are seeking a Technical Program Manager to bridge the gap between LLM research and a polished, user-ready product. In this role, you will drive the development lifecycle, ensuring novel capabilities are delivered on schedule and rigorously measured against real-world performance. You will also be responsible for the "quality loop": transforming qualitative user feedback into technical evaluation datasets and ensuring our models evolve based on measurable data.
Job Responsibility:
Manage the end-to-end lifecycle of LLM projects, navigating the transition from research milestones to production-level deployments
Transform subjective user feedback into objective metrics and datasets
Design and implement technical evaluations to address issues found in the field
Track internal and external feedback, ensuring identified issues are followed through to resolution in subsequent iterations
Maintain the technical roadmap for voice-based capabilities, proactively identifying dependencies and resolving technical blockers across teams
Ensure the roadmap incorporates the specialized work and constraints of all teams—not just ML—to deliver a cohesive user experience
Requirements:
BS or MS in Computer Science, Electrical Engineering, or a related technical discipline
3+ years of experience managing LLM research or AI/ML product cycles
A solid technical understanding of LLMs (training, prompting, inference, quality) and the unique constraints of voice-based interfaces
Proficiency in Python with the ability to independently write and implement evaluation logic and data-handling scripts
Proven ability to coordinate and align with stakeholders across different functions, managing complex dependencies with great technical depth
Nice to have:
Prior experience with LLM-based voice products and modern generative speech technologies
Experience integrating model evaluation workflows into automated testing and deployment pipelines
Familiarity with the architectural challenges of deploying low-latency, conversational AI agents
What we offer:
401k matching
100% employer-paid health, vision, and dental benefits