CrawlJobs Logo

Ai Engineer, Voice Designer

Canada, Kitchener Employment contract 145000.00 - 172500.00 CAD / Year · Job Posted July 03, 2026
Apply Position
Job Link Share

Job Description

As an AI Engineer: Voice Designer, you’ll own the back-end implementation and linguistic optimization of the Text-to-Speech (TTS) layer for our next-generation AI voice agents. You’ll work squarely within our Speech Team—a high-impact R&D and engineering group focused on speech recognition, enhancement, and synthesis. You will bridge the gap between core speech science and product engineering, ensuring our voice agents sound human, context-aware, and trustworthy. You’ll also help create the systems that manage voice personas, tone, and conversational fillers, eventually exposing these as tweakable parameters to our customer-facing UI.

Job Responsibility

  • Own the back-end implementation and linguistic optimization of the Text-to-Speech (TTS) layer for our next-generation AI voice agents
  • Own the integration and optimization of multiple TTS vendor APIs while leading research and prototyping for open-source or in-house TTS architectures
  • Apply expertise in phonetics and sociolinguistics to ensure TTS input is formatted for maximum naturalness, including SSML orchestration and pronunciation handling
  • Craft context-specific utterances to optimize turn handling and build caller trust during agentic thought processes
  • Design and manage LLM and TTS prompts and parameters to define and refine agent personalities across different industry verticals
  • Architect the logic to expose voice attributes (speed, pitch, tone, style) to the product UI, allowing customers to customize their agent’s voice profile
  • Partner with ASR and Audio AI engineers to ensure end-to-end voice quality and minimize latency in the ASR to LLM to TTS pipeline

Requirements

  • Strong Python programming skills and experience with deep learning frameworks (e.g. PyTorch)
  • 3+ years of experience in Speech Synthesis (TTS) or Voice Design, including hands-on work with frameworks like NVIDIA NeMo, ESPnet, or Coqui, and hands-on experience with major TTS APIs such as ElevenLabs, Rime, and Cartesia
  • Degree in Computational Linguistics, Computer Science, or AI/ML with a deep understanding of phonetics, prosody, and syntax
  • Proven experience crafting and evaluating LLM prompts (system, few-shot) and managing structured prompt templates
  • Experience building production-grade APIs and integrating multi-vendor services in a cloud environment (GCP preferred)
  • Knowledge of speech quality metrics (MOS, intelligibility, latency) and the ability to design rigorous A/B tests for voice personas

What we offer

  • Competitive salary
  • comprehensive benefits
  • real opportunities for growth
  • cutting-edge AI tools
  • robust training program

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Ai Engineer, Voice Designer

8 matching positions

New

Senior Ai Voice Engineer

We are looking for a Senior AI Voice Engineer to help build and scale a producti...
Location
Location
United Kingdom
Salary
Salary:
500.00 - 550.00 GBP / Day
arrowsgroup.com Logo
Arrows Groupe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expert Python development
  • Real-time AI systems
  • LLM-powered applications
  • Conversational AI platforms
  • API design and integration
  • Cloud-native architecture
  • Production deployment and scalability
  • Voice, speech or contact centre technology
  • Commercial experience of Azure Voice Live
  • Commercial experience of ElevenLabs
Job Responsibility
Job Responsibility
  • Designing and building production-grade AI voice agent solutions
  • Developing scalable Python services and APIs
  • Working with speech-to-text (STT), large language models (LLMs) and text-to-speech (TTS) technologies
  • Evaluating and recommending architecture decisions across voice AI platforms
  • Building resilient, low-latency conversational systems
  • Integrating AI voice capabilities into existing customer service workflows
  • Defining monitoring, observability and performance standards
  • Helping transition existing prototypes into a secure, scalable production platform
  • Fulltime
Read More
Arrow Right

Staff Voice AI Engineer

Applied AI at Uber builds intelligent systems that power next-generation product...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in software engineering, data science, or machine learning, including a track record of shipping production AI systems
  • Deep understanding of large language models, including fine-tuning, prompt engineering, embeddings, and retrieval-augmented generation (RAG)
  • Strong backend and distributed systems expertise, with experience designing and operating highly available, scalable services in production
  • Deep experience with ML infrastructure, including model training pipelines, online serving systems, feature stores, experiment platforms, and evaluation frameworks
  • Hands-on experience with distributed data processing systems (e.g., Spark, Flink, Ray) and workflow orchestration (e.g., Airflow or equivalent)
  • Ability to analyze data, run experiments, and derive insights for model and product improvement
  • Excellent communication and collaboration skills across technical and non-technical teams
Job Responsibility
Job Responsibility
  • Design and build end-to-end Voice AI solutions, from understanding customer pain points and defining product requirements to deploying LLM-powered, real-time voice interfaces in production
  • Benchmark and evaluate voice AI systems, including speech recognition, speech synthesis, and spoken language understanding, by designing evaluations, analyzing results, and identifying systematic weaknesses
  • Improve voice model performance through system prompt tuning, fine-tuning voice- and speech-specific models, and optimizing architectures for low-latency, real-time voice interactions
  • Analyze voice request logs, prompt traces, and audio inputs to diagnose failure modes, improve transcription accuracy, conversational quality, and overall user experience
  • Build and maintain internal tools and platforms to automate Voice AI workflows, such as large-scale transcription pipelines, real-time audio processing services, and evaluation harnesses for voice quality
  • Own Voice AI systems in production end-to-end, including rollout strategies, monitoring, alerting, quality regression detection, and on-call readiness
  • Collaborate closely with product, design, and research teams to translate user needs into Voice AI capabilities with measurable business and customer impact
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible to participate in a 401(k) plan
  • Eligible for various benefits (details at link)
  • Fulltime
Read More
Arrow Right

Software Engineer - Voice AI Agent

Great customer support requires human agents and AI in perfect balance, and Asse...
Location
Location
United States , San Francisco
Salary
Salary:
135000.00 - 280000.00 USD / Year
assembled.com Logo
Assembled
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in software engineering as an individual contributor
  • Strong proficiency in a modern programming language (Go, Python, C#, etc.)
  • Experience with distributed systems
  • Solid understanding of data structures, algorithms, and software design principles
  • Ownership mindset and proven track record of learning new technologies quickly
  • Enthusiasm and passion for learning AI/ML technologies (no prior experience required)
  • Highly ambitious and driven, setting high goals for yourself and others
  • Put customers first, focusing on solving real problems
  • Enjoy fast-paced environments and can quickly adjust when new insights emerge
  • A bit of a maverick streak that helps you come up with creative solutions
Job Responsibility
Job Responsibility
  • Build foundational voice features: Develop voice-specific product features from the ground up, such as implementing voice recognition capabilities powered by LLMs and intelligent categorization of incoming calls
  • Improve LLM model results for voice applications: Enhance our voice recognition and generation engine using advanced techniques
  • Develop voice AI infrastructure: Architect the abstractions that enable integration of various types of LLMs tailored for voice applications
  • Engage with customers: Collaborate with our customers (both support agents and managers) to understand how they interact with our voice product, and how we can improve their experience
  • Wear many hats: Be versatile in roles — coding, user research, planning, brainstorming, and cross-team collaboration
  • Shape the team culture: Encourage a startup mentality focused on product quality and taking initiative
What we offer
What we offer
  • Generous medical, dental, and vision benefits
  • Paid company holidays, sick time, and unlimited time off
  • Monthly credits to spend on each: professional development, general wellness, Assembled customers, and commuting
  • Paid parental leave
  • Hybrid work model with catered lunches everyday (M-F), snacks, and beverages in our SF & NY offices
  • 401(k) plan enrollment
  • Stock options
  • Fulltime
Read More
Arrow Right

Software Engineer - Voice AI Agent

Our Voice AI team is building autonomous AI agents that handle inbound calls for...
Location
Location
United States
Salary
Salary:
135000.00 - 280000.00 USD / Year
assembled.com Logo
Assembled
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in software engineering as an individual contributor
  • Strong proficiency in a modern programming language (Go, Python, C#, etc.)
  • Experience with distributed systems
  • Solid understanding of data structures, algorithms, and software design principles
  • Ownership mindset and proven track record of learning new technologies quickly
  • Enthusiasm and passion for learning AI/ML technologies (no prior experience required)
  • Highly ambitious and driven, setting high goals for yourself and others
  • Put customers first, focusing on solving real problems
  • Enjoy fast-paced environments and can quickly adjust when new insights emerge
  • A bit of a maverick streak that helps you come up with creative solutions
Job Responsibility
Job Responsibility
  • Building high-quality software for our voice AI platform, from rapid prototypes that push the boundaries of what's possible to production-ready, scalable solutions
  • Continuously improving our AI capabilities and accuracy through experimentation, data analysis, and innovative approaches
  • Implementing and optimizing LLM and voice technology while balancing intelligence, latency, and cost
  • Collaborating across engineering and cross-functional teams to tackle challenging technical problems throughout the full lifecycle of our voice AI products - from ideation and prototyping to deployment and monitoring
  • Build foundational voice features: Develop voice-specific product features from the ground up, such as implementing voice recognition capabilities powered by LLMs and intelligent categorization of incoming calls. You'll help design and build intuitive interfaces for support agents to monitor and interact with AI voice assistants
  • Improve LLM model results for voice applications: Enhance our voice recognition and generation engine using advanced techniques. You'll help us leverage implicit knowledge bases to improve model performance in voice contexts
  • Develop voice AI infrastructure: Architect the abstractions that enable integration of various types of LLMs tailored for voice applications. You'll design and implement evaluation and logging systems to monitor performance
  • Engage with customers: Collaborate with our customers (both support agents and managers) to understand how they interact with our voice product, and how we can improve their experience
  • Wear many hats: Be versatile in roles — coding, user research, planning, brainstorming, and cross-team collaboration
  • Shape the team culture: Encourage a startup mentality focused on product quality and taking initiative
What we offer
What we offer
  • Generous medical, dental, and vision benefits
  • Paid company holidays, sick time, and unlimited time off
  • Monthly credits to spend on each: professional development, general wellness, Assembled customers, and commuting
  • Paid parental leave
  • 401(k) plan enrollment
  • Stock options are provided as part of the compensation package
  • Fulltime
Read More
Arrow Right

AI Research Scientist - Voice AI Team, Meta Superintelligence Labs

Meta is seeking AI Research Scientists to join the Realtime AI Voice team in Met...
Location
Location
United States , Menlo Park, CA +2 locations
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Mathematics, or similar quantitative field
  • 2+ years of post-PhD experience in an academic, industry, or government laboratory setting, with primary responsibilities focused on AI research
  • Proven track record of publications at peer-reviewed AI & speech conferences (e.g. NeurIPS, ICML, ICLR, ICASSP)
  • Experience in training, fine-tuning, and/or experimenting with foundation models beyond black-box use
  • Familiarity with one or more deep learning frameworks (e.g., pytorch, tensorflow)
  • Experience communicating complex research to public audiences of peers
Job Responsibility
Job Responsibility
  • Lead, collaborate, and execute on research that pushes forward the state of the art in speech and large language model research
  • Directly contribute to experiments, including designing experimental details, develop reusable code, running evaluations, and organizing results
  • Help identify long-term research goals as well as intermediate milestones
  • Work cross-functionally to translate research breakthroughs into scalable, production-ready solutions for Meta's conversational AI / product experiences
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI Research Scientist - Voice AI Team

Meta is seeking AI Research Scientists to join the Realtime AI Voice team in Met...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Mathematics, or similar quantitative field
  • Proven track record publications at peer-reviewed AI & speech conferences (e.g. NeurIPS, ICML, ICLR, ICASSP)
  • Experience in training, fine-tuning, and/or experimenting with foundation models beyond black-box use
  • Familiarity with one or more deep learning frameworks (e.g. pytorch, tensorflow, …)
  • Experience to communicate complex research for public audiences of peers
Job Responsibility
Job Responsibility
  • Collaborate, and execute on research that pushes forward the state of the art in speech and large language model research
  • Directly contribute to experiments, including designing experimental details, develop reusable code, running evaluations, and organizing results
  • Help identify long-term research goals as well as intermediate milestones
  • Work cross-functionally to translate research breakthroughs into scalable, production-ready solutions for Meta’s conversational AI / product experiences
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right
New

Sr AI Engineer - Agentic Systems

As a core technical leader within our Agentic AI initiatives, you will shape the...
Location
Location
United States , Anywhere
Salary
Salary:
Not provided
dialpad.com Logo
Dialpad
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of relevant software engineering experience
  • Proven track record of technical leadership (as a Senior, Staff, or Principal Engineer) shipping complex, large-scale systems
  • Strong foundations in scaling distributed systems and production-grade infrastructure
  • Experience with LLM Platforms: Inference optimization and fine-tuning strategies
  • Experience with Data & Retrieval: Advanced retrieval systems and memory architectures
  • Experience with Agent Frameworks like LangChain/LangGraph, CrewAI, or AWS/Google Agent ecosystems
  • Experience with AI Ops: Evaluation, observability, and safety frameworks for production AI systems
  • Experience with Real-Time Infrastructure: Streaming infrastructure and voice/conversational AI
  • Experience with Tool Integration: Tool use, API execution frameworks, and human-in-the-loop validation systems
  • Operational Excellence: Experience setting clear technical goals, identifying architectural risks, and systematically clearing tech-debt gaps
Job Responsibility
Job Responsibility
  • Drive Technical Strategy: Own the architectural roadmap and delivery of Dialpad’s Agentic infrastructure, core orchestration layers, memory architectures, and evaluation/observability systems
  • Build & Scale: Design and deploy scalable, multi-modal AI agents capable of autonomous support, real-time voice reasoning, and secure API tool execution across complex enterprise workflows
  • Mentor & Influence: Act as a technical anchor for the organization, raising the engineering bar, mentoring senior peers, and defining technical standards for an AI-native SDLC
  • Partner Cross-Functionally: Collaborate with leadership across Product, Engineering, and Applied Research to align technical execution with Dialpad’s long-term business strategy
  • Push the Frontier: Research and implement emerging agent frameworks, LLM inference optimization, advanced retrieval systems, and cutting-edge safety/policy guardrails to keep Dialpad at the absolute forefront of the era of the agent
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits
  • Real opportunities for growth
  • Cutting-edge AI tools
  • Robust training program
  • Inclusive office environment
  • Fulltime
Read More
Arrow Right
New

Applied AI Engineer

Infer is building the operating system for insurance agencies. We make AI agents...
Location
Location
India , Bengaluru
Salary
Salary:
2000000.00 - 5000000.00 INR / Year
helpcare.ai Logo
Helpcare AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • ML engineering experience shipping production systems
  • Strong Python and a working ML stack (PyTorch, Huggingface, pandas, scikit-learn)
  • Hands-on experience designing LLM-based agents: prompting, tool/function calling, multi-turn state, structured outputs
  • Hands-on experience building evals or eval frameworks for ML, LLM, or voice systems. Built LLM-as-judge eval pipelines and know their failure modes
  • Practical experience with ASR/STT comparing providers, fine-tuning, or running open models like Whisper
  • Practical experience with TTS systems (ElevenLabs or open models)
  • Comfortable working with audio data: sample rates, codecs, noise, alignment
Job Responsibility
Job Responsibility
  • Building and maintaining the eval framework that scores voice agent quality across transcription, LLM reasoning, tool use, TTS, and full-conversation outcomes
  • Design voice agent behavior: system prompts, tool use, conversation flow, error recovery, and guardrails for real-time interactions
  • Drive STT and TTS accuracy improvements by comparing providers, tuning configurations, and running rigorous A/B experiments the team can act on
  • Drive TTS quality improvements voice selection, latency vs. fidelity tradeoffs, prosody, edge cases
  • Curate and grow our evaluation datasets, including hard-case mining from production traffic
  • You'll build benchmarks we can run against any new model in days, run a red-team pipeline that probes for jailbreaks, hallucinated quotes, and compliance failures
  • Partner with backend engineers to wire eval signals into CI so regressions get caught before they ship
  • Wire eval signals into CI so regressions block merges, and build self-improvement loops where hard cases from production auto-feed the eval set and our prompts optimize themselves over time
  • Fulltime
Read More
Arrow Right