CrawlJobs Logo

AI Engineer (AI Voice)

Spain, Barcelona · Job Posted April 23, 2026

Job offer has expired

Job Link Share

Job Description

Our client is a pioneering HealthTech scale-up developing a first-of-its-kind AI assistant designed by clinicians, for clinicians. They are tackling one of the most urgent global challenges: the critical shortage of healthcare professionals. By building a controllable and highly customizable ‘AI brain,’ they provide real-time, reliable support across the entire care continuum. This is a chance to work on a product where your code directly alleviates the burden on frontline medical staff. As AI Engineer, you will own and evolve the core “brain” service of this company platform. This is not about building simple wrappers; you will architect sophisticated multi-agent systems that reason and communicate in real-time via voice and text. You will tackle high-stakes challenges in low-latency streaming, autonomous orchestration, and continuous evaluation, shipping fast-moving Python services at the intersection of cutting-edge AI and human-centric health technology.

Job Responsibility

  • Lead the end-to-end architecture and evolution of the 'core brain' service, taking full responsibility for SLAs, latency budgets, and deployment strategies
  • Design and operate low-latency communication systems featuring streaming voice/text, Voice Activity Detection (VAD), and complex interaction handling (barge-in, turn-taking, and interruptions)
  • Build sophisticated multi-agent systems using planner–executor–critic patterns, shared memory, and advanced coordination protocols
  • Implement and refine complex reasoning frameworks, including ReAct and Chain-of-Thought (CoT), as well as Tree/Graph-of-Thought architectures where applicable
  • Leverage programmatic optimization tools (such as DSPy, MiPRO, or GEPA) to compile and evolve prompts iteratively under strict evaluation constraints
  • Develop robust Retrieval-Augmented Generation (RAG) pipelines focusing on high-signal retrieval, hybrid search, re-ranking, and query rewriting to ensure grounded and faithful AI responses
  • Architect a comprehensive evaluation framework—from pre-call safety checks to post-call automated evals (hallucination detection, red-teaming)—using OpenTelemetry and structured logging to monitor drift and performance in real-time
  • Ship high-quality, production-ready Python services using FastAPI, ensuring high performance and continuous integration/deployment (CI/CD) gates

Requirements

  • Extensive experience with Python, FastAPI, Pydantic, and asyncio for high-performance service development
  • Proven track record with Multi-agent systems, Vector Stores, and advanced RAG architectures
  • Proficiency in Docker, Kubernetes (K8s), and Terraform for scalable deployments
  • Familiarity with STT/TTS (Speech-to-Text/Text-to-Speech) and monitoring via OpenTelemetry (OTEL)
  • Full Professional Level of English

Nice to have

  • Hands-on experience with WebRTC stacks, LiveKit, and SIP gateways
  • Deep dive into DeepEval or similar LLM-as-a-judge frameworks
  • Experience using DSPy for automated prompt compilation and self-improving workflows

What we offer

  • Permanent contract with a long-term vision and deep investment in your professional journey
  • Build cutting-edge, real-time agent technology within a best-in-class HealthTech team where your code has a direct impact on global healthcare
  • Join a highly motivating atmosphere that fosters continuous learning, peer-to-peer mentorship, and the freedom to experiment with the latest AI breakthroughs
  • Truly flexible work-life integration with Remote-first or Hybrid options in our Barcelona hub
  • Engaging team-building events and fun off-sites in Barcelona to connect with a diverse, international team
  • High-tech laptop of your choice and a budget for solid dev ergonomics to ensure your workspace is optimized for peak performance
  • Access to the latest tools, research papers, and internal knowledge-sharing sessions on the frontier of Multi-agent systems and LLMs

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

AI Engineer (AI Voice)

8 matching positions

Middle Software Engineer - AI Voice Systems

We are looking for a Middle Software Developer with a strong Python background, ...
Location
Location
Ukraine
Salary
Salary:
Not provided
sigma.software Logo
Sigma Software Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Python / strong
  • TypeScript / strong
  • Kubernetes / good
  • English / strong
Job Responsibility
Job Responsibility
  • Backend services
  • External provider integrations
  • Contribute to ongoing development of the existing platform
What we offer
What we offer
  • Diversity of Domains & Businesses
  • Variety of technology
  • Health & Legal support
  • Active professional community
  • Continuous education and growing
  • Flexible schedule
  • Remote work
  • Outstanding offices (if you choose it)
  • Sports and community activities
  • Fulltime
Read More
Arrow Right

Staff Voice AI Engineer

Applied AI at Uber builds intelligent systems that power next-generation product...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in software engineering, data science, or machine learning, including a track record of shipping production AI systems
  • Deep understanding of large language models, including fine-tuning, prompt engineering, embeddings, and retrieval-augmented generation (RAG)
  • Strong backend and distributed systems expertise, with experience designing and operating highly available, scalable services in production
  • Deep experience with ML infrastructure, including model training pipelines, online serving systems, feature stores, experiment platforms, and evaluation frameworks
  • Hands-on experience with distributed data processing systems (e.g., Spark, Flink, Ray) and workflow orchestration (e.g., Airflow or equivalent)
  • Ability to analyze data, run experiments, and derive insights for model and product improvement
  • Excellent communication and collaboration skills across technical and non-technical teams
Job Responsibility
Job Responsibility
  • Design and build end-to-end Voice AI solutions, from understanding customer pain points and defining product requirements to deploying LLM-powered, real-time voice interfaces in production
  • Benchmark and evaluate voice AI systems, including speech recognition, speech synthesis, and spoken language understanding, by designing evaluations, analyzing results, and identifying systematic weaknesses
  • Improve voice model performance through system prompt tuning, fine-tuning voice- and speech-specific models, and optimizing architectures for low-latency, real-time voice interactions
  • Analyze voice request logs, prompt traces, and audio inputs to diagnose failure modes, improve transcription accuracy, conversational quality, and overall user experience
  • Build and maintain internal tools and platforms to automate Voice AI workflows, such as large-scale transcription pipelines, real-time audio processing services, and evaluation harnesses for voice quality
  • Own Voice AI systems in production end-to-end, including rollout strategies, monitoring, alerting, quality regression detection, and on-call readiness
  • Collaborate closely with product, design, and research teams to translate user needs into Voice AI capabilities with measurable business and customer impact
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible to participate in a 401(k) plan
  • Eligible for various benefits (details at link)
  • Fulltime
Read More
Arrow Right

Software Engineer - Voice AI Agent

Great customer support requires human agents and AI in perfect balance, and Asse...
Location
Location
United States , San Francisco
Salary
Salary:
135000.00 - 280000.00 USD / Year
assembled.com Logo
Assembled
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in software engineering as an individual contributor
  • Strong proficiency in a modern programming language (Go, Python, C#, etc.)
  • Experience with distributed systems
  • Solid understanding of data structures, algorithms, and software design principles
  • Ownership mindset and proven track record of learning new technologies quickly
  • Enthusiasm and passion for learning AI/ML technologies (no prior experience required)
  • Highly ambitious and driven, setting high goals for yourself and others
  • Put customers first, focusing on solving real problems
  • Enjoy fast-paced environments and can quickly adjust when new insights emerge
  • A bit of a maverick streak that helps you come up with creative solutions
Job Responsibility
Job Responsibility
  • Build foundational voice features: Develop voice-specific product features from the ground up, such as implementing voice recognition capabilities powered by LLMs and intelligent categorization of incoming calls
  • Improve LLM model results for voice applications: Enhance our voice recognition and generation engine using advanced techniques
  • Develop voice AI infrastructure: Architect the abstractions that enable integration of various types of LLMs tailored for voice applications
  • Engage with customers: Collaborate with our customers (both support agents and managers) to understand how they interact with our voice product, and how we can improve their experience
  • Wear many hats: Be versatile in roles — coding, user research, planning, brainstorming, and cross-team collaboration
  • Shape the team culture: Encourage a startup mentality focused on product quality and taking initiative
What we offer
What we offer
  • Generous medical, dental, and vision benefits
  • Paid company holidays, sick time, and unlimited time off
  • Monthly credits to spend on each: professional development, general wellness, Assembled customers, and commuting
  • Paid parental leave
  • Hybrid work model with catered lunches everyday (M-F), snacks, and beverages in our SF & NY offices
  • 401(k) plan enrollment
  • Stock options
  • Fulltime
Read More
Arrow Right

Software Engineer - Voice AI Agent

Our Voice AI team is building autonomous AI agents that handle inbound calls for...
Location
Location
United States
Salary
Salary:
135000.00 - 280000.00 USD / Year
assembled.com Logo
Assembled
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in software engineering as an individual contributor
  • Strong proficiency in a modern programming language (Go, Python, C#, etc.)
  • Experience with distributed systems
  • Solid understanding of data structures, algorithms, and software design principles
  • Ownership mindset and proven track record of learning new technologies quickly
  • Enthusiasm and passion for learning AI/ML technologies (no prior experience required)
  • Highly ambitious and driven, setting high goals for yourself and others
  • Put customers first, focusing on solving real problems
  • Enjoy fast-paced environments and can quickly adjust when new insights emerge
  • A bit of a maverick streak that helps you come up with creative solutions
Job Responsibility
Job Responsibility
  • Building high-quality software for our voice AI platform, from rapid prototypes that push the boundaries of what's possible to production-ready, scalable solutions
  • Continuously improving our AI capabilities and accuracy through experimentation, data analysis, and innovative approaches
  • Implementing and optimizing LLM and voice technology while balancing intelligence, latency, and cost
  • Collaborating across engineering and cross-functional teams to tackle challenging technical problems throughout the full lifecycle of our voice AI products - from ideation and prototyping to deployment and monitoring
  • Build foundational voice features: Develop voice-specific product features from the ground up, such as implementing voice recognition capabilities powered by LLMs and intelligent categorization of incoming calls. You'll help design and build intuitive interfaces for support agents to monitor and interact with AI voice assistants
  • Improve LLM model results for voice applications: Enhance our voice recognition and generation engine using advanced techniques. You'll help us leverage implicit knowledge bases to improve model performance in voice contexts
  • Develop voice AI infrastructure: Architect the abstractions that enable integration of various types of LLMs tailored for voice applications. You'll design and implement evaluation and logging systems to monitor performance
  • Engage with customers: Collaborate with our customers (both support agents and managers) to understand how they interact with our voice product, and how we can improve their experience
  • Wear many hats: Be versatile in roles — coding, user research, planning, brainstorming, and cross-team collaboration
  • Shape the team culture: Encourage a startup mentality focused on product quality and taking initiative
What we offer
What we offer
  • Generous medical, dental, and vision benefits
  • Paid company holidays, sick time, and unlimited time off
  • Monthly credits to spend on each: professional development, general wellness, Assembled customers, and commuting
  • Paid parental leave
  • 401(k) plan enrollment
  • Stock options are provided as part of the compensation package
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Voice AI

Nooks is the AI Sales Assistant Platform (ASAP) that automates the busywork so r...
Location
Location
United States , San Francisco
Salary
Salary:
250000.00 - 325000.00 USD / Year
nooks.ai Logo
Nooks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6–10+ years backend/infra engineering with real-time A/V or telephony systems
  • Hands-on experience with WebRTC, SIP, Twilio, or similar stacks
  • Strong foundation in distributed systems and low-latency infra
  • Experience debugging and optimizing QoS (latency, jitter, packet loss)
Job Responsibility
Job Responsibility
  • Architect and improve real-time voice infra (WebRTC/SIP/Twilio)
  • Ensure call quality and latency meet SLA targets globally
  • Build services for advanced features (call transfers, monitoring, recordings)
  • Develop observability and debugging tools for call flows
  • Partner with Product/Support to enable new A/V features and fast issue resolution
  • Own backend infrastructure for real-time audio/video calling (Twilio, Salesfloor, A/V quality, recordings, transcriptions) to deliver a dialer that scales reliably to 10× volume
What we offer
What we offer
  • equity
  • comprehensive health, dental, vision, life and disability insurance coverage
  • hybrid work
  • unlimited paid time off
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Voice AI

Own backend infrastructure for real-time audio/video calling (Twilio, Salesfloor...
Location
Location
United States , San Francisco
Salary
Salary:
215000.00 - 280000.00 USD / Year
nooks.ai Logo
Nooks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6–10+ years backend/infra engineering with real-time A/V or telephony systems
  • Hands-on experience with WebRTC, SIP, Twilio, or similar stacks
  • Strong foundation in distributed systems and low-latency infra
  • Experience debugging and optimizing QoS (latency, jitter, packet loss)
Job Responsibility
Job Responsibility
  • Architect and improve real-time voice infra (WebRTC/SIP/Twilio)
  • Ensure call quality and latency meet SLA targets globally
  • Build services for advanced features (call transfers, monitoring, recordings)
  • Develop observability and debugging tools for call flows
  • Partner with Product/Support to enable new A/V features and fast issue resolution
What we offer
What we offer
  • equity
  • comprehensive health, dental, vision, life and disability insurance coverage
  • hybrid work
  • unlimited paid time off
  • Fulltime
Read More
Arrow Right

Software Engineer, Voice AI

Humanoid robots doing backflips are super cool, but to be useful they need to be...
Location
Location
United States , San Francisco
Salary
Salary:
150000.00 - 225000.00 USD / Year
workatastartup.com Logo
YC Work at a Startup
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with cloud development (AWS, GCP, Kubernetes)
  • Experience with latency sensitive applications (voice pipelines, video games, streaming)
  • Experience with voice AI
  • Excited to take high ownership of product features
  • Addiction to observability metrics
  • Excellent taste in developer tooling
Job Responsibility
Job Responsibility
  • Use robots that cost more than most cars to evaluate and test new features
  • Iterate on software deployed to customer's robots in the real world
  • Improve our composite voice pipelines using the latest state of the art models
  • Add and improve features like knowledge storage/retrieval, smarter turn taking, automatic personality tuning
  • Create clever approaches to reducing latency and make interactions feel more natural
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (AI Voice Agents)

Working closely with the Engineering Manager and Product Lead, you will be a Mid...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
heidihealth.com Logo
Heidi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Mastery of Fullstack fundamentals: proficient in Python and modern frontend frameworks (React/TypeScript), capable of owning a feature from database schema to UI interaction
  • Applied AI & Voice fluency: working knowledge of LLM integration (RAG, prompt engineering) and audio technologies (ASR, speech processing)
  • Pragmatic problem solving: balance engineering purity with the need for speed
  • Cloud fluency (AWS or GCP): can spin up own infrastructure (containers, serverless functions) and manage CI/CD pipelines
  • Rigorous testing in production: implement observability and feedback loops to monitor AI features in the wild
Job Responsibility
Job Responsibility
  • Build end-to-end AI features: Architect and ship fullstack solutions (from React frontends to Python backend services) that leverage voice AI and LLMs to automate clinical workflows
  • Operationalize Voice AI: Implement and fine-tune audio processing pipelines, ensuring Automatic Speech Recognition (ASR) and LLM agents perform accurately in diverse medical environments
  • Bridge the gap between model and product: Translate complex feedback from clinicians into technical solutions, rapidly prototyping and deploying improvements to model behavior, prompting strategies, and audio handling
  • Optimise for real-time interaction: Tune fullstack performance to handle real-time audio streaming and token generation, minimizing latency
  • Partner with implementation and clinical teams: Shorten the feedback loop by shipping critical integrations and feature requests from concept to production in days, not quarters
What we offer
What we offer
  • Flexible hybrid working environment, with 3 days in the office
  • Additional paid day off for your birthday and wellness days
  • Special corporate rates at Anytime Fitness in Melbourne, Sydney tbc
  • A generous personal development budget of $500 per annum
  • Learn from some of the best engineers and creatives, joining a diverse team
  • Become an owner, with shares (equity) in the company
  • Fulltime
Read More
Arrow Right