This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We're hiring a senior AI Engineer to design, build, and ship production AI systems — with strong emphasis on Voice AI. You'll own the full lifecycle: architecture, training, deployment, and monitoring across language and voice modalities.
Job Responsibility
LLM & GenAI: Fine-tune and deploy LLMs
build RAG pipelines and agentic workflows (LangChain, LlamaIndex)
Voice Pipelines: Architect real-time ASR → LLM → TTS pipelines with <300 ms latency
Voice Agents: Build production voice agents with turn-taking, barge-in handling, and emotion-aware dialogue
Speech Fine-Tuning: Adapt ASR/TTS models for domain-specific accents, terminology, and speaking styles
MLOps: Build reproducible ML pipelines (Kubeflow / MLflow)
maintain CI/CD, monitoring, and model versioning
Inference Optimization: Apply quantization (GGUF, GPTQ), distillation, and hardware-aware inference (TensorRT, vLLM) to cut cost and latency
APIs & Services: Ship high-performance inference APIs in Python (FastAPI) or Go on Kubernetes
Data & Evaluation: Curate text + speech corpora
define eval harnesses covering WER, MOS, latency P95, and safety
Requirements
4+ yrs ML/software engineering
2+ yrs on production AI systems
Strong Python
PyTorch or TensorFlow
LLM fine-tuning: LoRA / QLoRA / PEFT
End-to-end ML pipeline experience (train → serve)
Cloud (AWS / GCP / Azure) + Docker / Kubernetes
ASR & TTS integration in real-time streaming systems
VAD, noise suppression, and barge-in handling
Telephony APIs (Twilio, Vonage) or WebRTC experience
Nice to have
Whisper / wav2vec fine-tuning for domain adaptation