Voice is one of the key interfaces through which humans will interact with AI at scale. To make this a reality, we are building the engine for the next generation of AI-driven software. Our primary focus is pushing the boundaries of speech modeling, both speech-to-text (STT) and text-to-speech (TTS). We approach this by researching and applying ML ideas that allow us to achieve state-of-the-art results (we recently ranked #1 on Artificial Analysis for Text-to-Speech models).
Job Responsibilities:
Researching, building, optimizing, and deploying the production ML systems that thousands of developers integrate with their systems
Focusing on the difficult research and engineering problems of building the engine for the next generation of AI-driven software
Requirements:
A PhD in a relevant technical field, or a BA/BS degree with equivalent research and/or engineering experience
5+ years of combined experience in software development (e.g. with Python or C++) and applied ML engineering
Demonstrated experience applying or researching Machine Learning in one or more of the following domains: speech or video processing, Natural Language Processing (NLP), or action planning
Strong foundation in data structures, algorithms, and neural network architectures
Proficiency with ML frameworks such as PyTorch
Professional working proficiency in English
Nice to have:
A passion for learning and staying up-to-date with the latest advancements in ML/Voice AI research and its applications
Ability to work collaboratively in a fast-paced environment with shifting priorities
Familiarity with pre-training, fine-tuning, RLHF and evaluation of large language and speech models
Knowledge of working with embedded systems and/or running ML on edge devices