The Voice AI team in EMEA, part of Meta Superintelligence Labs, is looking for a Research Scientist (Speech and Language). The Voice AI team works on large language models (LLMs) with native support for processing, understanding, and generating audio and speech as a modality alongside others such as text or vision. As part of this, we draw on knowledge in areas such as speech/audio encoders and tokenizers, pre-training, post-training, (online) reinforcement learning, LLM alignment, multimodal modelling, speech and audio processing, speech recognition (ASR), speech synthesis (TTS), and multilingual modelling. Our work focuses on advancing core technologies that drive product experiences at Meta, such as video dubbing on IG/FB or Meta AI, which is available on, e.g., Ray-Ban Meta glasses or within WhatsApp.
Job Responsibilities:
Apply relevant AI and machine learning techniques to build and advance audio and speech technologies using large language models that can be applied to a wide range of Meta production use cases
Work towards long-term ambitious research and productization goals, while identifying intermediate milestones
Work with large datasets and contribute to the development of large-scale foundation models
Influence progress of relevant research communities by producing publications
Requirements:
PhD degree in Artificial Intelligence (AI), computer science, or a related technical field with 1+ years of experience, or BS degree with 3+ years of industrial research experience in a related field
AI research experience in the domains of audio and speech processing
First-author publications at peer-reviewed AI conferences (e.g. Interspeech, ICASSP, ASRU, SLT, NeurIPS, CVPR, ICML, ICLR, ICCV, ACL)
Strong skills in communicating complex research to public audiences or peers
Experience developing machine learning algorithms in e.g. Python, PyTorch, C/C++
Nice to have:
Research experience in generative AI, especially in building and optimising large language models (beyond black-box use) for audio/speech processing and understanding, computer vision, and/or natural language understanding
Additional AI research experience in computer vision and/or NLP
Previous internship(s) and/or research assistantship(s) in an AI research organization
Industry experience working on speech, language, and LLM-related topics, including applying relevant AI and machine learning techniques to build intelligent, rich speech and language systems that improve product experiences
Interest in taking new research findings in this area and implementing them to meet product needs