We're building a next-generation AI-powered platform and web application for creating audio and video content quickly and easily. This involves developing a revolutionary way to record, transcribe, edit, and mix audio and video on the web using state-of-the-art AI models—a challenge that requires solving complex technical problems. We're hiring a senior engineer to join our AI Platform and Enablement team. The ideal candidate thrives in a fast-moving, high-ownership environment and is comfortable navigating the ambiguity of bringing research work into an established product.
Job Responsibilities:
Build, maintain, and standardize third-party model integrations, including consulting for other engineering teams with AI model integration needs
Design, implement, and maintain the AI infrastructure supporting our machine learning life cycle, including data ingestion pipelines, training infrastructure and developer experience, evaluation frameworks, and deployment and GPU infrastructure
Collaborate with Product Managers, Research Engineers, and AI Researchers to understand their infrastructure needs and ensure our AI systems are robust, scalable, and efficient
Optimize and scale our models and algorithms for efficient inference
Deploy, monitor, and manage AI models in production
Requirements:
Experience in deploying and managing AI models in production
Experience with large-volume data pipeline tools such as Spark, Flume, or Dask
Familiarity with cloud platforms (AWS, Google Cloud, Azure) and container technologies (Docker, Kubernetes)
Knowledge of DevOps and MLOps best practices
Strong problem-solving abilities and excellent communication skills
Nice to have:
Experience with generative AI models
Familiarity with audio and video processing
Knowledge of Python, C/C++, CUDA, and experience profiling GPU performance and distributed training runs
Experience with machine learning frameworks such as PyTorch or TensorFlow