Design, own, and maintain the machine learning infrastructure, partnering with cross-functional teams across the lifecycle of key ML initiatives
Own and drive the roadmap for Akuna’s machine learning infrastructure, tooling, and processes
Take a hands-on role designing, building, and maintaining platforms that enable efficient model development, training, deployment, monitoring, and governance
Establish best practices for reproducible ML workflows, model lifecycle management, CI/CD, observability, and production reliability
Operate and continuously improve ML infrastructure to ensure it is scalable, performant, reliable, and easy for researchers and engineers to use
Provide technical leadership and guidance to engineering and research teams adopting ML infrastructure capabilities
Requirements:
Bachelor's degree or higher in Computer Science, Engineering, or a related technical field
At least 3 years of full-time experience building, deploying, or operating machine learning infrastructure in production environments
Hands-on experience with ML infrastructure or data platforms such as Databricks or Spark
Strong programming skills in Python, C++, or both, in Linux-based environments
Strong communication skills, with the ability to partner effectively across engineering, research, and business teams
Demonstrated ownership, technical leadership, and ability to define and execute a roadmap