This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Our client is building the next generation answer engine. They apply LLMs for knowledge search at scale, serving millions of users across the world. Their current backend stack is Python, Rust, TensorRT-LLM, Kubernetes, AWS. You will work on designing and implementing machine learning models that drive our recommendation systems, retrieval algorithms, and classifiers.
Job Responsibility:
Develop, train, and optimize machine learning models for recommendation systems
Build groundwork infrastructure for retrieval
Work with real world data to create scalable feedback loops
Incorporate state of the art LLMs into traditional machine learning systems
Requirements:
Experience with machine learning at scale
Experience with retrieval infrastructure
Creative problem-solving skills with an ability to explore a new landscape of models
Eagerness to build from 0 to 1 and unlock potential in new products
Familiarity with big data pipelines for feature engineering
At least 6 years of experience with machine learning and backend engineering
Applicants must be authorized to work in the United States legally