The client is seeking an experienced AI Engineer with a strong focus on Generative AI to design, develop, and deploy advanced AI systems. This role will play a key part in building scalable GenAI solutions and integrating large language models into production environments. The position offers significant ownership, with the opportunity to influence architecture, product direction, and AI strategy from an early stage.
Job Responsibilities
Design, build, and deploy generative AI solutions, including LLM-powered applications
Develop and optimize end-to-end ML pipelines (data processing, training, evaluation, deployment)
Integrate and fine-tune large language models (e.g., models from OpenAI or Anthropic, or open-source models)
Implement advanced GenAI techniques such as retrieval-augmented generation (RAG), prompt engineering, and agents
Build scalable backend services and APIs to serve AI models in production
Collaborate with product and engineering teams to translate business requirements into AI features
Continuously improve model performance, scalability, latency, and cost efficiency
Stay current with emerging trends and advancements in AI, particularly in the generative AI space
Requirements
5+ years of experience in AI/ML engineering or a related field
Strong programming skills in Python
Proven experience building and deploying generative AI or LLM-based applications
Hands-on experience with frameworks such as PyTorch or TensorFlow
Solid understanding of machine learning fundamentals and evaluation techniques
Experience building APIs and production systems (e.g., FastAPI, Flask)
Experience with cloud platforms (AWS, GCP, or Azure)
Strong understanding of prompt engineering, embeddings, and vector search
Ability to thrive in a fast-paced startup environment with high ownership