This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Our partner is a fast-growing, innovation-driven company building and deploying AI solutions across Space, Manufacturing, AdTech, and FinTech. They combine state-of-the-art research with robust engineering to solve real-world problems at production scale.
LLM Adaptation & Deployment: Fine-tune open-source models and optimize inference for production-scale latency and cost
Advanced RAG: Implement high-performance embedding, retrieval, and re-ranking pipelines for grounded outputs
Structured Generation: Enforce schemas and guardrails to minimize hallucinations and ensure reliable system behavior
Evaluation & Quality: Develop automated evaluation harnesses, regression tests, and versioning for prompts and models
Production Engineering: Ship containerized APIs with full CI/CD, observability, and reliability monitoring (SLOs)
Cross-functional Delivery: Collaborate with product teams to integrate GenAI features and mentor junior engineers
Requirements:
Senior AI Expertise: 5+ years building production ML/AI systems, including 2+ years in lead roles with strong Python engineering (performance, testing, packaging)
LLM & Agentic AI: Hands-on experience with orchestration, tool-calling, and workflow integration, including LLM adaptation (PEFT/LoRA) and safety engineering
Production RAG & Data: Proven track record of operating RAG pipelines, vector databases, and retrieval performance tuning in production
MLOps & Cloud: Proficiency in containerized services (REST/gRPC), CI/CD, and monitoring within cloud environments (AWS/GCP/Azure)
Advanced Optimization: Experience in inference optimization (vLLM/quantization), event-driven orchestration, and automated evaluation (LLM-as-judge)