CrawlJobs Logo

Lead Data Engineer - AI Search

France, Paris · Job Posted June 08, 2026
Apply Position
Job Link Share

Job Description

We are looking for a Lead Data Engineer — AI Search (all genders) to join Valtech and lead enterprise-scale AI, search, and conversational platform initiatives. At Valtech, we design and build scalable, high-performance digital solutions at the intersection of cloud engineering, AI, data, and experience design. The Tech Lead is responsible for mature study and translating architecture into scalable and production-ready solutions. This role drives detailed technical design choices, secures delivery quality, and coordinates development and integration activities across multidisciplinary teams.

Job Responsibility

  • Define detailed technical design, components, interfaces, and integration patterns
  • Lead development teams and enforce engineering standards and best practices
  • Coordinate architecture, cloud/platform, data, QA, and business stakeholders
  • Ensure delivery quality through testing, observability, and application security practices
  • Guide teams on GCP and AI Search implementation patterns
  • Lead technical discovery, estimation, and solution design workshops
  • Identify technical risks and propose mitigation strategies early
  • Ensure scalable, reliable, and production-ready AI solutions
  • Contribute hands-on in complex and high-impact technical areas
  • Support industrialization and operationalization of AI and search use cases

Requirements

  • 7+ years of professional software engineering experience in enterprise environments
  • Strong expertise in Google Cloud Platform (GCP) and AI ecosystems
  • Experience with AI Search, conversational AI, and generative AI integrations
  • Strong application architecture, API design, and enterprise integration experience
  • Solid understanding of cloud-native, microservices, and event-driven architectures
  • Experience with CI/CD pipelines, software quality, observability, and cloud security best practices
  • Experience working with Kubernetes and Infrastructure as Code
  • Technical leadership experience, including mentoring engineers and guiding technical decisions
  • Ability to collaborate across architecture, engineering, cloud, and data teams
  • Strong communication and stakeholder management skills
  • Experience delivering enterprise-scale digital solutions in Agile environments
  • Backend development experience with Java and/or Python
  • Advanced level of French and English

What we offer

  • Flexibility, with remote and hybrid work options (country-dependent)
  • Career advancement, with international mobility and professional development programs
  • Learning and development, with access to cutting-edge tools, training and industry experts

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Lead Data Engineer - AI Search

8 matching positions

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...
Location
Location
Poland , Kraków
Salary
Salary:
Not provided
teamquest.pl Logo
TeamQuest Sp. z o. o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • GCP Expertise (verifiable - ask for production examples): GCP is their primary cloud not secondary experience alongside AWS/Azure. Production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity. Has designed for GCP from scratch, not migrated from another cloud, end-to-end ownership
  • AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only. Strong track record building distributed systems and integrating LLMs.
  • Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer. RAG pipelines shipped to production. Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration. AI agent workflows, ReAct prompting, and Function Calling in production environments
  • Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed. Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls.
  • API Design & Integrations: Proven ability to create secure, high-performance APIs capable of asynchronously managing traffic and communication between multiple decoupled services.
  • Enterprise Security: Practical knowledge of data isolation in multi-tenant SaaS architectures, IAM, and securing cloud-based environments.
  • Vector Databases: Hands-on experience with semantic search and at least one of: Pinecone, Weaviate, pgvector, or Vertex Matching Engine.
Job Responsibility
Job Responsibility
  • System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
  • AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
  • Hands-on Core Development: Actively contribute to core system development-coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
  • Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
  • Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources
What we offer
What we offer
  • Attractive salary
  • Full remote work
  • Social benefits:sporto card,healthcare insurance
  • Fulltime
Read More
Arrow Right

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...
Location
Location
Poland , Katowice
Salary
Salary:
Not provided
teamquest.pl Logo
TeamQuest Sp. z o. o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • GCP Expertise (verifiable - ask for production examples): production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity
  • Has designed for GCP from scratch, not migrated from another cloud, end-to-end ownership
  • AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only
  • Strong track record building distributed systems and integrating LLMs
  • Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer
  • RAG pipelines shipped to production
  • Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration
  • AI agent workflows, ReAct prompting, and Function Calling in production environments
  • Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed
  • Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls
Job Responsibility
Job Responsibility
  • System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
  • AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
  • Hands-on Core Development: Actively contribute to core system development-coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
  • Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
  • Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources
What we offer
What we offer
  • Attractive salary
  • Full remote work
  • Social benefits: sport card, healthcare insurance
  • Fulltime
Read More
Arrow Right

Lead Ai Engineer

Fourth is the world’s largest and fastest-growing global leader of end-to-end re...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
hotschedules.com Logo
HotSchedules Corporate
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years experience in machine learning, AI engineering or software development, with a demonstrated track record of deploying machine learning models and AI solutions in production environments
  • Strong Python skills for building production AI services
  • Hands‑on experience running LLM‑based systems in production
  • Experience with RAG architectures, embeddings, and vector search
  • Proven ability to design scalable, maintainable systems and APIs
  • Solid understanding of production operations (monitoring, logging, error handling)
  • Cloud deployment experience (Azure preferred)
  • Strong understanding of data pipelines, SQL, and data quality
  • Working knowledge of AI governance, access control, and responsible AI practices
Job Responsibility
Job Responsibility
  • Own AI engineering and architectural decisions across enterprise solutions
  • Design and build production‑grade AI systems and pipelines
  • Take AI solutions from prototype to reliable production
  • Define engineering standards, quality bars, and production readiness
  • Shape feasibility, prioritization, and technical approach for AI initiatives in partnership with the Director, without owning overall AI strategy
  • Translate business problems into scalable, technically sound AI solutions
  • Work with Data, Security, Legal, Compliance and others on safe AI delivery
What we offer
What we offer
  • 25+ days off, as well as birthday day off and 4 charity days off per year
  • Flexible start and end of the working day and hybrid working mode, including a combination remote and in the office
  • Team-centric atmosphere
  • Encouraging healthy lifestyle and work-life balance including supplemental health insurance
  • New parents bonus scheme
  • Fulltime
Read More
Arrow Right

Lead AI Engineer

We’re looking for a hands-on Lead AI Engineer (6–10 years) who can: Lead design ...
Location
Location
India , Chennai
Salary
Salary:
Not provided
itechindia.co Logo
iTech
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6–10 years total experience
  • 4+ years in AI / DL / NLP in production environments
  • Proven experience taking AI solutions from PoC to production
  • Strong in Python and core CS fundamentals (data structures, algorithms, clean code)
  • Hands-on with AI / NLP / LLMs – classification, NER, QA, summarization, document understanding
  • Experience with some of: PyTorch / TensorFlow / Hugging Face / LangChain / LlamaIndex / spaCy
  • Built and deployed RESTful ML services / APIs integrated with web or mobile applications
  • Worked with Docker and CI/CD pipelines
  • experience of AWS or GCP
  • Good SQL skills and experience with at least one RDBMS (PostgreSQL/MySQL)
Job Responsibility
Job Responsibility
  • Own the end-to-end lifecycle of AI features – from requirement understanding and solution design to development, deployment and monitoring
  • Build LLM/RAG-based solutions using vector databases (e.g., pgvector, FAISS, Chroma, OpenSearch) for search, QA, extraction and summarization over documents
  • Develop Document AI models / pipelines for OCR post-processing, entity/key–value extraction, classification and layout/table understanding
  • Deploy models as scalable APIs / microservices in cloud and on-prem environments using Python, Docker and CI/CD
  • Work directly with client stakeholders to refine requirements, present solutions and handle reviews / clarifications
  • Act as technical lead on 2–3 AI projects, guide the team on architecture, design and best practices
  • Mentor junior ML engineers via code reviews, design discussions and technical coaching
  • Design with security, privacy and cost in mind, especially for data-sensitive domains (BFSI, healthcare, legal, HR)
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

As Lead Data Engineer, you will own and scale the data platform that powers AirO...
Location
Location
United States , New York City; San Francisco
Salary
Salary:
Not provided
airops.com Logo
AirOps
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in data engineering with 2+ years leading projects
  • Expert SQL and Python with deep experience building production pipelines at scale
  • Hands-on with dbt and a workflow manager such as Airflow or Prefect
  • Strong background in dimensional and event-driven modeling and a company-wide metrics layer
  • Experience with Snowflake or BigQuery, plus Postgres for transactional use cases
  • Track record building data products for analytics and customer reporting
  • Cloud experience on AWS or GCP and infrastructure as code such as Terraform
Job Responsibility
Job Responsibility
  • Data platform ownership: design, build, and operate batch and streaming pipelines that ingest data from crawlers, partner APIs, product analytics, and CRM
  • Core modeling: define and maintain company-wide models for content entities, search queries, rankings, AI agent answers, engagement, and revenue attribution
  • Orchestration and CI: implement workflow management with Airflow or Prefect, dbt-based transformations, version control, and automated testing
  • Data quality and observability: set SLAs, add tests and data contracts, monitor lineage and freshness, and lead root cause analysis
  • Warehouse and storage: run Snowflake or BigQuery and Postgres with strong performance, cost management, and partitioning strategies
  • Semantic layer and metrics: deliver clear, documented metrics datasets that power dashboards, experiments, and product activation
  • Product and customer impact: partner with Product and Customer teams to define tracking plans and measure content impact across on-site and off-site channels
  • Tooling and vendors: evaluate, select, and integrate the right tools for ingestion, enrichment, observability, and reverse ETL
  • Team leadership: hire, mentor, and level up data and analytics engineers
  • establish code standards, review practices, and runbooks
What we offer
What we offer
  • Equity in a fast-growing startup
  • Competitive benefits package tailored to your location
  • Flexible time off policy
  • Parental Leave
  • A fun-loving and (just a bit) nerdy team that loves to move fast!
  • Fulltime
Read More
Arrow Right

Lead AI Engineer

Linnify is a global technology partner that helps visionary companies accelerate...
Location
Location
Romania , Cluj-Napoca
Salary
Salary:
Not provided
linnify.com Logo
Linnify
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on experience in AI/ML engineering, with at least 2 years building agentic or generative AI applications
  • Proven experience leading complex AI initiatives in production, from architecture through deployment
  • Deep expertise with LangChain, LangGraph, or OpenAI Agents and agentic workflows
  • Strong understanding of foundation models, LLMs, and orchestration challenges
  • Hands-on experience designing and deploying Retrieval-Augmented Generation (RAG) systems
  • Experience with fine-tuning LLMs using supervised or reinforcement learning approaches
  • Strong understanding of vector search systems (e.g., Pinecone, Weaviate, FAISS, Qdrant)
  • Experience with evaluation pipelines and model monitoring in production environments
  • Excellent communication skills and ability to lead cross-functional technical discussions
  • Languages: Python (required), familiarity with JS/TypeScript is a plus
Job Responsibility
Job Responsibility
  • Architect and lead development of AI-powered applications, especially those using agentic and LLM-based architectures
  • Own and evolve end-to-end AI workflows, including RAG pipelines, prompt orchestration, tool usage, and safety layers
  • Design and implement multi-agent systems with planning, delegation, memory, and inter-agent communication
  • Oversee fine-tuning of foundational models for downstream tasks in various domains
  • Define and monitor evaluation strategies to assess model performance, consistency, and reliability in production
  • Lead implementation of prompt safety, content moderation, and guardrails across AI components
  • Guide the integration of OCR and data extraction flows to process unstructured data
  • Provide technical leadership across AI initiatives and mentor mid-level and junior engineers
  • Collaborate with product and design teams to shape scalable, production-ready ML solutions
  • Stay up to date with AI research and tools, balancing innovation with production readiness
What we offer
What we offer
  • Flexible work schedule and remote work days
  • ESOP (Employee Stock Ownership Plan) so that you grow with us
  • Additional loyalty vacation days (up to 28 total vacation days based on experience and time with us)
  • Health package via Regina Maria Clinic
  • Meal tickets (40 RON/day)
  • Discounted gym subscription
  • Access to role-specific certifications/courses
  • Regular team knowledge-sharing sessions and mentoring opportunities
  • Clear visibility into your progress and opportunities for advancement
  • Top-tier gear: Receive your own MacBook for high-performance development
  • Fulltime
Read More
Arrow Right

Senior Staff Data Engineer- ML & AI Platform

At Marktplaats, data is at the heart of everything we do, but Intelligence is wh...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
adevinta.com Logo
Adevinta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience with a specific focus on the intersection of Data Engineering, MLOps, and AI Infrastructure
  • Deep knowledge of Spark internals, structured streaming, and performance tuning for large-scale data processing
  • Proven experience architecting end-to-end ML platforms for Traditional ML (Classic MLOps) while actively enabling the organization on Generative AI concepts
  • Strong background in building automated pipelines and ensuring system observability
  • Practical experience building infrastructure for Large Language Models, including managing the complexity of chaining models and tools
  • Solid experience serving models at low latency and high concurrency using containerized solutions
  • Ability to speak the language of AI/ML Engineers and effectively bridge the gap between experimental code and production systems
  • Expert level Python
  • Experience with PyTorch, Terraform, Terragrunt, Docker, Kubernetes, GitHub Actions, Datadog
  • Experience with Databricks AI Stack: MLflow, Mosaic AI, Unity Catalog, Feature Store, Databricks Model Serving, Vector Databases
Job Responsibility
Job Responsibility
  • Lead the evolution of our Machine Learning & AI Platform, designing the architecture for AI Agents and establishing patterns for Vector Databases
  • Act as a first mover: validate new Databricks features and integrate them into the platform
  • Write the guidelines for GenAI development, helping teams transition from notebook experiments to production-grade LLM applications
  • Design the Feature Store, manage the Model Registry, and set up the infrastructure for Vector Search and RAG (Retrieval Augmented Generation) workflows
  • Elevate the technical bar of the team, mentoring Staff and Senior engineers on design patterns, code quality, and architectural decisions
  • Translate complex requirements from ML Engineers and Data Scientists into robust engineering tickets and infrastructure roadmaps
What we offer
What we offer
  • An attractive Base Salary
  • Participation in our Short Term Incentive plan (annual bonus)
  • Work From Anywhere: Enjoy up to 20 days a year of working from anywhere
  • A 24/7 Employee Assistance Program for you and your family
  • Fulltime
Read More
Arrow Right

Senior Software Engineer- AI and Data Governance

At GEICO, we offer a rewarding career where your ambitions are met with endless ...
Location
Location
United States , Palo Alto
Salary
Salary:
100000.00 - 215000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advance knowledge of at least one modern OOP languages such as Go, Python, Java, etc.
  • Advance knowledge of web technologies such as HTML, CSS, JavaScript is preferred
  • Understand open-source databases like MySQL, PostgreSQL, etc., familiar with No-SQL databases like Cassandra, MongoDB, Elasticsearch, etc.
  • Experience in architecting, designing, building automation, workflows, custom objects/apps, declarative functionality, triggers, migration tools in BMC Helix platform and transition such platform to Open Source is a big plus
  • Experience building and configuring flows, and process builders
  • Strong understanding of web service integration (GRPC / REST) and enterprise middleware integration tiers
  • Ability to articulate channel dataflow and process flow including email, messaging, chat, mobile Push and SDK's
  • Excellent communication skills – needs to be able to lead projects from the front and interact with clients and sponsors on a regular basis
  • Experience partnering with engineering teams and transferring research to production
  • Experience with continuous delivery (CI/CD) and Infrastructure as Code
Job Responsibility
Job Responsibility
  • Collaborate with product managers, team members, customers, and other engineering teams to solve our toughest problems
  • Develop and execute technical software development strategy for the Platform Engineering domain including Service Management, Business Continuity, Recovery, Incident Response and Paging platforms
  • Accountable for the quality, usability, and performance of the solutions
  • Deep hands-on experience in complex system design and data pipeline and architectures, scale and performance, tuning, with good knowledge on Docker and Kubernetes
  • Consistently share best practices and improve processes within and across teams
  • Willing to take on-call and operational support
  • Experience designing recommendation systems, ranking, personalization, similarity search and embeddings
  • Experience with NLP, LLMs and RAG, as well as translating natural language into graph or data queries
  • Experience designing scalable AI systems and Data pipelines
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right