This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Efficy is Europe’s leading independent CRM provider, trusted by 13,500+ customers and 330,000+ users worldwide. We’re growing and looking for a Data Engineer to join our international team. You'll work on maturing our modern data warehouse on AWS, building and refining our medallion architecture (Bronze/Silver/Gold) with a domain-oriented Gold layer that powers both analytics and AI applications.
Job Responsibility:
Refactor and standardize Bronze and Silver layers - cleaning up technical debt, establishing consistent patterns organized by source systems, implementing data quality frameworks with automated validation and monitoring
Design and build domain-oriented Gold layer datasets - creating business-aligned data products organized by domain that serve as source for analytics, self-service BI and AI consumption
Build and optimize data pipelines using Databricks and/or Glue, working with PySpark move data through Bronze → Silver → domain-specific Gold
Develop AI-ready data infrastructure from Gold layer - implementing vectorization pipelines, building knowledge bases for LLMs, and preparing domain-specific structured data for AI agents
Collaborate with domain data owners to translate business requirements into scalable, domain-aligned data products
Requirements:
5+ years of hands-on experience as a Data Engineer, with proven expertise in building production data pipelines
Strong Python and SQL skills - comfortable writing complex transformations, optimizing queries, and working across different layers of data processing
Understanding of data governance and data quality frameworks
Practical experience or strong interest in AI/ML data preparation - vectorization, embeddings, and LLM data pipelines is a plus
Strong analytical mindset with attention to detail and a pragmatic approach to balancing technical processing needs with business consumption patterns
Fluent in English
Nice to have:
Practical experience or strong interest in AI/ML data preparation - vectorization, embeddings, and LLM data pipelines