This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Are you excited by the challenge of building scalable data infrastructure that powers search, browse, and recommendation systems used by millions of users? We are setting up a brand-new Data Engineering team within leading European Online Fashion & Beauty Retailer to take ownership of key pipelines that fuel the company's ML and data-driven services. The project focuses on migrating existing data pipelines from GA4 (Google Analytics 4) to an internal Customer Behavior Source - a company-specific representation of behavioral event data used across multiple downstream systems. The current scope is centered on this migration, with an emphasis on data correctness, stability, and alignment with a new internal event model. The role involves working with existing pipelines, adapting transformations to the new data source, and ensuring a smooth transition without disrupting data consumers. This is a hands-on data engineering role with a strong focus on pipeline migration, data quality, and collaboration with stakeholders validating behavioral data.
Job Responsibility:
Migrate existing data pipelines from GA4 (Google Analytics 4) to an internal Customer Behavior Source
Refactor and adapt Spark-based ETL pipelines to match a new behavioral event schema
Ensure data accuracy, consistency, and reliability during and after the migration
Work with large-scale datasets using PySpark and SQL
Collaborate with analytics and product teams to validate migrated data and resolve discrepancies
Document pipeline logic, data models, and migration decisions
Troubleshoot and resolve issues related to data quality or pipeline stability
Requirements:
Strong hands-on experience with PySpark and Apache Spark
Proficiency with Airflow for orchestrating data pipelines
Solid SQL skills for data transformation and validation
Experience working with Databricks in a production environment