We are seeking a Lead Data Engineer to join a client-focused data pod delivering large-scale data engineering solutions within a cloud-native AWS environment. This is a delivery-first role requiring deep hands-on expertise in streaming data architectures, big data systems, and modern data warehousing practices. You will own streaming architectural direction while remaining actively involved in implementation. The environment is AWS-centric (Redshift, S3, Glue, Step Functions, Lambda, EMR), with DBT as the transformation framework. We are actively integrating streaming data from GCP sources into our AWS data platform. You will define engineering standards across data modeling, DBT implementation, testing, CI/CD, and production resiliency while collaborating directly with the client’s data team.
Job Responsibilities:
Own the architectural direction for streaming data ingestion from GCP into AWS
Design resilient ingestion frameworks including error handling, retry strategies, monitoring, and failure isolation
Implement distributed processing pipelines using Spark / PySpark or similar frameworks
Create and maintain scalable data warehouses and associated ETL/ELT processes using DBT models in Amazon Redshift
Design and implement DBT projects including macros, tests, documentation, and reusable modeling patterns
Conduct Redshift query and DBT performance tuning to optimize warehouse efficiency and cost
Define and enforce best practices for data modeling; version control (Git-based workflows); CI/CD pipelines for DBT deployments; and automated testing at the model, transformation, and pipeline levels
Ensure robust testing is embedded into every DBT model (schema tests, custom tests, data validation checks)
Lead code reviews and architectural design reviews
Work with AWS services including Redshift, S3, Glue, Step Functions, Lambda (Python), Athena, and EMR
Requirements:
6+ years of data engineering experience in big data environments
Proven experience designing and implementing streaming architectures