This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Data Engineer, you will be the foundational backbone of our data analytics delivery stream. You will design and implement high-performance data pipelines capable of handling both real-time streaming and batch processing. Beyond writing complex ETL/ELT logic and tuning SQL queries for maximum efficiency, you will collaborate with Data Scientists, ML Developers, and Cloud Architects to evolve our data platform environment. This role is perfect for a developer with a data-backend focus who champions clean code, automated data quality tracking, and scalable cloud architecture.
Job Responsibility
Build, improve, and maintain data infrastructure for ingesting, storing, and transforming data across advanced analytical workflows
Implement robust, scalable data pipelines supporting both stream and batch processing
Perform end-to-end ETL/ELT processes to extract and load data from a wide variety of structured and unstructured sources
Partner with enterprise architects and Information Services (IS) teams to design and evolve the overarching data platform environment in AWS
Write complex, automated SQL queries to manipulate data extracts and manage large-scale data flows
Perform continuous query optimization, performance analysis, and database tuning to minimize processing latency and cost
Assist with logical and physical data modeling, implementing optimized data structures (e.g., star and snowflake schemas)
Automate cloud deployments, data flows, and automated data quality validation checks
Collaborate closely with Data Scientists and ML developers to build, test, and operationalize automated ML pipelines
Design rigorous testing processes, formulating and executing comprehensive test cases for data validation
Conduct thorough code reviews, troubleshoot system defects, and provide constructive feedback to elevate code quality across the distributed engineering team
Requirements
Minimum of 3+ years in a dedicated Data Engineering role or a Software Developer role with a strong focus on data-backend development and complex transformations
Excellent, production-level knowledge of Python
Strong knowledge of SQL paired with a deep understanding of performance analysis and query optimization techniques
Rigid grasp of computer science fundamentals, including modularity, abstraction, data structures, and algorithms
Solid understanding of core data modeling concepts (normalized vs. denormalized data architectures, conceptual/logical/physical models, and star/snowflake schemas)
Proven experience building automated, production-grade ETL/ELT pipelines
Solid project experience leveraging various data storage technologies (including RDBMS, NoSQL, and Graph Databases)
Direct experience with cloud infrastructure provisioning and deployment automation on AWS
Nice to have
BSc. degree in Computer Science, Engineering, Mathematics, Physics, Statistics, or an equivalent quantitative field
In-depth knowledge of the broader AWS ecosystem combined with exposure to Microsoft Fabric
Experience working within fast-paced Agile development methodologies