We seek a skilled Data Engineer to build and optimize our data infrastructure. As a key contributor, you will collaborate closely with cross-functional teams to design and implement robust data pipelines that efficiently extract, transform, and load data into our AWS-based data lake and data warehouse. Your expertise will be instrumental in empowering data-driven decision making through advanced analytics and predictive modeling.
Job Responsibilities:
Building and optimizing data pipelines, data warehouses, and data lakes on the AWS and Databricks platforms
Managing and maintaining the AWS and Databricks environments
Ensuring data integrity, accuracy, and consistency through rigorous quality checks and monitoring
Maintaining system uptime and optimal performance
Working closely with cross-functional teams to understand business requirements and translate them into technical solutions
Exploring and implementing new tools and technologies to enhance ETL platform performance
Requirements:
Proficient in SQL for extracting, transforming, and analyzing complex datasets from both relational and columnar data stores. Proven ability to optimize query performance on big data platforms
Proficient in leveraging Python, PySpark, and Airflow to build scalable and efficient data ingestion, transformation, and loading processes
Ability to learn new technologies quickly
Strong problem-solving and analytical skills
Excellent communication and teamwork skills
Bachelor’s degree in computer science and engineering preferred; other engineering fields will be considered
Nice to have:
Experienced with SQL/NoSQL databases and vector databases for large language models
Experienced with data modeling and performance tuning for both OLAP and OLTP databases
Experienced with Apache Spark and Apache Airflow
Experienced with software engineering best practices, including but not limited to version control (Git, Subversion, etc.), CI/CD (Jenkins, Maven, etc.), automated unit testing, and DevOps