The Principal Data Engineer will lead the design, development, and optimization of our large-scale data infrastructure and data pipelines. They will serve as a technical leader, highly proficient in modern cloud data platforms, data governance tools, and business intelligence (BI) solutions. This role requires deep expertise in data architecture, data modeling, and performance tuning to ensure reliable, scalable, and high-quality data assets that drive business-critical decisions.
Job Responsibilities:
Architect and Design: Define and implement the architectural strategy for our enterprise data platform, focusing on scalability, security, and performance. Design and build robust, high-volume, and performant data pipelines using cloud-native services like AWS Data Pipelines (Glue, EMR, S3, Redshift, etc.) or Azure Data Factory/Synapse Analytics
Technical Leadership: Act as a subject matter expert and mentor for junior and mid-level data engineers, setting best practices for code quality, testing, and deployment
Data Governance & Management: Utilize tools like IDMC (Informatica Data Management Cloud) for data integration, quality, governance, and cataloging across the enterprise
Data Processing and Analysis: Develop, optimize, and manage large-scale data processing jobs using Databricks (Spark/Delta Lake) for ETL/ELT workflows and advanced analytics
Coding and Scripting: Write high-quality, efficient, and well-documented code primarily in Python for data manipulation, automation, and pipeline orchestration
Deployment and Automation: Implement and maintain robust CI/CD pipelines and infrastructure-as-code (e.g., Terraform/CloudFormation) for automated deployment and management of data solutions
Business Intelligence: Ensure data readiness for reporting and analytics, and apply working knowledge of BI tools like Tableau and Power BI to facilitate data consumption
Performance and Optimization: Monitor, troubleshoot, and tune existing data infrastructure and pipelines to ensure optimal performance and cost efficiency
Requirements:
Bachelor’s Degree or equivalent combination of education, training and/or relevant experience
7+ years of relevant work experience
Proficiency and hands-on experience designing and implementing end-to-end data solutions in AWS (e.g., S3, EMR, Glue, Redshift, Kinesis) and/or Azure (e.g., Azure Data Factory, Synapse Analytics, Azure Data Lake Storage)
Experience with some of the following technologies: Databricks, Python, Apache Spark, IDMC, Talend, CI/CD pipelines, Jenkins, GitLab, Power BI, SQL, and NoSQL
Nice to have:
Master’s degree in computer science, data engineering, or a related quantitative field
9+ years of relevant work experience
Cloud certifications (AWS Certified Data Analytics - Specialty, Azure Data Engineer Associate)
Experience with real-time data streaming technologies (Kafka, Kinesis)
Experience with IDMC tooling
What we offer:
health, dental, and vision plans
health savings accounts
wellness programs
flexible spending accounts
401(k) retirement plan with employer match
life insurance
short- and long-term disability insurance
paid time off
back-up care
adoption assistance
surrogacy assistance
reimbursement of education expenses
Public Service Loan Forgiveness eligibility
Railroad Retirement sickness and retirement benefits