This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a member of the data engineering team, you will be the key technical expert developing and overseeing PepsiCo's data product build & operations and drive a strong vision for how data engineering can proactively create a positive impact on the business. You'll be an empowered member of a team of data engineers who build data pipelines into various source systems, rest data on the PepsiCo Data Lake, and enable exploration and access for analytics, visualization, machine learning, and product development efforts across the company. As a member of the data engineering team, you will help lead the development of very large and complex data applications into public cloud environments directly impacting the design, architecture, and implementation of PepsiCo's flagship data products
Job Responsibility:
Active contributor to code development in projects and services
Manage and scale data pipelines from internal and external data sources to support new product launches and drive data quality across data products
Build and own the automation and monitoring frameworks that captures metrics and operational KPIs for data pipeline quality and performance
Responsible for implementing best practices around systems integration, security, performance and data management
Empower the business by creating value through the increased adoption of data, data science and business intelligence landscape
Collaborate with internal clients (data science and product teams) to drive solutioning and POC discussions
Requirements:
6+ years of overall technology experience
4+ years of hands-on software development, data engineering, and systems architecture
4+ years of experience with Data Lake Infrastructure, Data Warehousing, and Data Analytics tools
4+ years of experience in SQL optimization and performance tuning
development experience in programming languages like Python, PySpark, Scala
2+ years in cloud data engineering experience in Azure (Azure Data Factory(ADF), ADLS-2, Databricks)
Fluent with Azure cloud services
Experience with integration of multi cloud services with on-premises technologies
Experience with data modeling, data warehousing, and building high-volume ETL/ELT pipelines
Experience with data profiling and data quality tools
Experience building/operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets
Experience with at least one MPP database technology such as Redshift, Synapse or SnowFlake
Experience with running and scaling applications on the cloud infrastructure and containerized services like Kubernetes
Experience with version control systems like Github and deployment & CI tools
Experience with Azure Data Factory, Azure Databricks and Azure Machine learning tools
Working knowledge of agile development, including DevOps and DataOps concepts
Familiarity with business intelligence tools (such as PowerBI)
Nice to have:
Azure or Databricks Certification
Experience with Statistical/ML techniques
Experience with building solutions in the Supply chain space (Digital Procurement, Manufacturing, Cost, Warehouse, Network Design)
Understanding of metadata management, data lineage, and data glossaries
What we offer:
Bonus based on performance and eligibility target payout is 10% of annual salary paid out annually
Paid time off subject to eligibility, including paid parental leave, vacation, sick, and bereavement
Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts