Lead Data Engineer - GCP Platform Services & Frameworks

Location: United States, Iselin · Employment contract · Salary: 119000.00 - 224000.00 USD / Year · Job Posted June 04, 2026

Job offer has expired

Job Description

Within COO Technology, we are seeking a Lead Data Engineer who will be responsible for designing GCP‑based data pipelines, semantic layers, and reporting capabilities that support front‑office, middle‑office, and finance functions. The role will ensure data is accurate, reconciled, well‑governed, and consumable via standardized and self‑service analytics and dashboards, as well as via APIs for downstream systems.

Job Responsibility

  • Lead the design and implementation of scalable, secure data platforms on Google Cloud using managed services (BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Storage, Composer)
  • Build reusable frameworks and tooling (ingestion, transformation, quality, orchestration) that can be adopted by multiple product and domain teams
  • Develop shared transformation libraries in Python/SQL/Beam (e.g., common SCD patterns, data quality checks, masking/tokenization routines)
  • Provide orchestration capabilities via Cloud Composer or Cloud Workflows with reusable DAGs/templates and CI/CD integration
  • Optimize cost, performance, and reliability of GCP data workloads (partitioning, clustering, storage classes, autoscaling strategies)
  • Build opinionated data ingestion frameworks (e.g., config-driven pipelines, connectors, schema handling, error handling) on top of Dataflow, Dataproc, or Composer
  • Implement robust data modeling (dimensional, data vault, or canonical models) and semantic layers in BigQuery and related tools
  • Enforce data quality, lineage, and observability using standardized metrics, validation rules, and monitoring dashboards
  • Partner with domain data engineers, analytics, and ML teams to onboard use cases onto platform services and frameworks
  • Document patterns, runbooks, and best practices, and provide enablement through workshops and code examples
  • Contribute to platform roadmap, tool selection, and evaluation of new GCP services and open-source components

Requirements

  • 5+ years of Data Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 4+ years with orchestration tools (Cloud Composer/Airflow) and CI/CD (Cloud Build, Git-based workflows) for data workloads
  • 4+ years of experience with Python and Spark for building data pipelines, libraries, and automation tooling
  • 4+ years of advanced SQL and data modeling for analytics and warehousing
  • Experience with ETL/ELT design patterns
  • 4+ years' experience with automated testing, data quality checks, and monitoring for pipelines and platform services
  • 4+ years' experience with logging/monitoring stacks (Cloud Logging, Cloud Monitoring, error reporting, metrics dashboards)
  • 3+ years of experience with core GCP data services: BigQuery, Dataflow/Apache Beam, Dataproc, Pub/Sub

Nice to have

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent
  • Experience with knowledge graphs and semantic search
  • Excellent communication and presentation skills across technical audiences
  • Advanced data modeling skills, including the design of tabular, MPP, and composite models and performance optimization of large, complex datasets
  • Strong SQL expertise and hands-on experience with Starburst and/or Trino, including connector configuration, query optimization, and secure data federation across GCP data stores
  • Collaborative mindset with a focus on partnering across business, finance, risk, and technology teams to deliver high-quality, reconciled data, APIs, and reporting outcomes
  • Experience in large financial services organizations (100k+ employees)
  • Experience with Agile transformations and technology roadmaps
  • Experience working with onshore and offshore teams

What we offer

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement

Similar Jobs for Lead Data Engineer - GCP Platform Services & Frameworks

8 matching positions

Senior Data & AI/ML Engineer - GCP Specialization Lead

We are on a bold mission to create the best software services offering in the wo...
Location: United States, Menlo Park
Salary: Not provided
Company: techjays (techjays.com)
Expiration Date: Until further notice
Requirements
  • GCP Services: BigQuery, Dataflow, Pub/Sub, Vertex AI
  • ML Engineering: End-to-end ML pipelines using Vertex AI / Kubeflow
  • Programming: Python & SQL
  • MLOps: CI/CD for ML, Model deployment & monitoring
  • Infrastructure-as-Code: Terraform
  • Data Engineering: ETL/ELT, real-time & batch pipelines
  • AI/ML Tools: TensorFlow, scikit-learn, XGBoost
  • Min Experience: 10+ Years
Job Responsibility
  • Design and implement data architectures for real-time and batch pipelines, leveraging GCP services such as BigQuery, Dataflow, Dataproc, Pub/Sub, Vertex AI, and Cloud Storage
  • Lead the development of ML pipelines, from feature engineering to model training and deployment using Vertex AI, AI Platform, and Kubeflow Pipelines
  • Collaborate with data scientists to operationalize ML models and support MLOps practices using Cloud Functions, CI/CD, and Model Registry
  • Define and implement data governance, lineage, monitoring, and quality frameworks
  • Build and document GCP-native solutions and architectures that can be used for case studies and specialization submissions
  • Lead client-facing PoCs or MVPs to showcase AI/ML capabilities using GCP
  • Contribute to building repeatable solution accelerators in Data & AI/ML
  • Work with the leadership team to align with Google Cloud Partner Program metrics
  • Mentor engineers and data scientists toward achieving GCP certifications, especially in Data Engineering and Machine Learning
  • Organize and lead internal GCP AI/ML enablement sessions
What we offer
  • Best in class packages
  • Paid holidays and flexible paid time away
  • Casual dress code & flexible working environment
  • Medical Insurance covering self & family up to 4 lakhs per person

Lead Data Engineer

Within COO Technology, Wells Fargo is seeking a Lead Data Engineer to help shape...
Location: United States, Iselin; Charlotte; Irving
Salary: Not provided
Company: Wells Fargo (https://www.wellsfargo.com/)
Expiration Date: June 08, 2026
Requirements
  • 5+ years of Database Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of data management experience within Public Cloud (GCP, AWS, Azure)
  • 5+ years of hands-on experience with Python or Java, plus Spark SQL, for building data pipelines, libraries, and automation tooling
  • 5+ years with orchestration tools (Cloud Composer/Airflow) and CI/CD (Cloud Build, Git‑based workflows) for data workloads
Job Responsibility
  • Design and implement scalable, secure data platforms on Google Cloud using managed services (BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Storage, Composer).
  • Build reusable frameworks and tooling (ingestion, transformation, quality, orchestration) that can be adopted by multiple product and domain teams.
  • Enable self‑service data consumption and governance by standardizing patterns, templates, and platform capabilities rather than one‑off pipelines.
  • Design logical and physical data platform architectures leveraging BigQuery, Dataflow/Apache Beam, Dataproc/Spark, Pub/Sub, and Cloud Storage.
  • Define and implement standardized ingestion, transformation, and serving patterns (batch and streaming) as reusable blueprints.
  • Optimize cost, performance, and reliability of GCP data workloads (partitioning, clustering, storage classes, autoscaling strategies).
  • Build opinionated data ingestion frameworks (e.g., config‑driven pipelines, connectors, schema handling, error handling) on top of Dataflow, Dataproc, or Composer.
  • Develop shared transformation libraries in Python/SQL/Beam (e.g., common SCD patterns, data quality checks, masking/tokenization routines).
  • Provide orchestration capabilities via Cloud Composer or Cloud Workflows with reusable DAGs/templates and CI/CD integration.
  • Implement robust data modeling (dimensional, data vault, or canonical models) and semantic layers in BigQuery and related tools.
What we offer
  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Full-time

Sr Data Platform Lead

At Amgen, if you feel like you’re part of something bigger, it’s because you are...
Location: India, Hyderabad
Salary: Not provided
Company: Amgen (amgen.com)
Expiration Date: Until further notice
Requirements
  • Master's degree OR Bachelor's degree in computer science or engineering field and 8 to 13 years of relevant experience
  • Strong hands‑on experience with various capabilities of Databricks, from Compute to Storage and from Unity Catalog to Data Engineering to BI and AI/ML capabilities, with a focus on governance and enterprise enablement
  • Proven hands‑on experience with cloud platforms, with strong preference for AWS (experience with Azure or GCP also acceptable)
  • Experience leading Data Quality platform initiatives (e.g., Ataccama, Monte Carlo), including tool evaluation, implementation, enterprise-wide adoption, and integration with enterprise DQ solutions
  • Experience owning and managing Databricks platform environments, including workspace architecture, environment strategy (dev/test/prod), and lifecycle management at scale
  • Proven ability to establish and enforce platform standards and operating models, including cluster policies, cost management, and workload orchestration frameworks
  • Strong focus on platform enablement and developer experience, including building reusable frameworks, defining best practices, and supporting engineering teams in adopting the platform effectively
  • Exposure to AI/ML capabilities on Databricks, including enabling AI‑driven features or accelerating adoption of AI‑assisted engineering practices
  • Solid knowledge of SQL and relational / dimensional data modelling, sufficient to support platform integrations, governance, and observability use cases
  • Experience working with core AWS services such as EKS, EC2, S3, Lambda, Glue, EMR, RDS, and Redshift/Spectrum, particularly in platform or shared‑services contexts
Job Responsibility
  • Act as a platform lead for delivery of data platform capabilities that enable next-gen data platform architecture, with a strong focus on Databricks platform and DQ platform features and services
  • Evaluate and enable Databricks platform capabilities through technical assessments and proof‑of‑concepts (PoCs), ensuring alignment with next-gen data platform architectural patterns and enterprise standards
  • Design, build, and productionize reusable platform frameworks, accelerators, and reference implementations that can be leveraged by next-gen data platform delivery teams (excluding ownership of data pipeline architecture or implementation)
  • Enable data governance, metadata layer, and data bundle capabilities by designing and implementing platform‑level integrations between Databricks and Collibra, Amgen’s enterprise data governance platform
  • Build platform‑level tooling and automation to support proactive governance, cost optimization, and best‑practice enforcement across Databricks and related data platform services
  • Define and enable platform observability capabilities, including KPIs, metrics, and telemetry for monitoring performance, usage, reliability, and cost of Databricks services
  • Identify and implement governed self‑service platform capabilities for data engineers through a self-service portal, using Python‑based microservices deployed on Docker and Kubernetes
  • Lead user enablement and adoption initiatives, including onboarding content, guided learning experiences, workshops, and best‑practice sharing for the Databricks user community
  • Drive engineering excellence and adoption of AI across platform capabilities and solutions built, promoting modern engineering practices, automation, and responsible use of AI‑driven features
  • Enable key business programs and strategic initiatives by translating initiative‑driven requirements into scalable, reusable data platform capabilities, in alignment with next-gen data platform principles
  • Full-time

Lead Data Engineer

We’re looking for a Senior/Lead Data Engineer to join our team. We are seeking a...
Location: India, Noida
Salary: Not provided
Company: Taazaa Inc (taazaa.com)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field
  • 7–9 years of experience as a Data Engineer or in a similar role
  • Strong experience in building and maintaining ETL/ELT pipelines
  • Experience supporting analytics and AI/ML data workflows
  • Hands-on experience with ETL tools such as Apache Airflow, Airbyte, and dbt
  • Strong expertise in SQL and NoSQL databases (MySQL, PostgreSQL, MongoDB, Redis)
  • Proficiency in Python and Shell scripting for data processing and automation
  • Experience with cloud platforms (AWS, Azure, GCP) and their data services
  • Familiarity with data warehousing solutions such as Amazon Redshift or Snowflake
  • Experience with containerization tools like Docker and orchestration using Kubernetes
Job Responsibility
  • Design, develop, and maintain scalable ETL/ELT pipelines for processing large datasets
  • Build data pipelines to ingest, transform, and load data into data lakes, warehouses, and feature stores
  • Optimise data workflows for performance, reliability, and scalability
  • Integrate data from multiple sources, including APIs, databases, and file systems (JSON, CSV, Parquet)
  • Manage relational and non-relational databases, including columnar databases like ClickHouse
  • Ensure efficient data storage and retrieval for both analytics and ML workloads
  • Implement data quality checks, validation frameworks, and monitoring systems
  • Maintain data lineage, metadata management, and governance standards
  • Ensure data accuracy, consistency, and compliance for analytics and AI/ML use cases
  • Deploy and manage data solutions on cloud platforms (AWS, Azure, GCP) or on-prem environments
What we offer
  • Competitive salaries
  • Health benefits
  • Various perks
  • Competitive compensation and performance-based incentives
  • Opportunities for professional growth through workshops and certifications
  • Flexible work-life balance with remote options
  • Collaborative culture
  • Exposure to diverse projects across various industries
  • Clear career advancement pathways
  • Comprehensive health benefits
  • Full-time

Lead Data Engineer

Lead Data Engineer to design, build, and maintain the integrity of our core data...
Location: United Kingdom, Swindon
Salary: 75000.00 GBP / Year
Company: TalentHawk (talenthawk.com)
Expiration Date: Until further notice
Requirements
  • Proven experience leading data engineering or BI teams within complex environments
  • Hands-on expertise in designing and implementing Enterprise Data Warehouses
  • Track record of building secure data pipelines across multiple source systems
  • A degree in Computer Science, Data Engineering, or a related field (or equivalent experience)
  • Relevant certifications (e.g., Azure/AWS Data Engineer, Snowflake) are highly desirable
  • Strong grasp of Data Vault, Kimball, or equivalent design patterns
  • Expert-level SQL, ETL/ELT pipeline development, and modern engineering tools
  • Proficiency with cloud-based services (Azure, AWS, or GCP) and Power BI
  • Deep understanding of data security, GDPR, and governance frameworks
  • Exceptional leadership and mentoring capabilities
Job Responsibility
  • Architecture & Implementation: Own the Data Warehouse lifecycle, ensuring high availability, security, and scalability
  • Data Integration: Build and maintain robust pipelines to ingest and transform data from diverse systems (Salesforce, NetSuite, and digital platforms)
  • Team Leadership: Manage and mentor the BI team, providing technical direction and fostering a high-performance culture
  • Data Governance & Security: Implement validation practices, metadata management, and data lineage to ensure GDPR compliance and data integrity
  • Stakeholder Collaboration: Act as a bridge between technical teams and business leaders to translate reporting needs into actionable technical solutions
  • Strategic Input: Evaluate new technologies and provide expert advice on programs requiring integrated data and analytics
  • Full-time

Lead Data Engineer

The Data Engineer reports to the Director of Data and Analytics and plays a crit...
Location: Canada, Saint-Laurent
Salary: Not provided
Company: Psycho Bunny (psychobunny.ca)
Expiration Date: Until further notice
Requirements
  • 6 to 8 years of experience in a related field
  • Diploma in Computer Science, Data Engineering, or a related field
  • Extensive experience with Snowflake, including Snowflake-specific capabilities like virtual warehouses, zero-copy cloning, and Snowpipe
  • Proficiency in Python, SQL, and Java or Scala for large-scale data processing
  • Hands-on experience with Kafka, Spark, or similar tools for streaming and batch processing
  • Advanced knowledge of AWS, Azure, or GCP
  • Experience integrating Snowflake into cloud ecosystems such as AWS
  • Data Integration: Proficiency with ETL/ELT tools like Fivetran, Matillion, or Informatica
  • Strong project management skills to deliver on complex, multi-stakeholder data projects
  • Excellent communication skills to collaborate with both technical and non-technical stakeholders
Job Responsibility
  • Partner with business unit leaders to understand all data needs and requirements, ensuring alignment with business objectives
  • Design, develop, and maintain reliable data pipelines that efficiently process large volumes of data according to evolving business needs
  • Implement systems and practices to ensure data is accessible and usable for business intelligence tools, data analytics teams, and other stakeholders
  • Manage the loading and transformation of data through both technical processes and business logic
  • Produce strategic data that adds value and contributes to the organization’s growth and competitiveness
  • Establish and enforce data quality standards, methodologies, and systems to ensure data accuracy and reliability
  • Monitor data ingestion and processing, resolving any discrepancies and ensuring smooth data flows
  • Collaborate with data source providers, Psycho Bunny vendors and internal stakeholders to address data quality issues effectively
  • Catalog and document the data sources needed to implement self-service analytics across the organization
  • Process Improvement: Continually improve ongoing reporting and analysis processes and practices to enhance data quality and efficiency
What we offer
  • Sweet discount on the coolest fits
  • Room to grow in a rapidly expanding brand
  • Surrounded by smart and passionate people
  • A group RRSP/DPSP plan, which includes a very generous match from Psycho Bunny!
  • On-site gym and on-site cafeteria / bistro with subsidized meals, including breakfast and lunch
  • Three (3) weeks of vacation
  • Six (6) wellness days and your birthday off, on us
  • Full-time

Senior Data Platform Engineer

For a large, data-driven organization, we are looking for a Senior Data Platform...
Location: Netherlands, Amsterdam
Salary: Not provided
Company: Levy Professionals (levy-professionals.com)
Expiration Date: Until further notice
Requirements
  • Strong expertise in Google Cloud Platform, including services such as BigQuery, Cloud Storage, Dataflow, etc.
  • Experience with SQL Server, data pipeline migrations, Python, and IDMC
  • Hands-on knowledge of data engineering frameworks and ETL tools
  • Ability to explain technical concepts clearly and support the development of less-experienced team members
  • Familiarity with cloud governance, security, and operational best practices
  • Strong communication skills and the ability to work effectively with both technical and non-technical stakeholders
  • Independent, able to review and validate own work
  • Fluent in English
Job Responsibility
  • Lead and support the migration of SQL Server workloads and data pipelines to GCP
  • Design, build, and optimize cloud-based data ingestion, transformation, and processing workflows
  • Support team members by sharing knowledge, reviewing work, and promoting cloud best practices
  • Contribute to the transition from a shadow-IT setup to a structured, standards-driven cloud environment
  • Define, establish, and document best practices for cloud data engineering and operations
  • Collaborate with stakeholders to align data solutions with business and technical requirements

Lead Data Analytics Engineer

Lead Data Analytics Engineer role at ResMed. A Senior Staff–level technical lead...
Location: United States, San Diego
Salary: 171000.00 - 257000.00 USD / Year
Company: ResMed (resmed.com)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in a STEM field or equivalent experience
  • Extensive hands-on experience as a senior IC in data engineering, analytics engineering, or data architecture (typically 8+ years)
  • Expert-level SQL and data modeling skills on large-scale platforms (Snowflake preferred)
  • Strong experience building production data pipelines and models using Python, cloud services, and modern data stack tools
  • Proficiency with dbt or similar transformation frameworks
  • Demonstrated ability to set technical direction, define architectural patterns, and establish engineering best practices
  • Solid experience with Git/GitHub workflows, including branching strategies and collaborative development
  • Experience building and maintaining CI/CD pipelines in GitHub Actions, including automated testing and secure deployments
  • Ability to operate across both analytics engineering and data engineering responsibilities
  • Experience with cloud platforms such as AWS or GCP
Job Responsibility
  • Set architectural strategy for data modeling, transformation, ingestion, and data products, and guide engineering best practices across teams
  • Lead analytics engineering by designing high-quality Snowflake/dbt models, establishing governance and testing standards, and mentoring engineers in scalable modeling and system design
  • Build and evolve data pipelines using Python, Spark, APIs, connector frameworks, and other ingestion technologies, introducing automation, observability, and resilient design patterns
  • Collaborate cross-functionally with product, engineering, and data science to shape impactful, scalable solutions
  • Drive future advanced analytics and ML capabilities by defining feature pipelines, supporting classical ML models, and enabling new AI-driven workloads including LLM-based and hybrid ML/AI architectures
What we offer
  • Comprehensive medical, vision, dental, life, AD&D, short-term and long-term disability insurance
  • Sleep care management
  • Health Savings Account (HSA) and Flexible Spending Account (FSA)
  • Commuter benefits
  • 401(k)
  • Employee Stock Purchase Plan (ESPP)
  • Employee Assistance Program (EAP)
  • Tuition assistance
  • Flexible time off (FTO)
  • 11 paid holidays plus 3 floating days
  • Eligibility for 14 weeks of primary caregiver leave or two weeks of secondary caregiver leave
  • Full-time