Data Governance Lead - AWS Platform

Canada, Toronto · Job Posted March 19, 2026

Job Description

The Data Governance Lead will be responsible for establishing, leading, and operationalizing governance standards across the data ecosystem. This role places strong emphasis on Databricks and Unity Catalog as foundational components of the enterprise data platform. The ideal candidate will unify fragmented data sources into a consistent, well-managed, and trusted environment. This role partners closely with data engineering, platform teams, product teams, and business stakeholders to define and implement policies, frameworks, and best practices that ensure compliance, security, and high data quality across cloud and analytics platforms.

Job Responsibility

  • Develop and own the PSG enterprise data governance framework, including policies, standards, metadata strategy, and data stewardship processes
  • Lead governance initiatives across cloud and analytics platforms, prioritizing Databricks and Unity Catalog, followed by AWS data services and Dataiku
  • Define and implement catalog structures, data classification models, tagging strategies, and access control rules across multiple business domains
  • Establish metadata management, data lineage tracking, and data quality controls to improve trust, transparency, and usability
  • Drive unification of data across fragmented systems by designing scalable governance models, harmonized definitions, and reusable governance patterns
  • Partner with cross-functional teams to ensure compliance with data privacy, security, and regulatory requirements (e.g., PII, PHI, GxP, HIPAA, GDPR where applicable)
  • Oversee data access controls, entitlements, and policy enforcement across analytics and cloud environments
  • Establish governance processes for issue management, remediation, audit readiness, and continuous improvement
  • Mentor and guide data stewards while promoting adoption of governance standards across business teams
  • Support platform modernization initiatives enabling analytics, AI/ML, automation, and self-service capabilities

Requirements

  • 7+ years of experience in data governance, data management, enterprise data architecture, or related disciplines
  • 3+ years of hands-on experience with Databricks and Unity Catalog, including catalog design and structure, access policies and entitlement management, data lineage implementation, and classification and tagging strategies
  • 3+ years working with AWS data services (e.g., S3, Glue, Lake Formation, Redshift, IAM)
  • 3+ years of proven experience governing data across fragmented systems, complex pipelines, and multi-cloud or hybrid environments
  • 3+ years of experience implementing metadata management, MDM strategies, and enterprise data quality frameworks
  • 3+ years of hands-on experience with governance workflows, tooling, and model management processes
  • 1+ years of demonstrated experience supporting regulatory and compliance requirements (PII, PHI, GxP, HIPAA, GDPR, etc.)

Nice to have

  • Experience with Dataiku or similar advanced analytics platforms
  • Experience supporting AI/ML governance and data enablement initiatives
  • Industry experience in regulated environments (e.g., healthcare, life sciences, financial services)

Similar Jobs for Data Governance Lead - AWS Platform

8 matching positions

Data Governance Lead - AWS Platform

The Data Governance Lead will be responsible for establishing, leading, and oper...
Location: United States, Pittsburgh
Salary: 77760.00 - 164476.00 USD / Year
NTT DATA
Expiration Date: Until further notice
Requirements
  • 7+ years of experience in data governance, data management, enterprise data architecture, or related disciplines
  • 3+ years of hands-on experience with Databricks and Unity Catalog, including catalog design and structure, access policies and entitlement management, data lineage implementation, and classification and tagging strategies
  • 3+ years working with AWS data services (e.g., S3, Glue, Lake Formation, Redshift, IAM)
  • 3+ years of proven experience governing data across fragmented systems, complex pipelines, and multi-cloud or hybrid environments
  • 3+ years of experience implementing metadata management, MDM strategies, and enterprise data quality frameworks
  • 3+ years of hands-on experience with governance workflows, tooling, and model management processes
  • 1+ years of demonstrated experience supporting regulatory and compliance requirements (PII, PHI, GxP, HIPAA, GDPR, etc.)
Job Responsibility
  • Develop and own the PSG enterprise data governance framework, including policies, standards, metadata strategy, and data stewardship processes
  • Lead governance initiatives across cloud and analytics platforms, prioritizing Databricks and Unity Catalog, followed by AWS data services and Dataiku
  • Define and implement catalog structures, data classification models, tagging strategies, and access control rules across multiple business domains
  • Establish metadata management, data lineage tracking, and data quality controls to improve trust, transparency, and usability
  • Drive unification of data across fragmented systems by designing scalable governance models, harmonized definitions, and reusable governance patterns
  • Partner with cross-functional teams to ensure compliance with data privacy, security, and regulatory requirements (e.g., PII, PHI, GxP, HIPAA, GDPR where applicable)
  • Oversee data access controls, entitlements, and policy enforcement across analytics and cloud environments
  • Establish governance processes for issue management, remediation, audit readiness, and continuous improvement
  • Mentor and guide data stewards while promoting adoption of governance standards across business teams
  • Support platform modernization initiatives enabling analytics, AI/ML, automation, and self-service capabilities
Employment type: Full-time

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location: United Arab Emirates, Dubai
Salary: Not provided
NorthBay
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue, Amazon EMR (Spark), AWS Lambda, Apache Airflow (MWAA), Amazon EC2, Amazon CloudWatch, Amazon Redshift, Amazon DynamoDB, AWS DataZone
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
Employment type: Full-time

AWS Data Engineer (Cloud Data Platform & Pipeline Specialist)

Design, develop, and maintain scalable cloud-based data pipelines using AWS serv...
Location: United States, Atlanta
Salary: Not provided
NTT DATA
Expiration Date: Until further notice
Requirements
  • 5+ years of experience in data engineering, with strong hands-on expertise in AWS data services (Glue, EMR, S3, RDS, DataSync, DMS)
  • 5+ years of proven experience building and managing data pipelines (batch and streaming) in cloud environments
  • 5+ years of strong experience in data migration, transformation frameworks, and large-scale data replication
  • 5+ years of deep understanding of data modeling, data transformation, and reconciliation techniques
  • 5+ years of experience designing and implementing secure data access and governance (least privilege principles)
  • 5+ years of hands-on experience with data validation, auditing, and reconciliation processes
  • Familiarity with regulatory or finance data environments and reporting workloads
  • 5+ years of strong problem-solving skills and ability to work in a collaborative, fast-paced environment
  • AWS data services
  • data pipelines
Job Responsibility
  • Design, develop, and maintain scalable cloud-based data pipelines using AWS services such as Glue, EMR, S3, RDS, DataSync, and DMS
  • Build and optimize batch and streaming data orchestration workflows to support enterprise data platforms
  • Lead large-scale data migration efforts, including legacy-to-cloud transformations and replication strategies
  • Perform data modeling, transformation, and reconciliation to ensure high-quality, consistent datasets across systems
  • Implement secure data access patterns following least-privilege principles for pipelines and datasets
  • Collaborate with data architects, analysts, and business stakeholders to understand data requirements and deliver solutions
  • Establish robust data validation, reconciliation, and audit mechanisms to meet regulatory and reporting requirements
  • Troubleshoot and optimize performance of ETL/ELT pipelines and data workflows in AWS environments
  • Support governance, compliance, and audit readiness for data platforms in regulated environments (finance/reporting)
Employment type: Full-time

Sr Data Platform Lead

At Amgen, if you feel like you’re part of something bigger, it’s because you are...
Location: India, Hyderabad
Salary: Not provided
Amgen
Expiration Date: Until further notice
Requirements
  • Master's degree OR Bachelor's degree in computer science or engineering field and 8 to 13 years of relevant experience
  • Strong hands‑on experience with various capabilities of Databricks, from Compute to Storage and from Unity Catalog to Data Engineering to BI and AI/ML capabilities, with a focus on governance and enterprise enablement
  • Proven hands‑on experience with cloud platforms, with strong preference for AWS (experience with Azure or GCP also acceptable)
  • Experience leading Data Quality platform initiatives (e.g., Ataccama, Monte Carlo), including tool evaluation, implementation, enterprise-wide adoption, and integration with enterprise DQ solutions
  • Experience owning and managing Databricks platform environments, including workspace architecture, environment strategy (dev/test/prod), and lifecycle management at scale
  • Proven ability to establish and enforce platform standards and operating models, including cluster policies, cost management, and workload orchestration frameworks
  • Strong focus on platform enablement and developer experience, including building reusable frameworks, defining best practices, and supporting engineering teams in adopting the platform effectively
  • Exposure to AI/ML capabilities on Databricks, including enabling AI‑driven features or accelerating adoption of AI‑assisted engineering practices
  • Solid knowledge of SQL and relational / dimensional data modelling, sufficient to support platform integrations, governance, and observability use cases
  • Experience working with core AWS services such as EKS, EC2, S3, Lambda, Glue, EMR, RDS, and Redshift/Spectrum, particularly in platform or shared‑services contexts
Job Responsibility
  • Act as a platform lead for delivery of data platform capabilities that enable next-gen data platform architecture, with a strong focus on Databricks platform and DQ platform features and services
  • Evaluate and enable Databricks platform capabilities through technical assessments and proof‑of‑concepts (PoCs), ensuring alignment with next-gen data platform architectural patterns and enterprise standards
  • Design, build, and productionize reusable platform frameworks, accelerators, and reference implementations that can be leveraged by next-gen data platform delivery teams (excluding ownership of data pipeline architecture or implementation)
  • Enable data governance, metadata layer, and data bundle capabilities by designing and implementing platform‑level integrations between Databricks and Collibra, Amgen’s enterprise data governance platform
  • Build platform‑level tooling and automation to support proactive governance, cost optimization, and best‑practice enforcement across Databricks and related data platform services
  • Define and enable platform observability capabilities, including KPIs, metrics, and telemetry for monitoring performance, usage, reliability, and cost of Databricks services
  • Identify and implement governed self‑service platform capabilities for data engineers through a self-service portal, using Python‑based microservices deployed on Docker and Kubernetes
  • Lead user enablement and adoption initiatives, including onboarding content, guided learning experiences, workshops, and best‑practice sharing for the Databricks user community
  • Drive engineering excellence and adoption of AI across platform capabilities and solutions built, promoting modern engineering practices, automation, and responsible use of AI‑driven features
  • Enable key business programs and strategic initiatives by translating initiative‑driven requirements into scalable, reusable data platform capabilities, in alignment with next-gen data platform principles
Employment type: Full-time

Data Engineer (Big Data, Cloud - AWS, Databricks) - Assistant Vice President

Location: India, Pune
Salary: Not provided
Citi
Expiration Date: Until further notice
Requirements
  • Scala and Spark/PySpark are a must, plus Hadoop (Big Data), AWS, and Databricks
  • 8 to 11 years’ experience implementing data-intensive solutions using agile methodologies
  • Experience of relational databases and using SQL for data querying, transformation and manipulation
  • Experience of modelling data for analytical consumers
  • Ability to automate and streamline the build, test and deployment of data pipelines
  • Experience in cloud native technologies and patterns
  • A passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job training
  • Excellent communication and problem-solving skills
  • An inclination to mentor and an ability to lead and deliver medium-sized components independently
Job Responsibility
  • Developing and supporting scalable, extensible, and highly available data solutions
  • Deliver on critical business priorities while ensuring alignment with the wider architectural vision
  • Identify and help address potential risks in the data supply chain
  • Follow and contribute to technical standards
  • Design and develop analytical data models
Employment type: Full-time

Data Platform Engineer (Python / Golang)

You will work on a live, high-load fleet management platform that connects tens ...
Location: Poland; Ukraine
Salary: Not provided
Intellias
Expiration Date: Until further notice
Requirements
  • 5+ years Python backend in production
  • 2+ years as a lead or principal on a data-heavy platform
  • Typed Python at depth: Pydantic, mypy/pyright, clean code and OOP principles; sets standards the team follows
  • AWS data services in production (must-have): Lambda, Kinesis, DynamoDB, S3, Athena/Presto or Redshift, SQS/SNS — owned and operated end-to-end, not just used
  • Event-driven architecture ownership — designs streaming pipelines, DLQ strategies, retry/idempotency patterns; makes the calls others follow
  • Data engineering at principal depth: ETL/ELT, schema validation, data contracts, data lake/warehouse architecture; defines patterns the team reuses
  • SQL at depth — PostgreSQL and analytical engines (Athena, Redshift)
Job Responsibility
  • Own data platform architecture across the platform — streaming pipelines, partner integrations, and core backend services
  • Define and enforce backend and data engineering standards: service contracts, error handling, logging, secrets management
  • Own code quality and architectural consistency
  • Own event-driven integrations with 20+ external data partners — data contracts, ingestion, transformation, failure handling
  • Design and govern data models across PostgreSQL, DynamoDB, S3, and analytical systems
  • Define and maintain IaC architecture for owned services using Terraform
  • Collaborate with DevOps on deployment patterns, observability, and incident runbooks
  • Monitor production systems, drive alerting standards, and lead resolution of critical data incidents
  • Represent backend and data constraints in planning and API contract discussions — raises risks before implementation starts
  • Produce and maintain architecture documentation, ADRs, and onboarding materials
What we offer
  • Comfortable atmosphere
  • support for well-being
  • charge professional growth
  • equity, diversity, and inclusion
Employment type: Full-time

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...
Location: India, Pune
Salary: Not provided
Rapid7
Expiration Date: Until further notice
Requirements
  • Ability to thrive in a fast-paced hybrid organization
  • Comfort working in a highly agile, intensely iterative environment
  • Demonstrated capacity to clearly and concisely communicate complex business activities, technical requirements, and recommendations
  • 8+ years of experience in data engineering, analytics, or business intelligence
  • 8+ years experience designing, implementing, operating, and extending enterprise dimensional data models
  • 3+ years experience building reports and dashboards in Tableau and/or other similar data visualization tools
  • Experience in DBT modeling and understanding modular, performant models
  • Solid understanding of Snowflake, SQL, and data warehouse management
  • Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
  • Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)
Job Responsibility
  • Implement data modeling best practices to enhance data accessibility and reporting capabilities
  • Ensure data integrity, security, and compliance with industry standards and regulations
  • Document plans and results in user-stories, issues, PRs, the team’s handbook - following the tradition of documentation first
  • Implement the Corp Data philosophy in everything you do
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
  • Maintain and advocate for these standards through code review
  • Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
  • Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
  • Design, develop, and extend DBT code repository to extend the Enterprise Dimensional Warehouse capabilities and infrastructure
  • Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms
Employment type: Full-time

Senior Data Lead

At Barclays you will spearhead the transformation of our data landscape, driving...
Location: India, Pune
Salary: Not provided
Barclays
Expiration Date: Until further notice
Requirements
  • Proficiency in designing and implementing scalable data platforms on AWS using services such as S3, Glue (ETL Jobs, Data Catalog), Redshift, Athena, and Iceberg with strong hands-on experience with Databricks for building notebooks, ELT pipelines, and working with distributed data processing frameworks such as Apache Spark
  • Expertise in migrating large-scale SQL Server workloads to AWS, including defining source-to-target mappings and ensuring performance and data integrity
  • Proficiency in Python and SQL for developing data pipelines, transformations, and analytical workflows, and experience in building and orchestrating ETL/ELT pipelines using AWS Glue, DBT, and tools such as Airflow/Astronomer
  • Ability to define data contracts, design Iceberg table schemas, and implement partitioning, metadata management, and Glue Catalog integration
  • Strong understanding of data governance, including implementation of Lake Formation policies for row- and column-level security
  • Experience embedding MNPI controls, including data classification, access restrictions, and working with Compliance/CISO stakeholders
  • Hands-on experience with CI/CD pipelines, DevOps practices, and Agile delivery methodologies
  • Design and optimize data lake and warehouse solutions for structured and unstructured data across AWS platforms
Job Responsibility
  • At Barclays you will spearhead the transformation of our data landscape, driving innovation and excellence across enterprise data platforms
  • As a Senior Data Lead, you will leverage cutting-edge AWS technologies and Databricks to modernize and migrate SQL Server workloads into scalable, secure, and compliant cloud-native solutions
  • With a primary focus on AWS services including S3, Glue, Redshift, Athena, and Iceberg, alongside Python, SQL, and orchestration tools like Airflow, you will design and deliver robust data pipelines, establish strong data governance frameworks including MNPI controls, and enable high-quality, insight-driven decision-making
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
Employment type: Full-time