Data Governance Lead - AWS Platform Job at NTT DATA (Toronto)

Data Governance Lead - AWS Platform

The Data Governance Lead will be responsible for establishing, leading, and oper...

Location

United States, Pittsburgh

Salary:

77760.00 - 164476.00 USD / Year

NTT DATA

Expiration Date

Until further notice

Requirements

7+ years of experience in data governance, data management, enterprise data architecture, or related disciplines
3+ years of hands-on experience with Databricks and Unity Catalog, including catalog design and structure, access policies and entitlement management, data lineage implementation, classification and tagging strategies
3+ years working with AWS data services (e.g., S3, Glue, Lake Formation, Redshift, IAM)
3+ years of proven experience governing data across fragmented systems, complex pipelines, and multi-cloud or hybrid environments
3+ years of experience implementing metadata management, MDM strategies, and enterprise data quality frameworks
3+ years of hands-on experience with governance workflows, tooling, and model management processes
1+ years of demonstrated experience supporting regulatory and compliance requirements (PII, PHI, GxP, HIPAA, GDPR, etc.)

Job Responsibility

Develop and own the PSG enterprise data governance framework, including policies, standards, metadata strategy, and data stewardship processes
Lead governance initiatives across cloud and analytics platforms, prioritizing Databricks and Unity Catalog, followed by AWS data services and Dataiku
Define and implement catalog structures, data classification models, tagging strategies, and access control rules across multiple business domains
Establish metadata management, data lineage tracking, and data quality controls to improve trust, transparency, and usability
Drive unification of data across fragmented systems by designing scalable governance models, harmonized definitions, and reusable governance patterns
Partner with cross-functional teams to ensure compliance with data privacy, security, and regulatory requirements (e.g., PII, PHI, GxP, HIPAA, GDPR where applicable)
Oversee data access controls, entitlements, and policy enforcement across analytics and cloud environments
Establish governance processes for issue management, remediation, audit readiness, and continuous improvement
Mentor and guide data stewards while promoting adoption of governance standards across business teams
Support platform modernization initiatives enabling analytics, AI/ML, automation, and self-service capabilities

Fulltime

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...

Location

United Arab Emirates, Dubai

Salary:

Not provided

NorthBay

Expiration Date

Until further notice

Requirements

8+ years of experience in data engineering and data platform development
Strong hands-on experience with: AWS Glue, Amazon EMR (Spark), AWS Lambda, Apache Airflow (MWAA), Amazon EC2, Amazon CloudWatch, Amazon Redshift, Amazon DynamoDB, AWS DataZone

Job Responsibility

Design, develop, and optimize scalable data pipelines using AWS native services
Lead the implementation of batch and near-real-time data processing solutions
Architect and manage data ingestion, transformation, and storage layers
Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
Orchestrate complex data workflows using Apache Airflow (MWAA)
Develop and manage serverless data processing using AWS Lambda
Design and optimize data warehouses using Amazon Redshift
Implement and manage NoSQL data models using Amazon DynamoDB
Utilize AWS DataZone for data governance, cataloging, and access management
Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch

Fulltime

AWS Data Engineer (Cloud Data Platform & Pipeline Specialist)

Design, develop, and maintain scalable cloud-based data pipelines using AWS serv...

Location

United States, Atlanta

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

5+ years of experience in data engineering, with strong hands-on expertise in AWS data services (Glue, EMR, S3, RDS, DataSync, DMS)
5+ years of proven experience building and managing data pipelines (batch and streaming) in cloud environments
5+ years of strong experience in data migration, transformation frameworks, and large-scale data replication
5+ years of deep understanding of data modeling, data transformation, and reconciliation techniques
5+ years of experience designing and implementing secure data access and governance (least privilege principles)
5+ years of hands-on experience with data validation, auditing, and reconciliation processes
Familiarity with regulatory or finance data environments and reporting workloads
5+ years of strong problem-solving skills and ability to work in a collaborative, fast-paced environment

Job Responsibility

Design, develop, and maintain scalable cloud-based data pipelines using AWS services such as Glue, EMR, S3, RDS, DataSync, and DMS
Build and optimize batch and streaming data orchestration workflows to support enterprise data platforms
Lead large-scale data migration efforts, including legacy-to-cloud transformations and replication strategies
Perform data modeling, transformation, and reconciliation to ensure high-quality, consistent datasets across systems
Implement secure data access patterns following least-privilege principles for pipelines and datasets
Collaborate with data architects, analysts, and business stakeholders to understand data requirements and deliver solutions
Establish robust data validation, reconciliation, and audit mechanisms to meet regulatory and reporting requirements
Troubleshoot and optimize performance of ETL/ELT pipelines and data workflows in AWS environments
Support governance, compliance, and audit readiness for data platforms in regulated environments (finance/reporting)

Fulltime

Sr Data Platform Lead

At Amgen, if you feel like you’re part of something bigger, it’s because you are...

Location

India, Hyderabad

Salary:

Not provided

Amgen

Expiration Date

Until further notice

Requirements

Master's degree OR Bachelor's degree in computer science or engineering field and 8 to 13 years of relevant experience
Strong hands‑on experience with various capabilities of Databricks, from Compute to Storage and from Unity Catalog to Data Engineering to BI and AI/ML capabilities, with a focus on governance and enterprise enablement
Proven hands‑on experience with cloud platforms, with strong preference for AWS (experience with Azure or GCP also acceptable)
Experience leading Data Quality platform initiatives (e.g., Ataccama, Monte Carlo), including tool evaluation, implementation, enterprise-wide adoption, and integration with enterprise DQ solutions
Experience owning and managing Databricks platform environments, including workspace architecture, environment strategy (dev/test/prod), and lifecycle management at scale
Proven ability to establish and enforce platform standards and operating models, including cluster policies, cost management, and workload orchestration frameworks
Strong focus on platform enablement and developer experience, including building reusable frameworks, defining best practices, and supporting engineering teams in adopting the platform effectively
Exposure to AI/ML capabilities on Databricks, including enabling AI‑driven features or accelerating adoption of AI‑assisted engineering practices
Solid knowledge of SQL and relational / dimensional data modelling, sufficient to support platform integrations, governance, and observability use cases
Experience working with core AWS services such as EKS, EC2, S3, Lambda, Glue, EMR, RDS, and Redshift/Spectrum, particularly in platform or shared‑services contexts

Job Responsibility

Act as a platform lead for delivery of data platform capabilities that enable next-gen data platform architecture, with a strong focus on Databricks platform and DQ platform features and services
Evaluate and enable Databricks platform capabilities through technical assessments and proof‑of‑concepts (PoCs), ensuring alignment with next-gen data platform architectural patterns and enterprise standards
Design, build, and productionize reusable platform frameworks, accelerators, and reference implementations that can be leveraged by next-gen data platform delivery teams (excluding ownership of data pipeline architecture or implementation)
Enable data governance, metadata layer, and data bundle capabilities by designing and implementing platform‑level integrations between Databricks and Collibra, Amgen’s enterprise data governance platform
Build platform‑level tooling and automation to support proactive governance, cost optimization, and best‑practice enforcement across Databricks and related data platform services
Define and enable platform observability capabilities, including KPIs, metrics, and telemetry for monitoring performance, usage, reliability, and cost of Databricks services
Identify and implement governed self‑service platform capabilities for data engineers through a self-service portal, using Python‑based microservices deployed on Docker and Kubernetes
Lead user enablement and adoption initiatives, including onboarding content, guided learning experiences, workshops, and best‑practice sharing for the Databricks user community
Drive engineering excellence and adoption of AI across platform capabilities and solutions built, promoting modern engineering practices, automation, and responsible use of AI‑driven features
Enable key business programs and strategic initiatives by translating initiative‑driven requirements into scalable, reusable data platform capabilities, in alignment with next-gen data platform principles

Fulltime

Data Engineer (Big Data, Cloud - AWS, Databricks) - Assistant Vice President

Location

India, Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Scala and Spark/PySpark are a must, plus Hadoop (Big Data), AWS, and Databricks
8 to 11 years’ experience implementing data-intensive solutions using agile methodologies
Experience of relational databases and using SQL for data querying, transformation and manipulation
Experience of modelling data for analytical consumers
Ability to automate and streamline the build, test and deployment of data pipelines
Experience in cloud native technologies and patterns
A passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job training
Excellent communication and problem-solving skills
An inclination to mentor and an ability to lead and deliver medium-sized components independently

Job Responsibility

Develop and support scalable, extensible, and highly available data solutions
Deliver on critical business priorities while ensuring alignment with the wider architectural vision
Identify and help address potential risks in the data supply chain
Follow and contribute to technical standards
Design and develop analytical data models

Fulltime

Data Platform Engineer (Python / Golang)

You will work on a live, high-load fleet management platform that connects tens ...

Location

Poland; Ukraine

Salary:

Not provided

Intellias

Expiration Date

Until further notice

Requirements

5+ years Python backend in production
2+ years as a lead or principal on a data-heavy platform
Typed Python at depth: Pydantic, mypy/pyright, clean code and OOP principles; sets standards the team follows
AWS data services in production (must-have): Lambda, Kinesis, DynamoDB, S3, Athena/Presto or Redshift, SQS/SNS — owned and operated end-to-end, not just used
Event-driven architecture ownership: designs streaming pipelines, DLQ strategies, and retry/idempotency patterns; makes the calls others follow
Data engineering at principal depth: ETL/ELT, schema validation, data contracts, data lake/warehouse architecture; defines patterns the team reuses
SQL at depth — PostgreSQL and analytical engines (Athena, Redshift)

Job Responsibility

Own data platform architecture across the platform: streaming pipelines, partner integrations, and core backend services
Define and enforce backend and data engineering standards: service contracts, error handling, logging, secrets management
Own code quality and architectural consistency
Own event-driven integrations with 20+ external data partners — data contracts, ingestion, transformation, failure handling
Design and govern data models across PostgreSQL, DynamoDB, S3, and analytical systems
Define and maintain IaC architecture for owned services using Terraform
Collaborate with DevOps on deployment patterns, observability, and incident runbooks
Monitor production systems, drive alerting standards, and lead resolution of critical data incidents
Represent backend and data constraints in planning and API contract discussions, and raise risks before implementation starts
Produce and maintain architecture documentation, ADRs, and onboarding materials

What we offer

Comfortable atmosphere
support for well-being
charge professional growth
equity, diversity, and inclusion

Fulltime

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...

Location

India, Pune

Salary:

Not provided

Rapid7

Expiration Date

Until further notice

Requirements

Ability to thrive in a fast-paced hybrid organization
Comfort working in a highly agile, intensely iterative environment
Demonstrated capacity to clearly and concisely communicate complex business activities, technical requirements, and recommendations
8+ years of experience in data engineering, analytics, or business intelligence
8+ years experience designing, implementing, operating, and extending enterprise dimensional data models
3+ years experience building reports and dashboards in Tableau and/or other similar data visualization tools
Experience in DBT modeling and understanding modular, performant models
Solid understanding of Snowflake, SQL, and data warehouse management
Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)

Job Responsibility

Implement data modeling best practices to enhance data accessibility and reporting capabilities
Ensure data integrity, security, and compliance with industry standards and regulations
Document plans and results in user stories, issues, PRs, and the team’s handbook, following the tradition of documentation first
Implement the Corp Data philosophy in everything you do
Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
Maintain and advocate for these standards through code review
Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
Design, develop, and extend the DBT code repository to expand the Enterprise Dimensional Warehouse capabilities and infrastructure
Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms

Fulltime

Senior Data Lead

At Barclays you will spearhead the transformation of our data landscape, driving...

Location

India, Pune

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

Proficiency in designing and implementing scalable data platforms on AWS using services such as S3, Glue (ETL Jobs, Data Catalog), Redshift, Athena, and Iceberg, with strong hands-on experience with Databricks for building notebooks, ELT pipelines, and working with distributed data processing frameworks such as Apache Spark
Expertise in migrating large-scale SQL Server workloads to AWS, including defining source-to-target mappings and ensuring performance and data integrity
Proficiency in Python and SQL for developing data pipelines, transformations, and analytical workflows, and experience in building and orchestrating ETL/ELT pipelines using AWS Glue, DBT, and tools such as Airflow/Astronomer
Ability to define data contracts, design Iceberg table schemas, and implement partitioning, metadata management, and Glue Catalog integration
Strong understanding of data governance, including implementation of Lake Formation policies for row- and column-level security
Experience embedding MNPI controls, including data classification, access restrictions, and working with Compliance/CISO stakeholders
Hands-on experience with CI/CD pipelines, DevOps practices, and Agile delivery methodologies
Design and optimize data lake and warehouse solutions for structured and unstructured data across AWS platforms

Job Responsibility

At Barclays you will spearhead the transformation of our data landscape, driving innovation and excellence across enterprise data platforms
As a Senior Data Lead, you will leverage cutting-edge AWS technologies and Databricks to modernize and migrate SQL Server workloads into scalable, secure, and compliant cloud-native solutions
With a primary focus on AWS services including S3, Glue, Redshift, Athena, and Iceberg, alongside Python, SQL, and orchestration tools like Airflow, you will design and deliver robust data pipelines, establish strong data governance frameworks including MNPI controls, and enable high-quality, insight-driven decision-making

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime
