Data Engineer - AWS

India, Pune · Job Posted June 14, 2026

Job Description

We are currently seeking a Data Engineer - AWS to join our team in Pune, Mahārāshtra (IN-MH), India (IN).

Role Overview

We are looking for a skilled Data Engineer to design, build, and maintain scalable, reliable data pipelines and platforms that support analytics, reporting, and operational decision-making. The role's primary focus is enabling an end-to-end data ingestion and processing pipeline: extracting data (preferably from Salesforce), landing it in Amazon S3, and transforming and loading it into Amazon Redshift for analytics-ready consumption. The engineer will also work on SQL modernization (including Oracle SQL development and conversion/optimization for Redshift), data quality, governance, monitoring, and performance tuning.

Job Responsibility

  • Build and operate robust ETL/ELT pipelines for Salesforce → Amazon S3 → Amazon Redshift
  • Automate extraction, secure landing, transformation, load, and publishing for reporting/analytics
  • Build strong data quality, reconciliation, monitoring, and scheduling into the pipeline
  • Build and maintain pipelines that extract data from Salesforce (API-based or connector-based), land data in Amazon S3, and load into Amazon Redshift
  • Implement incremental loads / CDC patterns where applicable; manage full loads and historical backfills as needed
  • Establish scheduling and orchestration for daily/near-real-time jobs with reliability and retry mechanisms
  • Design, develop, and optimize complex SQL in Oracle
  • Analyze and convert Oracle SQL to Redshift-compatible SQL, optimizing for Redshift performance and cost
  • Tune Redshift queries using best practices such as sort keys, distribution styles, and query patterns
  • Design and maintain ETL/ELT jobs, transformations, and reusable frameworks
  • Build and optimize data models for warehousing/lakehouse patterns (facts/dimensions, curated layers)
  • Support both batch and (where applicable) near-real-time processing patterns
  • Implement data quality checks (completeness, accuracy, consistency), reconciliation, and validation rules
  • Ensure data integrity, metadata documentation, lineage, and governance practices
  • Apply security and compliance standards (GDPR/regulatory needs where applicable)
  • Monitor pipelines and infrastructure using AWS monitoring tools; troubleshoot performance and reliability issues
  • Improve pipeline resilience through alerting, logging, retries, and error handling
  • Contribute to modernization and cloud migration initiatives and automation (DataOps/CI-CD where relevant)
  • Partner with analytics/reporting and business stakeholders to gather requirements and deliver reliable datasets
  • Work effectively with cross-functional teams and provide clear documentation of pipelines and datasets
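
As an illustration of the incremental-load and reconciliation responsibilities listed above, here is a minimal Python sketch. It is not taken from this employer's pipeline: the record shape, the choice of `SystemModstamp` (a standard Salesforce audit field, assumed here as the change marker), and the helper names `select_incremental` and `reconcile_counts` are hypothetical. A real job would pull rows through a Salesforce API or connector rather than from an in-memory list.

```python
from datetime import datetime, timezone

# Hypothetical in-memory stand-in for rows extracted from a source system.
# "SystemModstamp" is assumed to be the last-modified marker for each row.
SOURCE_ROWS = [
    {"Id": "001A", "SystemModstamp": "2026-06-10T08:00:00+00:00"},
    {"Id": "001B", "SystemModstamp": "2026-06-12T09:30:00+00:00"},
    {"Id": "001C", "SystemModstamp": "2026-06-13T17:45:00+00:00"},
]


def select_incremental(rows, watermark):
    """Return rows modified strictly after the watermark, plus the new watermark."""
    changed = [
        row for row in rows
        if datetime.fromisoformat(row["SystemModstamp"]) > watermark
    ]
    # If nothing changed, the watermark stays where it was.
    new_watermark = max(
        (datetime.fromisoformat(row["SystemModstamp"]) for row in changed),
        default=watermark,
    )
    return changed, new_watermark


def reconcile_counts(extracted_count, loaded_count):
    """Basic completeness check: every extracted row should have been loaded."""
    return {
        "extracted": extracted_count,
        "loaded": loaded_count,
        "missing": extracted_count - loaded_count,
        "ok": extracted_count == loaded_count,
    }


last_watermark = datetime(2026, 6, 11, tzinfo=timezone.utc)
batch, next_watermark = select_incremental(SOURCE_ROWS, last_watermark)
report = reconcile_counts(len(batch), loaded_count=len(batch))
print([row["Id"] for row in batch], next_watermark.isoformat(), report["ok"])
```

Persisting the watermark only after the reconciliation check passes is one common way to make a failed run safely repeatable; whether that fits a given pipeline depends on its load strategy.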

Requirements

  • Strong hands-on experience building ETL/ELT pipelines in cloud environments
  • Proven experience integrating Salesforce data into a data platform (extraction, S3 landing, transformat)
  • AWS: Amazon S3, Redshift, IAM, CloudWatch
  • Salesforce Integration: Salesforce APIs / connectors (extraction & ingestion patterns)
  • Programming & Querying: Python, SQL
  • Oracle: Complex SQL, stored procedures (as needed), performance tuning
  • Orchestration/Scheduling: AWS Glue, Lambda, Step Functions, cron-based scheduling (or equivalent)
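
The "reliability and retry mechanisms" expected of scheduled jobs can be sketched in plain Python as retries with exponential backoff. This is only an illustrative pattern under stated assumptions: `run_with_retries` and `flaky_load` are hypothetical names, and in practice an orchestrator such as Step Functions would typically supply retry behaviour through its own configuration rather than hand-written loops.

```python
import time


def run_with_retries(task, max_attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call task(); on failure wait base_delay * 2**(attempt - 1) and try again."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception as exc:
            # Out of attempts: surface the last error to the caller.
            if attempt == max_attempts:
                raise
            delay = base_delay * 2 ** (attempt - 1)
            print(f"attempt {attempt} failed ({exc}); retrying in {delay}s")
            sleep(delay)


# Hypothetical flaky step: fails twice, then succeeds.
calls = {"count": 0}


def flaky_load():
    calls["count"] += 1
    if calls["count"] < 3:
        raise RuntimeError("transient load error")
    return "loaded"


# The sleep function is injected so the example records delays instead of waiting.
delays = []
result = run_with_retries(flaky_load, base_delay=0.5, sleep=delays.append)
print(result, delays)
```

Injecting `sleep` keeps the sketch fast to run and easy to test; a production job would leave the default in place and usually cap the maximum delay.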

Nice to have

  • ETL tools: Informatica, Talend, Azure Data Factory
  • Warehousing: Snowflake, Azure Synapse (plus Redshift as primary)
  • Big data: Spark, Hadoop
  • Streaming & APIs: Kafka, Event Hub, REST APIs
  • DevOps/DataOps: CI/CD for data pipelines, infrastructure-as-code exposure

Similar Jobs for Data Engineer - AWS

8 matching positions

Data Engineer - AWS

We are seeking an AWS Data Engineer with 4–7 years of experience to design and b...
Location: India, Mumbai
Salary: Not provided
Company: NEC Software Solutions
Expiration Date: Until further notice
Requirements
  • Bachelor’s Degree in Computer Science, Engineering, or related field
  • 4–7 years of experience in application development and data engineering
  • 3+ years of experience with big data technologies
  • 3+ years of experience with cloud platforms (AWS preferred; Azure or GCP also acceptable)
  • Proficiency in Python, SQL, Scala, or Java (3+ years)
  • Experience with distributed computing tools such as Hadoop, Hive, EMR, Kafka, or Spark (3+ years)
  • Hands-on experience with real-time data and streaming applications (3+ years)
  • NoSQL database experience (MongoDB, Cassandra) – 3+ years
  • Data warehousing expertise (Redshift or equivalent) – 3+ years
Job Responsibility
  • Develop, test, deploy, orchestrate, monitor, and troubleshoot cloud-based data pipelines and automation workflows in alignment with best practices and security standards
  • Collaborate with data scientists, architects, ETL developers, and business stakeholders to capture, format, and integrate data from internal systems, external sources, and data warehouses
  • Research and experiment with batch and streaming data technologies to evaluate their business impact and suitability for current use cases
  • Contribute to the definition and continuous improvement of data engineering processes and procedures
  • Ensure data integrity, accuracy, and security across corporate data assets
  • Maintain high data quality standards for Data Services, Analytics, and Master Data Management
  • Build automated, scalable, and test-driven data pipelines
  • Apply software development practices including Git-based version control, CI/CD, and release management to enhance AWS CI/CD pipelines
  • Partner with DevOps engineers and architects to improve DataOps tools and frameworks
Employment type: Full-time

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location: United Arab Emirates, Dubai
Salary: Not provided
Company: NorthBay
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue, Amazon EMR (Spark), AWS Lambda, Apache Airflow (MWAA), Amazon EC2, Amazon CloudWatch, Amazon Redshift, Amazon DynamoDB, AWS DataZone
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
Employment type: Full-time

Aws Data Engineer (Cloud Data Platform & Pipeline Specialist)

Design, develop, and maintain scalable cloud-based data pipelines using AWS serv...
Location: United States, Atlanta
Salary: Not provided
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • 5+ years of experience in data engineering, with strong hands-on expertise in AWS data services (Glue, EMR, S3, RDS, DataSync, DMS)
  • Proven experience building and managing data pipelines (batch and streaming) in cloud environments (5+ years)
  • Strong experience in data migration, transformation frameworks, and large-scale data replication (5+ years)
  • Deep understanding of data modeling, data transformation, and reconciliation techniques (5+ years)
  • Experience designing and implementing secure data access and governance following least-privilege principles (5+ years)
  • Hands-on experience with data validation, auditing, and reconciliation processes (5+ years)
  • Familiarity with regulatory or finance data environments and reporting workloads
  • Strong problem-solving skills and ability to work in a collaborative, fast-paced environment (5+ years)
  • AWS data services
  • data pipelines
Job Responsibility
  • Design, develop, and maintain scalable cloud-based data pipelines using AWS services such as Glue, EMR, S3, RDS, DataSync, and DMS
  • Build and optimize batch and streaming data orchestration workflows to support enterprise data platforms
  • Lead large-scale data migration efforts, including legacy-to-cloud transformations and replication strategies
  • Perform data modeling, transformation, and reconciliation to ensure high-quality, consistent datasets across systems
  • Implement secure data access patterns following least-privilege principles for pipelines and datasets
  • Collaborate with data architects, analysts, and business stakeholders to understand data requirements and deliver solutions
  • Establish robust data validation, reconciliation, and audit mechanisms to meet regulatory and reporting requirements
  • Troubleshoot and optimize performance of ETL/ELT pipelines and data workflows in AWS environments
  • Support governance, compliance, and audit readiness for data platforms in regulated environments (finance/reporting)
Employment type: Full-time

Data Engineer (Big Data, Cloud - AWS, Databricks) - Assistant Vice President

Location: India, Pune
Salary: Not provided
Company: Citi
Expiration Date: Until further notice
Requirements
  • Scala and Spark/PySpark are a must, plus Hadoop (big data), AWS, and Databricks
  • 8 to 11 years’ experience implementing data-intensive solutions using agile methodologies
  • Experience of relational databases and using SQL for data querying, transformation and manipulation
  • Experience of modelling data for analytical consumers
  • Ability to automate and streamline the build, test and deployment of data pipelines
  • Experience in cloud native technologies and patterns
  • A passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job training
  • Excellent communication and problem-solving skills
  • An inclination to mentor; an ability to lead and deliver medium-sized components independently
Job Responsibility
  • Developing and supporting scalable, extensible, and highly available data solutions
  • Deliver on critical business priorities while ensuring alignment with the wider architectural vision
  • Identify and help address potential risks in the data supply chain
  • Follow and contribute to technical standards
  • Design and develop analytical data models
Employment type: Full-time

Data Engineer (AWS & PySpark)

We are looking for a hands-on Data Engineer with strong expertise in AWS data se...
Location: India, Bangalore South
Salary: Not provided
Company: Wissen
Expiration Date: Until further notice
Requirements
  • Strong Python programming skills
  • Hands-on PySpark development experience
  • AWS EMR
  • AWS Athena
  • AWS Glue
  • SQL
  • Data Warehousing Concepts
  • ETL Development
  • Data Pipeline Design
Job Responsibility
  • Design, develop, and maintain scalable data pipelines using PySpark and AWS services
  • Build ETL workflows using AWS Glue and EMR
  • Develop data ingestion, transformation, and processing frameworks
  • Optimize large-scale data processing jobs and improve performance
  • Write efficient SQL queries for analytics and reporting requirements
  • Work with structured and semi-structured datasets in cloud environments
  • Collaborate with data analysts, architects, and business stakeholders
  • Ensure data quality, reliability, and operational excellence
Employment type: Full-time

Lead AWS Data Engineer

We are seeking an accomplished and detail-oriented Lead Data Engineer – AWS to j...
Location: United Kingdom, London
Salary: Not provided
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • Proven experience in data engineering and cloud-based platform delivery
  • Strong understanding of distributed data processing and scalable system design
  • Ability to lead delivery while remaining hands-on technically
  • Strong analytical, problem-solving, and communication skills
  • Experience working in client-facing and delivery-focused environments
  • Ability to mentor and develop engineering teams
  • Strong hands-on experience with: AWS cloud services, especially AWS Glue; Python / PySpark for large-scale data processing; SQL for querying, transformation, and validation; configuration-driven development (e.g., YAML)
Job Responsibility
  • Act as a senior engineer within data engineering and cloud platform initiatives, supporting delivery across complex transformation programmes
  • Collaborate with architects and stakeholders to define and implement scalable AWS-based data solutions
  • Contribute to solution design, estimation, and delivery planning
  • Lead engineering workstreams and ensure high-quality technical delivery
  • Design, build, and optimise scalable data pipelines and data processing frameworks on AWS
  • Develop and maintain ETL/ELT pipelines using AWS Glue, Python / PySpark, SQL, and configuration-driven frameworks (e.g., YAML)
  • Implement robust data ingestion, transformation, and processing patterns
What we offer
  • Range of tailored benefits that support your physical, emotional, and financial wellbeing
  • Continuous growth and development opportunities
  • Opportunity to have flexible work options
Employment type: Full-time

AWS Data Engineer

Join us as an AWS Data Engineer at Barclays, responsible for supporting the success...
Location: India, Pune
Salary: Not provided
Company: Barclays
Expiration Date: Until further notice
Requirements
  • Firsthand experience in developing, testing, and maintaining applications on AWS Cloud
  • Strong hands‑on experience with the AWS Data Analytics stack (Amazon S3, AWS Glue, Athena, Lambda, IAM, Lake Formation, KMS, STS, and Step Functions), with a proven ability to build, test, and support secure, scalable, and well‑governed data pipelines
  • Firsthand experience in Airflow and PySpark and strong knowledge of Python
  • Design and implement scalable and efficient data transformation/storage solutions using Snowflake
  • Experience in data ingestion to Snowflake for different storage formats such as Parquet, Iceberg, JSON, and CSV
  • Experience in using dbt (Data Build Tool) with Snowflake for ELT pipeline development
  • Experience in writing advanced SQL and PL/SQL programs
  • Experience in AWS data pipeline development
  • Hands-on experience building reusable components using Snowflake and AWS tools/technologies
  • Must have completed two major projects
Job Responsibility
  • Build and maintain data architecture pipelines that enable the transfer and processing of durable, complete, and consistent data
  • Design and implement data warehouses and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
  • Develop processing and analysis algorithms fit for the intended data complexity and volumes
  • Collaborate with data scientists to build and deploy machine learning models
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
Employment type: Full-time

AWS Data Engineer

We are seeking a detail-oriented and capable Data Engineer – AWS to join our Dat...
Location: United Kingdom, London
Salary: Not provided
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • Experience in data engineering or software engineering with data focus
  • Strong interest in cloud-based data platforms and distributed processing
  • Good analytical and problem-solving skills
  • Attention to detail and commitment to data quality and reliability
  • Effective communication and teamwork skills
  • Willingness to learn and develop in AWS and modern data engineering practices
  • Hands-on experience with AWS cloud services (especially AWS Glue), Python / PySpark, and SQL querying and data manipulation
  • Exposure to YAML or configuration-driven pipelines (desirable)
  • Experience building or supporting data pipelines, ETL/ELT processes
  • Familiarity with data lakes and/or lakehouse concepts and with distributed processing frameworks (e.g., Spark)
Job Responsibility
  • Support delivery across data engineering and platform development initiatives
  • Collaborate with architects, engineers, and stakeholders to implement data solutions on AWS
  • Assist in planning and executing engineering tasks, releases, and deliverables
  • Build and maintain data pipelines and workflows on AWS platforms
  • Develop ETL/ELT pipelines using AWS Glue, Python / PySpark, SQL, and configuration-driven frameworks (e.g., YAML)
  • Support ingestion, transformation, and processing of structured and semi-structured data
  • Contribute to the development of scalable, reusable data components and services
  • Test and validate data pipelines and processing jobs running on AWS services
  • Develop and execute data validation and reconciliation queries using SQL
  • Work with AWS services including AWS Glue, S3-based data lakes, and related data processing and orchestration services