Data Engineer - AWS, PySpark Job at Barclays (Bengaluru)

Data Engineer - AWS, Pyspark

You will be responsible for supporting the successful delivery of Location Strat...

Location

India , Bengaluru

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

Hands on experience in pyspark and strong knowledge on Dataframes, RDD and SparkSQL
Hands on Experience in developing, testing and maintaining applications on AWS Cloud
Strong hold on AWS Data Analytics Technology Stack (Glue, S3, Lambda, Lake formation, Athena)
Design and implement scalable and efficient data transformation/storage solutions using Snowflake
Experience in Data ingestion to Snowflake for different storage format such Parquet, Iceberg, JSON, CSV etc
Experience in using DBT (Data Build Tool) with snowflake for ELT pipeline development
Experience in Writing advanced SQL and PL SQL programs
Hands On Experience for building reusable components using Snowflake and AWS Tools/Technology
Should have worked at least on two major project implementations
Exposure to data governance or lineage tools such as Immuta and Alation is added advantage

Job Responsibility

Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data
Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
Development of processing and analysis algorithms fit for the intended data complexity and volumes
Collaboration with data scientist to build and deploy machine learning models
To build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime

AWS Data Engineer

Join us as an AWS Data Engineer Barclays, responsible for supporting the success...

Location

India , Pune

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

Firsthand Experience in developing, testing and maintaining applications on AWS Cloud
Strong hands‑on experience with the AWS Data Analytics stack (Amazon S3, AWS Glue, Athena, Lambda, IAM, Lake Formation, KMS, STS, and Step Functions), with a proven ability to build, test, and support secure, scalable, and well‑governed data pipelines
Firsthand experience in Airflow and PySpark and strong knowledge of Python
Design and implement scalable and efficient data transformation/storage solutions using Snowflake
Experience in Data ingestion to Snowflake for different storage format such Parquet, Iceberg, JSON, CSV etc
Experience in using DBT (Data Build Tool) with snowflake for ELT pipeline development
Experience in Writing advanced SQL and PL SQL programs
Experience in AWS data pipeline development
HandsOn Experience for building reusable components using Snowflake and AWS Tools/Technology
Must have completed two major projects

Job Responsibility

Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data
Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
Development of processing and analysis algorithms fit for the intended data complexity and volumes
Collaboration with data scientist to build and deploy machine learning models

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime

AWS Data Engineer

We are seeking a detail-oriented and capable Data Engineer – AWS to join our Dat...

Location

United Kingdom , London

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

Experience in data engineering or software engineering with data focus
Strong interest in cloud-based data platforms and distributed processing
Good analytical and problem-solving skills
Attention to detail and commitment to data quality and reliability
Effective communication and teamwork skills
Willingness to learn and develop in AWS and modern data engineering practices
Hands-on experience with AWS cloud services, especially AWS Glue, Python / PySpark, SQL querying and data manipulation
Exposure to YAML or configuration-driven pipelines (desirable)
Experience building or supporting data pipelines, ETL/ELT processes
Familiarity with data lakes and/or Lakehouse concepts, Distributed processing frameworks (e.g., Spark)

Job Responsibility

Support delivery across data engineering and platform development initiatives
Collaborate with architects, engineers, and stakeholders to implement data solutions on AWS
Assist in planning and executing engineering tasks, releases, and deliverables
Build and maintain data pipelines and workflows on AWS platforms
Develop ETL/ELT pipelines using AWS Glue, Python / PySpark, SQL, Configuration-driven frameworks (e.g., YAML)
Support ingestion, transformation, and processing of structured and semi-structured data
Contribute to the development of scalable, reusable data components and services
Test and validate data pipelines and processing jobs running on AWS services
Develop and execute data validation and reconciliation queries using SQL
Work with AWS services including AWS Glue, S3-based data lakes, Related data processing and orchestration services

Senior AWS Data Engineer

We are seeking a highly skilled Senior Data Engineer with 5+ years of experience...

Location

India , Pune

Salary:

Not provided

Vidushi Infotech SSP Pvt. Ltd.

Expiration Date

Until further notice

Requirements

5+ years of experience in hands-on programming with Python and PySpark
Expertise in Boto3 and various Python frameworks and libraries, adhering strictly to Python best practices (PEP 8)
Strong experience in Spark SQL and PySpark optimization techniques (e.g., partitioning, caching, broadcast joins)
Deep architectural knowledge of AWS services, including: S3, EC2, Lambda, Redshift, CloudFormation
Advanced understanding of Git (branching strategies, PR reviews)
Experience with JFrog Artifactory for dependency management and artifact storage
Proficiency in CI/CD pipelines and automated testing frameworks
Analytical Mindset: Ability to debug complex, non-obvious issues in distributed systems
Clean Coder: Passion for writing 'clean code' and mentoring junior engineers on maintainability
Architectural Thinking: Ability to explain the 'why' behind choosing specific AWS components over others

Job Responsibility

Design, develop, and maintain complex ETL/ELT pipelines to build high-value data assets
Lead the code refactorization of legacy codebases, improving readability, maintainability, and performance
Perform deep code optimization using Spark SQL and PySpark to handle large-scale datasets efficiently
Implement a Test-Driven Development (TDD) approach, writing comprehensive unit tests to ensure functionality and catch bugs early in the lifecycle
Isolate and resolve difficult bugs, including those related to performance bottlenecks, concurrency issues, and complex logic flaws
Design and deploy solutions utilizing the full AWS stack, explaining the trade-offs and benefits of specific services for various use cases

Fulltime

Senior Software Engineer (AWS & Data)

We are seeking a Senior Software Engineer – AWS & Data to join a remote developm...

Location

Poland

Salary:

130.00 - 150.00 PLN / Hour

Cyclad Sp. z o.o.

Expiration Date

Until further notice

Requirements

Proven experience in AWS cloud-native development, preferably within data-related projects
Strong hands-on experience with AWS Glue and PySpark
Experience with ECS and/or Kubernetes (development and basic administration)
Solid frontend development experience with React
Strong experience in building REST APIs using FastAPI
Familiarity with DevOps tools and practices (e.g., GitHub Actions, Terraform)
Experience with Terraform in infrastructure automation
Knowledge of CI/CD pipeline design and optimization
Exposure to scalable data architecture patterns
Strong problem-solving skills and attention to detail

Job Responsibility

Design, develop, and maintain cloud-native applications using AWS services (including AWS Glue and ECS)
Build and optimize data processing solutions using AWS Glue and PySpark
Develop and maintain RESTful APIs using FastAPI
Design and implement user interfaces using React, ensuring seamless frontend-backend integration
Work with containerized environments (ECS/Kubernetes) for application deployment and management
Collaborate with product owners and cross-functional teams to deliver high-quality solutions
Participate in code reviews and contribute to best development practices
Troubleshoot and resolve issues related to performance, scalability, and security
Continuously improve development processes by staying up to date with new technologies

What we offer

Remote working model
Private medical care with dental care (covering 70% of costs)
Multisport card (also for an accompanying person)
Life insurance
Honey Pot

Fulltime

Senior AWS Data Engineer

This role is to support and enhance enterprise business intelligence and analyti...

Location

United States , Torrance

Salary:

Not provided

Robert Half

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science, Information Technology, Data Engineering, or equivalent practical experience
7+ years of experience building enterprise data platforms or data engineering solutions
5+ years of hands‑on experience with AWS cloud services
Strong hands‑on experience with AWS Glue and PySpark for ETL processing
Experience developing serverless applications using AWS Lambda
Deep expertise with Amazon Redshift, including performance tuning and advanced SQL
Experience with workflow orchestration tools such as Apache Airflow
Experience implementing secure vendor integrations using AWS Transfer Family
Experience designing and supporting data migration and replication pipelines using AWS DMS
Advanced SQL skills and experience analyzing complex datasets

Job Responsibility

Design, build, and maintain scalable ETL/ELT pipelines using AWS Glue with PySpark for large‑scale data processing
Develop and support serverless integrations using AWS Lambda for event‑driven workflows and system integrations
Design and optimize Amazon Redshift data warehouse solutions, including: Advanced SQL analytics, Stored procedures, Performance tuning
Lead implementation of secure vendor file transfer and ingestion solutions using AWS Transfer Family
Design and implement database migration and replication pipelines using AWS Database Migration Service (DMS)
Build and manage workflow orchestration using Apache Airflow or similar orchestration tools
Analyze data quality, transformation logic, and pipeline performance using SQL and data analysis techniques
Troubleshoot and resolve production data pipeline and integration issues across AWS services
Provide technical guidance to development team members on: AWS best practices, Cost optimization, Performance optimization
Partner with enterprise architecture, security, and compliance teams to ensure SOX and regulatory compliance

What we offer

medical, vision, dental, and life and disability insurance
eligible to enroll in our company 401(k) plan

Senior AWS Data Engineer

This role is to support and enhance enterprise business intelligence and analyti...

Location

United States , Torrance

Salary:

Not provided

Robert Half

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science, Information Technology, Data Engineering, or equivalent practical experience
7+ years of experience building enterprise data platforms or data engineering solutions
5+ years of hands‑on experience with AWS cloud services
Strong hands‑on experience with AWS Glue and PySpark for ETL processing
Experience developing serverless applications using AWS Lambda
Deep expertise with Amazon Redshift, including performance tuning and advanced SQL
Experience with workflow orchestration tools such as Apache Airflow
Experience implementing secure vendor integrations using AWS Transfer Family
Experience designing and supporting data migration and replication pipelines using AWS DMS
Advanced SQL skills and experience analyzing complex datasets

Job Responsibility

Design, build, and maintain scalable ETL/ELT pipelines using AWS Glue with PySpark for large‑scale data processing
Develop and support serverless integrations using AWS Lambda for event‑driven workflows and system integrations
Design and optimize Amazon Redshift data warehouse solutions, including: Advanced SQL analytics, Stored procedures, Performance tuning
Lead implementation of secure vendor file transfer and ingestion solutions using AWS Transfer Family
Design and implement database migration and replication pipelines using AWS Database Migration Service (DMS)
Build and manage workflow orchestration using Apache Airflow or similar orchestration tools
Analyze data quality, transformation logic, and pipeline performance using SQL and data analysis techniques
Troubleshoot and resolve production data pipeline and integration issues across AWS services
Provide technical guidance to development team members on: AWS best practices, Cost optimization, Performance optimization
Partner with enterprise architecture, security, and compliance teams to ensure SOX and regulatory compliance

What we offer

medical, vision, dental, and life and disability insurance
eligible to enroll in our company 401(k) plan

Data Engineer (AWS)

The Data Engineer (AWS) role involves designing and implementing data solutions ...

Location

Mexico , Guadalajara

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

4+ years of experience supporting Software Engineering, Data Engineering, or Data Analytics projects specifically with AWS
4+ years of experience on at least one of the following: PySpark or Scala or Data lake house solutions on AWS
2+ years of experience leading a team supporting data related projects to develop end-to-end technical solutions
Ability to travel at least 25%
Undergraduate or Graduate degree preferred
Proficiency in coding skills, utilizing languages such as Python, Java, and Scala
Demonstrate production experience in other core data platforms such as Snowflake, Databricks, Azure, GCP, Hadoop, and more
Possess hands-on knowledge of Cloud and Distributed Data Storage, including expertise in HDFS, S3, ADLS, GCS, Kudu, ElasticSearch/Solr, Cassandra, or other NoSQL storage systems
Exhibit a strong understanding of Data integration technologies, encompassing Spark, Kafka, eventing/streaming, Streamsets, NiFi, AWS Data Migration Services, Azure DataFactory, Google DataProc
Showcase professional written and verbal communication skills

Job Responsibility

Design and implement tailored data solutions to meet customer needs and use cases, spanning from streaming to data lakes, analytics, and beyond within a dynamically evolving technical stack
Provide thought leadership by recommending the most appropriate technologies and solutions for a given use case, covering the entire spectrum from the application layer to infrastructure
Collaborate seamlessly across diverse technical stacks, including Cloudera, Databricks, Snowflake, and AWS
Develop and deliver detailed presentations to effectively communicate complex technical concepts
Generate comprehensive solution documentation, including sequence diagrams, class hierarchies, logical system views, etc
Adhere to Agile practices throughout the solution development process
Design, build, and deploy databases and data stores to support organizational requirements

Fulltime

Select Country

Data Engineer - AWS, PySpark

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Data Engineer - AWS, PySpark

Data Engineer - AWS, Pyspark

AWS Data Engineer

AWS Data Engineer

Senior AWS Data Engineer

Senior Software Engineer (AWS & Data)

Senior AWS Data Engineer

Senior AWS Data Engineer

Data Engineer (AWS)

Our AI answers in your language