CrawlJobs Logo
Briefcase Icon
Category Icon

Data Engineer - Pyspark Jobs

2342 Job Offers

Filters
New
Data Engineer
Save Icon
Senior Data Engineer sought for a Bengaluru-based role to design and optimize large-scale data pipelines using PySpark, Python, Hadoop, and ETL. Requires 7–10 years of experience with RDBMS, Unix, and exposure to GCP Vertex AI or Agentic AI. Collaborate with AI teams on advanced cloud analytics. ...
Location Icon
Location
India , Bengaluru
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
New
Lead Specialty Software Engineer - Capital Markets Reference Data
Save Icon
Lead Specialty Software Engineer role at Wells Fargo, modernizing the Capital Markets Reference Data Platform. You will architect high-performance distributed systems using Java 17+, Spring, Kafka, and SQL, while accelerating delivery with agentic AI tools. Based in Charlotte, this position offer...
Location Icon
Location
United States , Charlotte
Salary Icon
Salary
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
Until further notice
New
Senior Manager, Data Engineering
Save Icon
Senior Manager, Data Engineering at Stanford Health Care in Palo Alto, CA. Lead multiple teams building enterprise data pipelines and cloud platforms (GCP, AWS, Azure) to drive clinical innovation and operational efficiency. Requires 8+ years in data engineering, 3+ years managing technical teams...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
83.98 - 111.27 USD / Hour
stanfordhealthcare.org Logo
Stanford Health Care
Expiration Date
Until further notice
New
Data Engineer - Python AND Kafka AND (Hadoop OR HDFS OR Hive) AND Snowflake AND apache AND (iceberg
Save Icon
Seeking a skilled **Data Engineer** for a critical migration from on-prem **DataLake** to **AWS LakeHouse** in **Bangalore, India**. You will leverage **Python**, **Kafka**, **Hadoop/HDFS/Hive**, **Snowflake**, and **Apache Iceberg** to refactor pipelines and ensure data integrity. Requires 3-5 y...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
New
Manager, Software Engineering (Data Security)
Save Icon
Palo Alto Networks seeks an experienced Engineering Manager to lead Data Security innovation in Santa Clara, CA. You will drive cloud-native SaaS solutions, leveraging 8+ years in enterprise development and expertise in microservices, AWS/GCP, and security protocols. This role demands 3+ years ma...
Location Icon
Location
United States , Santa Clara
Salary Icon
Salary
165000.00 - 267500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
New
Lead Data Engineer
Save Icon
Lead Data Engineer sought to drive technical leadership across data engineering, delivery, and governance in the UK. You’ll mentor teams in an onshore-offshore model, leveraging ETL/ELT, AWS (Glue), and SQL. This role demands hands-on experience with big-data solutions, CI/CD for data, and Python...
Location Icon
Location
United Kingdom
Salary Icon
Salary
70000.00 GBP / Year
zebrapeople.com Logo
Zebra People
Expiration Date
Until further notice
New
Principal Engineer - Data Engineering
Save Icon
Principal Engineer – Data Engineering at Wells Fargo in Bengaluru. Lead HR data strategy, architecting scalable cloud-native solutions on Azure Fabric and GCP. Drive migration from legacy platforms to modern lakehouse architectures, enabling advanced analytics and Generative AI. Requires 7+ years...
Location Icon
Location
India , Bengaluru
Salary Icon
Salary
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
Until further notice
New
Data Engineer - Streaming (WkStream 2 - Kafka)
Save Icon
Seeking a **Data Engineer - Streaming** in **Bangalore, India** to build high-performance ingestion pipelines using **PySpark Structured Streaming** and **Apache Kafka**. You will design applications that read from Confluent Kafka, parse Avro/JSON payloads, and write atomically to **Apache Iceber...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
New
Data Engineer III - Analytics/Information Solutions
Save Icon
Seeking a skilled **Data Engineer III** to join MUSC’s Analytics team in **South Carolina**. You will serve as a subject matter expert, resolving complex technical issues and building decision-support dashboards. Requires a bachelor’s degree with 5 years’ experience; **Epic Certification** is pre...
Location Icon
Location
United States , South Carolina
Salary Icon
Salary
Not provided
muschealth.org Logo
MUSC Health
Expiration Date
Until further notice
New
Data Engineer
Save Icon
Join a fast-growing AI startup building "Google Analytics for LLMs" as a Data Engineer in New York. You’ll scale real-time data infrastructure, optimize pipelines with Snowflake, ClickHouse, dbt, and Dagster, and support ML workflows. Strong SQL, Python, and AWS skills required. Competitive compe...
Location Icon
Location
United States , New York
Salary Icon
Salary
Not provided
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
New
Data Engineering Head
Save Icon
Seeking a visionary **Data Engineering Head** in **Pune, India** to lead a high-impact Data & AI practice. This strategic role demands 20+ years of experience, deep expertise in **cloud platforms (AWS, Azure, GCP)**, and a strong grasp of **AI/ML** and **Agentic AI**. You will drive P&L, foster i...
Location Icon
Location
India , Pune
Salary Icon
Salary
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
Until further notice
New
Data Engineer - Gcp & Bigquery
Save Icon
Seeking an experienced Data Engineer to design scalable, cloud-native solutions on Google Cloud Platform. This role requires 5-9 years of expertise in SQL, PL/SQL, and Google BigQuery, with familiarity in GCP services like Data Fusion and Cloud Storage. Based in Pune, India, you will build and op...
Location Icon
Location
India , Pune
Salary Icon
Salary
Not provided
vodafone.com Logo
Vodafone
Expiration Date
Until further notice
New
Principal Engineer - Data path - HPE Alletra Storage MP X10000 (Object Storage product development)
Save Icon
Principal Engineer role driving data path architecture for HPE Alletra Storage MP X10000 Object Storage in Bengaluru. Requires 15+ years in storage engineering, delivering V1 products for AI/cloud use cases. Lead end-to-end development, performance tuning, and distributed systems innovation. Enjo...
Location Icon
Location
India , Bengaluru
Salary Icon
Salary
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
New
Senior Technical Support Engineer – Enterprise & Data Center Switching (L3 TAC)
Save Icon
Senior Technical Support Engineer (L3 TAC) role at Juniper Networks in Bengaluru, India. You will own complex escalations for enterprise and data-center switching, leveraging deep expertise in BGP, EVPN, VXLAN, and routing protocols. Requires 7–10+ years of networking experience and a Bachelor's ...
Location Icon
Location
India , Bengaluru
Salary Icon
Salary
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
New
Associate Data Engineer
Save Icon
Amgen seeks an Associate Data Engineer in Hyderabad, India, to join its mission-driven biotech team. You will design and maintain complex ETL/ELT pipelines using Databricks, PySpark, and AWS, processing large-scale datasets. Ideal candidates have 3-6 years of experience with big data technologies...
Location Icon
Location
India , Hyderabad
Salary Icon
Salary
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
New
Senior Specialty Software Engineer - Capital Markets Reference Data
Save Icon
Senior Specialty Software Engineer needed for Wells Fargo’s Capital Markets Reference Data Platform in Charlotte, NC. Design and build high-performance Java microservices for financial instrument data ingestion and distribution. Requires 4+ years in Java 17+, Spring ecosystem, and SQL optimizatio...
Location Icon
Location
United States of America , Charlotte
Salary Icon
Salary
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
Until further notice
New
Data Engineer
Save Icon
Data Engineer needed in Bangalore to lead on-prem DataLake migration to AWS LakeHouse. Leverage 3-5 years of experience with Python, SQL, and Spark to refactor pipelines and ensure data integrity. Engage with stakeholders while optimizing legacy patterns for Snowflake and Iceberg. Join a collabor...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
New
Senior Big Data Engineer
Save Icon
Senior Big Data Engineer sought by a fast-growing fintech leader in Athens, Greece. Leverage 4+ years of experience with the Hadoop ecosystem (YARN, HDFS, Spark) to lead technical improvements in scalability and efficiency. Design end-to-end data pipelines using Scala, Spark, and Python, while co...
Location Icon
Location
Greece , Athens Northern Suburbs
Salary Icon
Salary
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
Until further notice
New
Data Engineer
Save Icon
Seeking a skilled **Data Engineer** for a critical migration project in **Bangalore, India**. You will lead the transition from an on-prem **DataLake** to **AWS LakeHouse**, leveraging **Python**, **SQL**, and **Apache Spark**. This role demands 3-5 years of experience with **SDLC**, **CI/CD**, a...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
New
Data Engineer Internship - Fall 2026
Save Icon
Join Amazon as a Data Engineer Intern for Fall 2026 and gain hands-on experience designing scalable data pipelines and optimizing SQL/NoSQL databases. This full-time, 12-week internship in Seattle, Bay Area, or other US locations requires Python or Scala skills and enrollment in a technical degre...
Location Icon
Location
United States , Seattle
Salary Icon
Salary
101300.00 - 160000.00 USD / Year
Amazon
Expiration Date
Until further notice
Previous 1 2 3 4 5 6 ... 118 Next

About the Data Engineer - Pyspark role

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary