CrawlJobs Logo

Filters

Location
Salary
Clear all filters

Data Engineer - Pyspark Jobs

2485 Job Offers

Lead Data Engineer
Save Icon
Lead Data Engineer role for a UK resident. Develop and deploy data products on Azure, using ADF, Databricks, and dbt. Lead a team, ensure data governance, and build scalable pipelines. Enjoy remote work, 25+8 days holiday, pension, and enhanced family pay.
Location Icon
Location
United Kingdom
Salary Icon
Salary
75000.00 GBP / Year
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Junior Data Engineer
Save Icon
Join our client's team in Boston as a Junior Data Engineer. This role requires a strong academic background in Computer Science or a related field, with 1-2+ years of experience using Python and SQL. You will build data pipelines, work with large datasets, and utilize frameworks like Spark. Exper...
Location Icon
Location
United States , Boston
Salary Icon
Salary
50.00 - 60.00 USD / Hour
bhsg.com Logo
Beacon Hill
Expiration Date
Until further notice
Junior Data Engineer
Save Icon
Join our client in Boston as a Junior Data Engineer. Utilize your Python and advanced SQL skills to work with large datasets and build data pipelines. This role is ideal for a recent top-tier graduate with 1-2+ years of experience. Knowledge of Databricks, Spark, or Kafka is a plus.
Location Icon
Location
United States , Boston
Salary Icon
Salary
50.00 - 60.00 USD / Hour
bhsg.com Logo
Beacon Hill
Expiration Date
Until further notice
Data Engineer
Save Icon
Seeking a hands-on Data Engineer to build and optimize production ETL pipelines. This role requires deep expertise in Informatica (PowerCenter/IICS), Snowflake, and complex SQL for data transformations. You will troubleshoot issues and support data workflows in a delivery-focused environment. The...
Location Icon
Location
Salary Icon
Salary
Not provided
dataideology.com Logo
Data Ideology
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Seeking a Senior Data Engineer in Hyderabad to design and optimize scalable data pipelines on AWS and Databricks. You will leverage Python/Scala and ETL orchestration within a SAFE Agile team to ensure data integrity and performance. This role requires 6+ years' experience, mastery of modern data...
Location Icon
Location
India , Hyderabad
Salary Icon
Salary
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Data Engineer
Save Icon
Join a dynamic healthcare organization in Keller, TX as a Data Engineer. Design and build foundational data systems using Apache Spark, Python, and Hadoop to enable predictive analytics. Transform clinical and financial data into strategic insights, impacting key decisions. Enjoy full benefits in...
Location Icon
Location
United States , Keller, TX
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Data Analyst / Engineer
Save Icon
Join our team as a Data Analyst/Engineer in a remote contract role. Leverage your 7+ years of expertise in SQL Server, Teradata, and financial data processes to automate workflows and deliver precise insights. Enjoy comprehensive benefits including medical, dental, and 401(k) while driving effici...
Location Icon
Location
United States , Woburn
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
GCP Data Engineer
Save Icon
Lead the design of next-gen data solutions on GCP, tackling low-latency, high-throughput engineering challenges at true scale. As the BigQuery authority, you'll own automated pipelines and shape a platform powering real-time insights. Join a skilled UK team in a complex, scaling environment with ...
Location Icon
Location
United Kingdom
Salary Icon
Salary
75000.00 - 85000.00 GBP / Year
linuxrecruit.co.uk Logo
Linux Recruit
Expiration Date
Until further notice
Software Engineer, Data Engine
Save Icon
Join our team in San Francisco as a Software Engineer for the Data Engine. You will build robust systems and tools to collect, process, and manage large-scale robotic training datasets. This role requires expertise in Rust/C++ and involves hands-on work across hardware, software, and data pipelin...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
120000.00 - 160000.00 USD / Year
workatastartup.com Logo
YC Work at a Startup
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Seeking a Senior Data Engineer with current experience in asset management, investment accounting, or insurance data. This remote UK role involves building scalable data pipelines on GCP (BigQuery, DBT, Databricks) and delivering insights from large financial datasets. Strong Python and SQL skill...
Location Icon
Location
United Kingdom
Salary Icon
Salary
Not provided
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join our team in Roswell, GA, as a Senior Data Engineer. Design and build scalable data solutions using Microsoft Fabric, Databricks, and Azure. Leverage your expertise in ETL, SQL, Python, and Power BI to enable data-driven decisions. Enjoy comprehensive benefits in this key role partnering with...
Location Icon
Location
United States , Roswell
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Data Engineering Manager
Save Icon
Lead our Data Engineering team in Austin, building a cutting-edge Data Platform with tools like Snowflake and Airflow. You'll manage projects, mentor engineers, and deliver revenue-generating data products in the fast-paced freight industry. Enjoy a collaborative culture, comprehensive benefits, ...
Location Icon
Location
United States , Austin
Salary Icon
Salary
Not provided
arrivelogistics.com Logo
Arrive Logistics
Expiration Date
Until further notice
Intern, Data Engineering
Save Icon
Join Workato's enterprise orchestration team as a Data Engineering Intern in Singapore. You will design data pipelines, optimize workflows with dbt Cloud and Snowflake, and contribute to AI initiatives. This 6-month role requires SQL skills, database knowledge, and a passion for data transformati...
Location Icon
Location
Singapore
Salary Icon
Salary
Not provided
workato.com Logo
Workato
Expiration Date
Until further notice
Lead Data Engineer
Save Icon
Lead Data Engineer role in Pennington, US. Drive enterprise-scale data initiatives using modern tools like Airflow, OpenShift, and Python. Lead scrum teams, optimize ETL pipelines, and implement CI/CD automation. Strong leadership and hands-on technical expertise in data orchestration required.
Location Icon
Location
United States , Pennington
Salary Icon
Salary
Not provided
enormousenterprise.com Logo
Enormous Enterprise
Expiration Date
Until further notice
Data Engineer 3
Save Icon
Seeking a Senior Data Engineer in Chennai to design and deploy robust data architecture and pipelines. Leverage 5-7 years of experience with on-prem (Kubernetes, Teradata) and cloud platforms (Databricks, AWS). Ensure data quality and governance while building solutions with AI/ML. Enjoy a compre...
Location Icon
Location
India , Chennai
Salary Icon
Salary
Not provided
comcastcorporation.com Logo
Comcast
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join our team as a Senior Data Engineer in Ho Chi Minh City. Design and build robust cloud data pipelines using Azure, Fabric, and Spark. Leverage your expertise in SQL, Python, and ETL/ELT to enable advanced analytics. Enjoy competitive benefits, flexible work, and high-impact projects.
Location Icon
Location
Vietnam , Ho Chi Minh City
Salary Icon
Salary
Not provided
tcdata.vn Logo
TC Data
Expiration Date
Until further notice
Data Engineer
Save Icon
Join Octopus Energy in London as a Data Engineer. Build scalable databases, APIs, and pipelines using SQL, Python, and Airflow to support global energy markets. Work on critical forecasting and trading systems while driving the transition to Net Zero. A role offering ownership, travel, and a uniq...
Location Icon
Location
United Kingdom , London
Salary Icon
Salary
Not provided
octopus.energy Logo
Octopus Energy
Expiration Date
Until further notice
Senior Data Platform Engineer
Save Icon
Join our Data & Machine Learning Platform team in Paris as a Senior Data Platform Engineer. Design and manage scalable data infrastructure using Python, Kafka, and Kubernetes on AWS/Azure/GCP. Enjoy top benefits like comprehensive health insurance and enhanced parental leave while building soluti...
Location Icon
Location
France , Paris
Salary Icon
Salary
Not provided
doctolib.fr Logo
Doctolib
Expiration Date
Until further notice
Data Scientist/Systems Engineer Internship
Save Icon
Join our team in Heredia, Costa Rica, as a Data Scientist/Systems Engineer Intern. You will analyze device telemetry and network data using Python and Power BI to drive product reliability and customer insights. This role involves data modeling, ETL processes, and collaborating with engineering t...
Location Icon
Location
Costa Rica , Heredia
Salary Icon
Salary
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Data Platform Engineer
Save Icon
Join Aurora's Data Platform Team in Pittsburgh as a Data Platform Engineer. Design and implement scalable data pipelines for autonomy sensor data and vehicle logs using GoLang/Python and AWS. You will enhance data availability and lifecycle management while collaborating across teams. This role o...
Location Icon
Location
United States , Pittsburgh
Salary Icon
Salary
105000.00 - 157000.00 USD / Year
aurora.tech Logo
Aurora Innovation
Expiration Date
Until further notice

About the Data Engineer - Pyspark role

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.