Data Engineer - PySpark Jobs

2515 Job Offers

Staff Software Engineer, Privacy & Data Security
Location: United Kingdom, London
Salary: Not provided
Company: Vanta
Expiration Date: Until further notice
Data Engineer
Location: United States
Salary: 150000.00 - 185000.00 USD / Year
Company: Onebrief
Expiration Date: Until further notice
Software Engineer, Distributed Data Systems
Location: United States, San Francisco
Salary: 230000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Software Engineer, Data Infrastructure - Research
Location: United States, San Francisco
Salary: 250000.00 - 380000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Data Scientist, Financial Engineering
Location: United States, San Francisco
Salary: 230000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Machine Learning Engineer, Distributed Data Systems
Location: United States, San Francisco
Salary: 295000.00 - 445000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Data Centre Engineer
Location: United Kingdom, Acton, West London
Salary: 30000.00 - 42000.00 GBP / Year
Company: 360 Resourcing Solutions
Expiration Date: Until further notice
Data Centre Engineer
Location: United Kingdom, Welwyn Garden City
Salary: 30000.00 - 42000.00 GBP / Year
Company: 360 Resourcing Solutions
Expiration Date: Until further notice
Ads Data Engineer
Location: United States, Boston; Austin; Miami
Salary: Not provided
Company: Hopper
Expiration Date: Until further notice
Staff Data Engineer
Location: United States
Salary: 213000.00 - 251000.00 USD / Year
Company: Vanta
Expiration Date: Until further notice
Ads Data Engineer
Location: Canada, Toronto
Salary: Not provided
Company: Hopper
Expiration Date: Until further notice
Senior Software Engineer, Data Acquisition
Location: United States, San Francisco
Salary: 293000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Software Engineer, Data Acquisition
Location: United States, San Francisco
Salary: 293000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Data Engineer (Compliance)
Location: Singapore, Singapore
Salary: Not provided
Company: Plaud
Expiration Date: Until further notice
Data Engineer
Location: Singapore, Singapore
Salary: Not provided
Company: Plaud
Expiration Date: Until further notice
Senior Data Engineer (Data Warehouse)
Location: Singapore, Singapore
Salary: Not provided
Company: Plaud
Expiration Date: Until further notice
Software Engineer, Data Infrastructure
Location: United States, San Francisco
Salary: 185000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Software Engineer, Research - Human Data
Location: United States; United Kingdom, San Francisco; London
Salary: 230000.00 - 385000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Data Engineer
Location: Netherlands, Amsterdam
Salary: 65000.00 - 75000.00 EUR / Year
Company: WeTravel
Expiration Date: Until further notice
Data Engineer
Location: Belgium, Brussels
Salary: Not provided
Company: Sopra Steria
Expiration Date: Until further notice

About the Data Engineer - PySpark role

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python API for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT (Extract, Load, Transform) processes. They build and orchestrate data pipelines that ingest data from diverse sources (such as databases, APIs, and log files) into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount: they implement data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They are also tasked with optimizing the performance and cost of these data systems, tuning Spark jobs for efficiency, and automating deployment through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as they are the primary tools for distributed data processing. Strong knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical skill, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the Data Engineer - PySpark jobs available and take the next step in your career.
