CrawlJobs

Data Engineer - PySpark Jobs

2499 Job Offers

Data Quality Engineer, AI Business
Seeking a Data Quality Engineer to be the quality guardian for AI data services. You will design measurement systems and automation to ensure trustworthy, scalable data. This role requires 5+ years in quality engineering, proficiency in Python/SQL, and systems thinking to embed quality into study...
Location: Not provided
Salary: Not provided
Prolific (prolific.com)
Expiration Date: Until further notice
Senior Machine Learning Engineer, Data for Embodied AI
Join us in London as a Senior Machine Learning Engineer for Embodied AI. You will build and scale next-generation world model architectures and high-throughput data pipelines. Your work on multimodal data acquisition and curation will directly accelerate the training of advanced robotics and foun...
Location: United Kingdom, London
Salary: Not provided
Wayve (wayve.ai)
Expiration Date: Until further notice
Data Engineer
Join Suno's founding team as a Data Engineer in a key US tech hub. Design and scale core data foundations using SQL, Python, and modern tools like Airflow and Snowflake. Enjoy equity, unlimited PTO, and a culture passionate about music and engineering excellence.
Location: United States, Boston, NYC, Los Angeles, San Francisco
Salary: 170000.00 - 240000.00 USD / Year
Suno (suno.ai)
Expiration Date: Until further notice
Machine Learning Data Engineer - Systems & Retrieval
Join our team in Palo Alto as a Machine Learning Data Engineer focused on Systems & Retrieval. You will architect high-performance data pipelines and retrieval systems for LLMs, using Python and distributed data systems. This role is central to building scalable, secure infrastructure that powers...
Location: United States, Palo Alto
Salary: Not provided
Zyphra (zyphra.com)
Expiration Date: Until further notice
Analyst, Data Engineering, Digital & Artificial Intelligence Department
Join Bombardier's Digital & AI team in Dorval as a Data Engineering Analyst. Design and deploy cutting-edge generative AI agents and agentic workflows using LLMs and frameworks like TensorFlow. Leverage your expertise in Python, knowledge graphs, and Microsoft Azure to build scalable enterprise s...
Location: Canada, Dorval
Salary: Not provided
Bombardier (bombardier.com)
Expiration Date: Until further notice
Data Engineer III
Join our team in Johnston as a Data Engineer III. Design and implement cutting-edge data pipelines using Azure Data Factory, SQL, and modern lakehouse architectures. You'll ensure data quality and collaborate on impactful projects, supported by excellent benefits and career development opportunit...
Location: United States, Johnston
Salary: 103040.00 - 148100.00 USD / Year
FM (fm.com)
Expiration Date: Until further notice
Data Engineer
Join our team in Columbus, Ohio, as a Data Engineer on a long-term contract. You will optimize SQL Server and Azure SQL performance through stored procedure tuning and indexing strategies. This role involves designing scalable data solutions and collaborating within a dynamic team. We offer compr...
Location: United States, Columbus
Salary: Not provided
Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Senior Data Engineer
Join our team in Denver as a Senior Data Engineer. Design and implement scalable data pipelines using Python, SQL, and ETL tools. Leverage your expertise in cloud, on-prem, and Microsoft Fabric to drive business insights. We offer comprehensive benefits including medical, dental, vision, and a 40...
Location: United States, Denver
Salary: Not provided
Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Senior Data Engineer
Seeking a Senior Data Engineer with 5+ years in BI, ETL, and Data Warehousing. Design and build robust data platforms using SQL, Python, and tools like Spark/Databricks and Azure Data Factory. Enjoy benefits including medical care and training in this Poland-based role.
Location: Poland
Salary: Not provided
DCG Sp. z o. o. (dcg.pl)
Expiration Date: Until further notice
Senior Azure Data Engineer
Join our remote team as a Senior Azure Data Engineer. Design and build scalable cloud data platforms using Azure Data Factory and modern Databricks (Unity Catalog). Leverage your 5+ years in data engineering and strong Python/SQL skills. Enjoy benefits like private medical care, a Multisport card...
Location: Poland, Warszawa
Salary: 150.00 - 180.00 PLN / Hour
Cyclad Sp. z o.o. (cyclad.pl)
Expiration Date: Until further notice
Senior Marketing Data Engineer
Lead the data architecture for EF Tours' transition to Salesforce as a Senior Marketing Data Engineer. Design and build production-grade dbt models in Snowflake to fuel SFMC campaigns with precise segmentation. Refactor performance reporting and consolidate data into a marketing 'Golden Record'. ...
Location: Panama, Panama City
Salary: Not provided
EF Education First (careers.ef.com)
Expiration Date: Until further notice
Senior Data Integration Engineer
Join our multidisciplinary data team in Amsterdam as a Senior Data Integration Engineer. You will design and maintain the integration layer for a modern Data Lake, using enterprise ETL tools like IBM DataStage. Leverage your strong Oracle and SQL expertise to ensure reliable, scalable data delive...
Location: Netherlands, Amsterdam
Salary: Not provided
Levy Professionals (levy-professionals.com)
Expiration Date: Until further notice
Senior Data Engineer
Join our team in Amsterdam as a Senior Data Engineer. You will support product teams by building and optimizing cloud-native data pipelines. This role requires expertise in cloud platforms, SQL, Python, and CI/CD to enhance reliability and efficiency. Drive automation and quality in a dynamic, co...
Location: Netherlands, Amsterdam
Salary: Not provided
Levy Professionals (levy-professionals.com)
Expiration Date: Until further notice
Senior Data Engineer
Join our team in Utrecht as a Senior Data Engineer. You will leverage your 5+ years of DevOps experience with Azure Cloud, Databricks, and Apache Airflow. Your role focuses on building robust CI/CD pipelines, automating with Python/PySpark, and ensuring data quality. We value ownership, automatio...
Location: Netherlands, Utrecht
Salary: Not provided
Levy Professionals (levy-professionals.com)
Expiration Date: Until further notice
Ecom Data Engineer Specialist
Join PepsiCo's Data Management team as an Ecom Data Engineer Specialist in Purchase, NY. You will own end-to-end data pipeline development, leveraging Python, SQL, and cloud platforms like Snowflake. This role focuses on building high-volume ETL/ELT processes and distributed systems for eCommerce...
Location: United States, Purchase, New York
Salary: 64900.00 - 132550.00 USD / Year
PepsiCo (pepsico.com)
Expiration Date: Until further notice
Principal Data Engineer
Lead the development of PepsiCo's flagship data products as a Principal Data Engineer in Plano. Design and scale high-volume cloud data pipelines using Azure, Databricks, and Python to power analytics and ML. This senior role requires deep expertise in data architecture, ETL/ELT, and SQL optimiza...
Location: United States, Plano
Salary: 89000.00 - 149000.00 USD / Year
PepsiCo (pepsico.com)
Expiration Date: Until further notice
Big Data & Scala Engineer
Seeking a Big Data & Scala Engineer for a hybrid role in Porto. You will work with Hadoop, Spark, Kafka, and microservices in a modern tech environment. We offer a personalized career path, health insurance, and a flexible work policy to ensure an excellent work-life balance. Fluency in English i...
Location: Portugal, Porto
Salary: Not provided
We Are Meta (wearemeta.io)
Expiration Date: Until further notice
Data Engineer with GCP
Join WE ARE META as a Data Engineer with GCP expertise. This hybrid role in Porto requires 3+ years in data engineering, ETL, SQL, and hands-on GCP experience. Fluency in English and French is mandatory. Enjoy a welcome kit, health insurance, career growth, and a Coverflex meal card.
Location: Portugal, Porto
Salary: Not provided
We Are Meta (wearemeta.io)
Expiration Date: Until further notice
Lead Data Engineer
Lead Data Engineer role to design and build scalable data products using AWS, GCP, and big data tech like Spark & Kafka. You'll lead a team, applying agile principles and engineering excellence. Enjoy a competitive package with bonus, pension, and flexible UK locations.
Location: United Kingdom, Belfast; Birmingham; Glasgow; Bristol; Manchester; London
Salary: Not provided
Plusnet (plus.net)
Expiration Date: Until further notice
Senior Azure Data Engineer with Databricks
Seeking a Senior Azure Data Engineer with Databricks expertise in Poland. You will design at-scale data infrastructure, build processing patterns, and develop automated pipelines using Azure, Databricks, Spark, and Python. This role requires strong SQL, CI/CD, and data modeling skills. We offer p...
Location: Poland
Salary: Not provided
DCG Sp. z o. o. (dcg.pl)
Expiration Date: Until further notice

About the Data Engineer - PySpark role

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark plays a pivotal role in the modern data ecosystem, building the data infrastructure that powers analytics, machine learning, and business intelligence. These professionals turn raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can use big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities span the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work is writing efficient, scalable code in PySpark, the Python API for Apache Spark, to run complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate pipelines that ingest data from diverse sources (such as databases, APIs, and log files) into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Data quality and reliability are paramount: they implement data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They are also responsible for optimizing the performance and cost of these systems, tuning Spark jobs for efficiency, and automating deployment through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific skill set is required. Mastery of Python and PySpark is non-negotiable, as these are the primary tools for distributed data processing. Strong SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical skills, successful candidates bring strong problem-solving abilities for debugging and optimizing data flows, a keen eye for system design and architecture, and the collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and help establish data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the Data Engineer - PySpark jobs available and take the next step in your career.