CrawlJobs Logo

Filters

Location
Salary
Clear all filters

Data Engineer - Pyspark Jobs

2499 Job Offers

Engineering Manager - Data Partner Experience
Save Icon
Lead the Data Partner Experience team at Plaid in New York. Drive 20x partner growth and 10x faster onboarding by managing critical infrastructure and API integrations. You'll need 6+ years in engineering and 3+ managing teams to build seamless partner tools.
Location Icon
Location
United States , New York
Salary Icon
Salary
216000.00 - 367200.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Data Engineering Manager
Save Icon
Lead our data engineering team in San Antonio, leveraging your 7+ years of experience and dbt expertise. You will design dimensional models and manage production data transformations, partnering with stakeholders to deliver key insights. This role offers a full benefits package including medical,...
Location Icon
Location
United States , San Antonio
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our transportation client in Memphis as a Data Engineer. Build scalable, cloud-based data pipelines using Python, SQL, and Azure (Data Factory, Synapse) to power logistics and supply chain analytics. This hands-on role requires 3+ years of experience designing ETL processes. We offer compreh...
Location Icon
Location
United States , Memphis
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Senior Google Cloud Data Engineer
Save Icon
Seeking a Senior Google Cloud Data Engineer in NYC for remote/hybrid work. Drive cybersecurity analytics projects independently, focusing on anomaly detection and threat visualization. Must have expert skills in Python, SQL, BigQuery, Dataflow, and Looker Core. Enjoy flexibility, career growth, a...
Location Icon
Location
United States , New York City
Salary Icon
Salary
170000.00 - 240000.00 USD / Year
valtech.com Logo
Valtech
Expiration Date
Until further notice
Product Manager, Data Engine
Save Icon
Lead the evolution of the Public Sector Data Engine as a Technical Product Manager in Washington, DC. You will architect ML Ops tooling and build a foundational engine for AI model development and evaluation. This role requires a background in software engineering, computer vision/ML Ops, and an ...
Location Icon
Location
United States , Washington, DC
Salary Icon
Salary
214500.00 - 267300.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Lead Data Engineer - Data Pipelines
Save Icon
Lead Data Engineer role in Prague, building scalable data pipelines on platforms like Databricks. Utilize Python and strong SQL to transform billions of transactions into economic insights. Drive technical design, mentor engineers, and ensure platform reliability for global clients.
Location Icon
Location
Czechia , Prague
Salary Icon
Salary
Not provided
mastercard.com Logo
Mastercard
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our team as a Data Engineer in Saddle Brook. Design and optimize robust data infrastructure using advanced SQL, Python, and Azure services. Build efficient ETL pipelines to ensure data integrity and support key business insights. We offer comprehensive benefits including medical, dental, and...
Location Icon
Location
United States , Saddle Brook
Salary Icon
Salary
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Data Engineer
Save Icon
Seeking a skilled Data Engineer in Chennai to design and maintain scalable ETL pipelines and data lake solutions. The role requires 5+ years' experience with Azure Data Factory, Snowflake, and SQL, ideally within banking. You will ensure data integrity, security, and compliance while collaboratin...
Location Icon
Location
India , Chennai
Salary Icon
Salary
Not provided
whiteblue.com Logo
WhiteBlue
Expiration Date
Until further notice
Data QA Engineer
Save Icon
Join our team as a Data QA Engineer in Chennai. You will validate critical NCUA regulatory data extracts using Snowflake SQL and ETL testing. This role requires strong data quality analysis skills and experience with compliance filings. Ensure data accuracy and integrity in a fast-paced financial...
Location Icon
Location
India , Chennai
Salary Icon
Salary
Not provided
whiteblue.com Logo
WhiteBlue
Expiration Date
Until further notice
Data Engineer 2
Save Icon
Join Uber's Delivery Data Solutions team in Bangalore as a Data Engineer. You will build batch and real-time data products, develop metrics, and optimize data infrastructure. The role requires expertise in Python/Java, Spark, Kafka, and data warehousing. Be part of the center of excellence for da...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
uber.com Logo
Uber
Expiration Date
Until further notice
Data Engineer – Java Focused
Save Icon
Join our international team in Veldhoven as a Java-focused Data Engineer. Build scalable ETL pipelines and microservices using Java 11 and Spring Boot. Apply your expertise in SQL, data modeling, and AWS to transform data into insights. Enjoy a trust-based culture, extensive training, and a vibra...
Location Icon
Location
Netherlands , Veldhoven
Salary Icon
Salary
Not provided
amaris.com Logo
Amaris Consulting
Expiration Date
Until further notice
Engineering Manager – Data Center Commissioning
Save Icon
Lead capital projects as an Engineering Manager for Data Center Commissioning in Dallas. You will guide project engineers, oversee budgets using SAP, and ensure technical objectives are met. This role requires an accredited engineering degree and strong leadership skills. We offer comprehensive b...
Location Icon
Location
United States , Dallas
Salary Icon
Salary
Not provided
veolianorthamerica.com Logo
Veolia
Expiration Date
Until further notice
Senior Software Engineer | Azure Data Analytics
Save Icon
Join Microsoft's Azure Data Engineering team in Vancouver as a Senior Software Engineer. You will build the data platform for the AI era, focusing on products like Microsoft Fabric and Azure Synapse. This role requires expertise in distributed systems and languages like C# or Python. Help transfo...
Location Icon
Location
Canada , Vancouver
Salary Icon
Salary
114400.00 - 203900.00 CAD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Principal Engineer – CMMS Data
Save Icon
Lead the global CMMS data strategy and execution for a leading biotech firm in Hyderabad. This principal engineering role focuses on asset lifecycle management, system reliability, and compliance within regulated facilities. You will drive improvements using data analytics and collaborate with cr...
Location Icon
Location
India , Hyderabad
Salary Icon
Salary
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our Client & Investor Lifecycle team in Hyderabad as a Data Engineer. You will design scalable data pipelines using Databricks, Apache Spark, and Fenergo APIs for critical KYC/AML operations. Transform complex data into actionable insights by developing executive Power BI dashboards. This ro...
Location Icon
Location
India , Hyderabad
Salary Icon
Salary
Not provided
alterdomus.com Logo
Alter Domus
Expiration Date
Until further notice
Data Center Engineer
Save Icon
Join NTT DATA as a Data Center Engineer in a remote role based in Virginia. Utilize your 5+ years of experience and technical skills in hardware installation, cabling, and incident resolution. This position offers a dynamic environment for mentorship, collaboration, and providing exceptional cust...
Location Icon
Location
United States of America , Remote, Virginia
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Data Engineering Manager
Save Icon
Lead a team of 6-12 data engineers building a scalable Affiliate Marketing data platform. You will provide technical leadership, foster a collaborative culture, and deliver AI-ready analytics solutions. This role requires 8+ years in data engineering with strong people management and Kimball mode...
Location Icon
Location
Germany; Romania; United Kingdom; Spain; Italy; Poland , Berlin; Hannover; Iași; London; Madrid; Milano; München; Warsaw
Salary Icon
Salary
Not provided
awin.com Logo
Awin Global
Expiration Date
Until further notice
Senior Data Security Engineer
Save Icon
Lead the implementation of Awin's data security framework and DLP controls across a complex AWS, Azure, and SaaS hybrid environment. This senior, hands-on role requires deep expertise in data classification, cloud security, and policy architecture. Enjoy a flexible four-day Flexi-Week, remote wor...
Location Icon
Location
Germany; Romania; United Kingdom; Spain; Italy; Sweden; Poland; France , Berlin; Iași; London; Madrid; Milano; München; Paris; Stockholm; Warsaw
Salary Icon
Salary
Not provided
awin.com Logo
Awin Global
Expiration Date
Until further notice
Data Engineering Manager, Analytics
Save Icon
Lead a high-performing data engineering team developing AI/ML solutions for global operations at Meta. You will architect scalable data systems, deploy advanced models, and drive measurable efficiency gains. This Menlo Park role requires expertise in ETL, distributed systems, and mentoring teams,...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
252981.00 - 284900.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Data Integration Engineer
Save Icon
Join a strategic data platform evolution for a major European energy TSO. As a Data Integration Engineer, design robust .NET solutions using REST, GraphQL, and event streaming. Work in Belgium within an Agile team, leveraging CI/CD and distributed systems expertise. French or Dutch language skill...
Location Icon
Location
Belgium
Salary Icon
Salary
Not provided
apollo-solutions.com Logo
Apollo Solutions
Expiration Date
Until further notice

About the Data Engineer - Pyspark role

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.