CrawlJobs Logo
Briefcase Icon
Category Icon

Data Engineer - Pyspark Jobs (On-site work)

452 Job Offers

Filters
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in San Francisco. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience with PyTorch and distributed systems. Enjoy full benefits, equity, and a role...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Senior Software Engineer - Data Infrastructure
Save Icon
Join Plaid's Data Infrastructure team in San Francisco as a Senior Software Engineer. You will build and scale core data and ML platforms using Spark, Data Warehouses, and orchestration tools. Lead key projects, mentor others, and enable product innovation. We offer comprehensive benefits, equity...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in New York. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience, proficiency in Python/PyTorch, and expertise in distributed systems and MLOps. En...
Location Icon
Location
United States , New York
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Data Engineer
Save Icon
Join Plaid's Data Engineering team in San Francisco to build robust golden datasets that power data-driven products. You'll leverage SQL, Python, and tools like DBT and Airflow to design pipelines on petabyte-scale data. Enjoy full benefits while solving complex data challenges to create insights...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
163200.00 - 223200.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our Rally Engineering team in Banbury as a Data Engineer. You will be responsible for the vehicle's data architecture, analysis, and acquisition hardware at World Rally Raid Championship events. The role requires proficiency in tools like Motec i2 and a degree in a relevant engineering disci...
Location Icon
Location
United Kingdom , Banbury
Salary Icon
Salary
Not provided
prodrive.com Logo
Prodrive
Expiration Date
Until further notice
Big Data Engineer
Save Icon
Join our team in St. Louis as a Big Data Engineer. You will design and implement optimal data solutions using Hadoop, Spark, and Kafka. Your role involves building ETL processes, managing data infrastructure, and guiding technology design. A relevant degree and 2-4 years of hands-on experience wi...
Location Icon
Location
United States , St. Louis
Salary Icon
Salary
Not provided
protocolinfotech.com Logo
Protocol Infotech
Expiration Date
Until further notice
Sr Data Engineer
Save Icon
Join our team in Irving, Texas, as a Senior Data Engineer. You will architect large-scale data solutions, optimize pipelines, and leverage Azure Synapse, Data Factory, and Hadoop. This role requires 6+ years of expertise in ETL, data modeling, and cloud technologies to drive data reliability and ...
Location Icon
Location
United States , Irving
Salary Icon
Salary
Not provided
rigusinc.com Logo
Resource Informatics Group
Expiration Date
Until further notice
Data Engineer – Snowflake & ETL
Save Icon
Join our team in Hyderabad as a Data Engineer, specializing in Snowflake and ETL. You will leverage your 5+ years of experience in SQL, Matillion ETL, and cloud platforms (AWS/Azure/GCP) to build robust data solutions. Expertise in Python, API integrations, and data governance is key. SnowPro cer...
Location Icon
Location
India , Hyderabad
Salary Icon
Salary
Not provided
rightanglesol.com Logo
Right Angle Solutions
Expiration Date
Until further notice
Senior Software Engineer, Data Engineering
Save Icon
Join our elite team to democratize finance as a Senior Data Engineer in Menlo Park. Build foundational datasets using Python, Spark, and Airflow to power analytics and machine learning. We seek an expert with 5+ years crafting scalable data pipelines and strong SQL skills. Enjoy top-tier benefits...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
146000.00 - 198000.00 USD / Year
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Software Engineer, Data Engineering
Save Icon
Join Robinhood in Toronto to shape the future of finance as a Data Engineer. You will build scalable data pipelines using Python, Spark, and Airflow to democratize data across the company. This role requires 3+ years of experience in end-to-end pipeline development and strong SQL skills. We offer...
Location Icon
Location
Canada , Toronto
Salary Icon
Salary
124000.00 - 145000.00 CAD / Year
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Data Engineer II
Save Icon
Join our Warsaw team as a Data Engineer II, specializing in data migration and conversion for telecommunications systems. You will utilize ESRI, FME, SQL, and Python to transform data into our proprietary 3-GIS model. This role requires strong analytical skills and offers the opportunity to work ...
Location Icon
Location
Poland , Warsaw
Salary Icon
Salary
Not provided
sspinnovations.com Logo
SSP Innovations
Expiration Date
Until further notice
Technology Services Engineer – Data Protection & Disaster Recovery
Save Icon
Seeking a Data Protection & Disaster Recovery Engineer in Alpharetta, GA. This full-time, on-site role requires deep Veeam expertise to design 3-2-1 strategies, ensure compliance, and manage backup/DR for MSP clients. Ideal candidates have 2+ years in an MSP with strong Windows Server and automat...
Location Icon
Location
United States , Alpharetta, Georgia
Salary Icon
Salary
Not provided
tier4group.com Logo
Tier4 Group
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our team in Barcelona as a Data Engineer. You will build and scale robust data pipelines using Python and SQL, ensuring our DWH is reliable and efficient. Partner with analysts to support analytics needs and govern data integration from diverse sources. We offer competitive pay, equity, heal...
Location Icon
Location
Spain , Barcelona
Salary Icon
Salary
Not provided
yokoy.io Logo
Yokoy
Expiration Date
Until further notice
AI Research Engineer, Data Infrastructure
Save Icon
Join our team in Palo Alto as an AI Research Engineer, Data Infrastructure. You will design and build the core data engine for our humanoid robot fleet, creating scalable pipelines for collection, querying, and training. Your work will involve ETL automation, dataset tooling, and ML models for au...
Location Icon
Location
United States , Palo Alto
Salary Icon
Salary
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Data Engineer
Save Icon
Join Adyen in Amsterdam as a Data Engineer II, shaping the backbone of our data-driven insights. You will design and build high-quality ELT pipelines using Python, PySpark, and Airflow on our Big Data Platform. This role requires 3+ years of experience and offers the chance to champion data best ...
Location Icon
Location
Netherlands , Amsterdam
Salary Icon
Salary
Not provided
adyen.com Logo
Adyen
Expiration Date
Until further notice
Data Platform Engineer
Save Icon
Join Adyen in Amsterdam to build and scale our massive on-premise Big Data Platforms. You will develop foundational tooling using Python, Spark, and Kubernetes to support thousands of daily jobs. Collaborate with data and ML teams to enhance data discoverability and platform performance.
Location Icon
Location
Netherlands , Amsterdam
Salary Icon
Salary
Not provided
adyen.com Logo
Adyen
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our team in Brussels as a Data Engineer. You will modernize the GSmart application suite, focusing on a Kimball-style Data Warehouse and ETL pipelines using Microsoft SQL Server, SSIS, and SSAS. This role involves agile development, building reporting solutions, and contributing to scalable ...
Location Icon
Location
Belgium , Brussels
Salary Icon
Salary
Not provided
airswift.com Logo
Airswift Sweden
Expiration Date
Until further notice
Gcp Data Engineer
Save Icon
Join AlgebraIT in Austin as a GCP Data Engineer. Design robust data systems using BigQuery, Dataflow, and Pub/Sub. Build scalable pipelines, ensure data governance, and collaborate with stakeholders. Requires 3+ years of GCP experience, Python, SQL, and a Computer Science degree.
Location Icon
Location
United States , Austin
Salary Icon
Salary
Not provided
algebrait.com Logo
AlgebraIT
Expiration Date
Until further notice
Power Methodology Engineer, Data Center Hardware IPs
Save Icon
Seeking a Senior Power Methodology Engineer in Santa Clara to optimize power efficiency for cutting-edge data center hardware IPs. You will develop power models and algorithms for AI/ML accelerators, GPUs, and CPUs using tools like PowerArtist. This role requires expertise in ASIC/SoC power analy...
Location Icon
Location
United States , Santa Clara
Salary Icon
Salary
191040.00 - 286560.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Is Data Center Operations Engineer
Save Icon
Join Amgen as a Data Center Operations Engineer in New Albany. Bridge IT and MEP systems to ensure operational continuity and reliability. You will install, cable, and support enterprise hardware while utilizing AI-enabled monitoring. This role offers a comprehensive benefits package and requires...
Location Icon
Location
United States , New Albany
Salary Icon
Salary
91731.00 - 114948.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary