
Data Engineer - PySpark Jobs

1580 Job Offers

Data Infrastructure Engineer
Join a venture-backed AI & national security startup as a Data Infrastructure Engineer. Design and deploy secure, scalable infrastructure for ML applications in New York City. Work on mission-critical systems requiring expertise in Python/Go, data pipelines, and ML Ops. Receive significant equity...
Location
United States, New York City Metropolitan Area
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Staff Data Engineer
Join a VC-backed retail AI scale-up as a Staff Data Engineer. Architect and scale the data backbone for cutting-edge AI systems using deep Spark expertise. Build high-performance, cloud-based data pipelines in a fully remote role with great equity. Strong Python and distributed computing skills a...
Location
United States
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Senior Data Engineer
Seeking a Senior Data Engineer with expertise in Actuarial/Asset Management Excel files for a UK-based financial services role. Utilize Python, SQL, GCP, and Databricks to build end-to-end data pipelines on complex datasets. Ideal candidates have investment domain knowledge and experience automat...
Location
United Kingdom
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Electrical Commissioning Engineer – Data Centre Systems
Join a leading team as an Electrical Commissioning Engineer for critical data centre systems. You will commission integrated changeover panels and generator controls in live environments, primarily around London with some EU travel. The role requires electrical engineering experience and knowledg...
Location
United Kingdom, London
Salary
60000.00 - 70000.00 GBP / Year
Perigon Search
Expiration Date
Until further notice
Service & Commissioning Engineer – Data Centre Cooling
Join a global leader in Madrid's booming data centre sector. Commission and service advanced liquid cooling systems for hyperscale and AI infrastructure. Utilize your HVAC/mechanical expertise in a customer-facing role with European travel. Enjoy a competitive package, extensive benefits, and cle...
Location
Spain, Madrid
Salary
55000.00 - 65000.00 EUR / Year
Perigon Search
Expiration Date
Until further notice
Principal Data Engineer
Lead the technical strategy for our data platform as a Principal Data Engineer. This hands-on leadership role requires deep expertise in Databricks, PySpark, and cloud platforms (AWS/Azure). You will design scalable architectures, ensure data governance, and mentor engineers. This is a hybrid rol...
Location
United Kingdom, Thame; Leeds
Salary
80000.00 - 100000.00 GBP / Year
PEXA UK
Expiration Date
Until further notice
Senior Data Engineer
Seeking a Senior Data Engineer in Flowood, USA, to design and manage cloud data pipelines on Snowflake/Azure. You will develop ETL processes, ensure data governance, and optimize for performance. Expertise in SQL, data modeling, and visualization tools like Tableau/Power BI is essential.
Location
United States, Flowood
Salary
Not provided
PhasorSoft Group
Expiration Date
Until further notice
Senior Big Data Engineer
Join our team in Flowood as a Senior Big Data Engineer. Design and develop scalable data pipelines using Python, PySpark, and technologies like Hadoop and Spark. You will build ETL processes, implement ML models, and ensure data governance. Cloud platform experience is a valuable asset.
Location
United States, Flowood
Salary
Not provided
PhasorSoft Group
Expiration Date
Until further notice
Data Engineer
Seeking a recent graduate Data Engineer for roles in Starkville, Dallas, San Francisco, or Syracuse. You will design data architecture, optimize ETL pipelines, and work with SQL, NoSQL, Scala, Java, and Python. This entry-level role requires strong data modeling skills and familiarity with Agile ...
Location
United States, Starkville; Dallas; San Francisco; Syracuse
Salary
Not provided
PhasorSoft Group
Expiration Date
Until further notice
Cloud Big-data Engineer
Seeking a Cloud Big-data Engineer with 4-5 years of expertise in Hadoop, Spark, and Python. The role requires deep experience with AWS/Azure ecosystems, ETL processes, and SQL/NoSQL databases. This position is open in Starkville, Dover, or Minneapolis for H1, GC, or US Citizens. Join us to design...
Location
United States, Starkville; Dover; Minneapolis
Salary
45.00 USD / Hour
PhasorSoft Group
Expiration Date
Until further notice
Data Engineer
Join PixelPlex as a Data Engineer for an innovative NFT-focused BI platform. You will build robust ETL processes, ensure data quality with Great Expectations, and manage data marts. Key requirements include expert SQL, Airflow, and experience with high-load distributed systems. This role offers a...
Location
Salary
Not provided
PixelPlex
Expiration Date
Until further notice
Senior Data Engineer
Join Plaid in San Francisco as a Senior Data Engineer. You will build robust golden datasets and scalable pipelines using SQL, Python, DBT, and Airflow on petabyte-scale data. Drive key projects, ensure data quality, and work with technologies like Redshift and Spark. Enjoy comprehensive benefits...
Location
United States, San Francisco
Salary
180000.00 - 270000.00 USD / Year
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in San Francisco. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience with PyTorch and distributed systems. Enjoy full benefits, equity, and a role...
Location
United States, San Francisco
Salary
186000.00 - 236400.00 USD / Year
Plaid
Expiration Date
Until further notice
Senior Software Engineer - Data Infrastructure
Join Plaid's Data Infrastructure team in San Francisco as a Senior Software Engineer. You will build and scale core data and ML platforms using Spark, Data Warehouses, and orchestration tools. Lead key projects, mentor others, and enable product innovation. We offer comprehensive benefits, equity...
Location
United States, San Francisco
Salary
180000.00 - 270000.00 USD / Year
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in New York. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience, proficiency in Python/PyTorch, and expertise in distributed systems and MLOps. En...
Location
United States, New York
Salary
186000.00 - 236400.00 USD / Year
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Join Plaid in San Francisco as a Staff Machine Learning Engineer focused on Fraud Data. You will design scalable ML infrastructure for fraud detection using the world's largest financial dataset. We require 8+ years of experience, including 5+ years building production ML systems with Python, PyT...
Location
United States, San Francisco
Salary
192000.00 - 400000.00 USD / Year
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Join Plaid's Fraud Data team in New York as a Staff Machine Learning Engineer. Design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. Lead the evolution of model deployment and monitoring with Python, PyTorch, and Spark. Req...
Location
United States, New York
Salary
192000.00 - 400000.00 USD / Year
Plaid
Expiration Date
Until further notice
Data Engineer
Join Plaid's Data Engineering team in San Francisco to build robust golden datasets that power data-driven products. You'll leverage SQL, Python, and tools like DBT and Airflow to design pipelines on petabyte-scale data. Enjoy full benefits while solving complex data challenges to create insights...
Location
United States, San Francisco
Salary
163200.00 - 223200.00 USD / Year
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Join Plaid's Fraud Data team in Seattle as a Staff Machine Learning Engineer. You will design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. This role requires 8+ years of experience, including 5+ years deploying production...
Location
United States, Seattle
Salary
192000.00 - 400000.00 USD / Year
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Join Plaid's Fraud Data team as a Staff Machine Learning Engineer in Washington DC. You will design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. This role requires 8+ years of experience, including 5+ years deploying prod...
Location
United States, Washington DC
Salary
192000.00 - 400000.00 USD / Year
Plaid
Expiration Date
Until further notice
Looking for Data Engineer - PySpark jobs? A Data Engineer specializing in PySpark builds the data infrastructure that powers analytics, machine learning, and business intelligence. These engineers turn raw, often unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you want to work with big data technologies and solve complex data challenges at scale, this is your domain.
Typical responsibilities span the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work is writing efficient, scalable code in PySpark, the Python API for Apache Spark, to run complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate pipelines that ingest data from diverse sources, such as databases, APIs, and log files, into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Data quality and reliability are paramount: they implement data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They also optimize the performance and cost of these systems, tune Spark jobs for efficiency, and automate deployment through CI/CD and Infrastructure as Code (IaC) practices.
To excel in Data Engineer - PySpark jobs, a specific skill set is required. Mastery of Python and PySpark is essential, as PySpark is the primary tool for distributed data processing in these roles. Strong SQL is needed for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued.
Beyond technical skills, successful candidates bring strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and the collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and help establish data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the Data Engineer - PySpark jobs available and take the next step in your career.
