CrawlJobs Logo
Briefcase Icon
Category Icon

Data Engineer - Pyspark Jobs

1552 Job Offers

Filters
Data Engineer
Save Icon
Seeking a recent graduate Data Engineer for roles in Starkville, Dallas, San Francisco, or Syracuse. You will design data architecture, optimize ETL pipelines, and work with SQL, NoSQL, Scala, Java, and Python. This entry-level role requires strong data modeling skills and familiarity with Agile ...
Location Icon
Location
United States , Starkville; Dallas; San Francisco; Syracuse
Salary Icon
Salary
Not provided
phasorsoft.com Logo
PhasorSoft Group
Expiration Date
Until further notice
Cloud Big-data Engineer
Save Icon
Seeking a Cloud Big-data Engineer with 4-5 years of expertise in Hadoop, Spark, and Python. The role requires deep experience with AWS/Azure ecosystems, ETL processes, and SQL/NoSQL databases. This position is open in Starkville, Dover, or Minneapolis for H1, GC, or US Citizens. Join us to design...
Location Icon
Location
United States , Starkville; Dover; Minneapolis
Salary Icon
Salary
45.00 USD / Hour
phasorsoft.com Logo
PhasorSoft Group
Expiration Date
Until further notice
Data Engineer
Save Icon
Join PixelPlex as a Data Engineer for an innovative NFT-focused BI platform. You will build robust ETL processes, ensure data quality with Great Expectations, and manage data marts. Key requirements include expert SQL, Airflow, and experience with high-load distributed systems. This role offers a...
Location Icon
Location
Salary Icon
Salary
Not provided
pixelplex.io Logo
PixelPlex
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join Plaid in San Francisco as a Senior Data Engineer. You will build robust golden datasets and scalable pipelines using SQL, Python, DBT, and Airflow on petabyte-scale data. Drive key projects, ensure data quality, and work with technologies like Redshift and Spark. Enjoy comprehensive benefits...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in San Francisco. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience with PyTorch and distributed systems. Enjoy full benefits, equity, and a role...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Senior Software Engineer - Data Infrastructure
Save Icon
Join Plaid's Data Infrastructure team in San Francisco as a Senior Software Engineer. You will build and scale core data and ML platforms using Spark, Data Warehouses, and orchestration tools. Lead key projects, mentor others, and enable product innovation. We offer comprehensive benefits, equity...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in New York. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience, proficiency in Python/PyTorch, and expertise in distributed systems and MLOps. En...
Location Icon
Location
United States , New York
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Save Icon
Join Plaid in San Francisco as a Staff Machine Learning Engineer focused on Fraud Data. You will design scalable ML infrastructure for fraud detection using the world's largest financial dataset. We require 8+ years of experience, including 5+ years building production ML systems with Python, PyT...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
192000.00 - 400000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Save Icon
Join Plaid's Fraud Data team in New York as a Staff Machine Learning Engineer. Design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. Lead the evolution of model deployment and monitoring with Python, PyTorch, and Spark. Req...
Location Icon
Location
United States , New York
Salary Icon
Salary
192000.00 - 400000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Data Engineer
Save Icon
Join Plaid's Data Engineering team in San Francisco to build robust golden datasets that power data-driven products. You'll leverage SQL, Python, and tools like DBT and Airflow to design pipelines on petabyte-scale data. Enjoy full benefits while solving complex data challenges to create insights...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
163200.00 - 223200.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Save Icon
Join Plaid's Fraud Data team in Seattle as a Staff Machine Learning Engineer. You will design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. This role requires 8+ years of experience, including 5+ years deploying production...
Location Icon
Location
United States , Seattle
Salary Icon
Salary
192000.00 - 400000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Save Icon
Join Plaid's Fraud Data team as a Staff Machine Learning Engineer in Washington DC. You will design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. This role requires 8+ years of experience, including 5+ years deploying prod...
Location Icon
Location
United States , Washington DC
Salary Icon
Salary
192000.00 - 400000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Principal Data Engineer
Save Icon
Lead the design of scalable, real-time data pipelines using Kafka, Flink, and Spark Streaming in this hands-on Principal Data Engineer role. Drive technical excellence, mentor a team, and shape the data ecosystem with modern cloud technologies. Enjoy comprehensive benefits from Day 1, including f...
Location Icon
Location
United States
Salary Icon
Salary
183200.00 - 203500.00 USD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Senior Software Engineer, Core Data
Save Icon
Join Pomelo as a Senior Software Engineer, Core Data in the United States. Design and scale robust data infrastructure using SQL, Python, dbt, and Dagster. Mentor engineers and build pipelines that deliver actionable insights for data-driven products. Enjoy competitive equity, unlimited vacation,...
Location Icon
Location
United States
Salary Icon
Salary
190000.00 - 220000.00 USD / Year
pomelocare.com Logo
Pomelo Care
Expiration Date
Until further notice
Data Engineering & Analytics Lead
Save Icon
Lead our data evolution as a hands-on Data Engineering & Analytics Lead in Brooklyn. You will architect a modern data ecosystem, build pipelines, and drive analytics to enhance patient care and operations. This role blends technical leadership with implementation, requiring strong data engineerin...
Location Icon
Location
United States , Brooklyn
Salary Icon
Salary
Not provided
premiumhealth.org Logo
Premium Health
Expiration Date
Until further notice
Data Scientist (Machine Learning Engineer- CGM Algorithm Dev.)
Save Icon
Join a pioneering team in Basel developing life-changing CGM algorithms. Utilize Python and advanced ML (TensorFlow/PyTorch) to transform sensor data into clinical insights. A Master's/PhD in a quantitative field and experience with time-series data are essential. Collaborate in an Agile, multidi...
Location Icon
Location
Switzerland , Basel
Salary Icon
Salary
Not provided
proclinical.com Logo
Proclinical
Expiration Date
Until further notice
Data Engineer
Save Icon
Join our Rally Engineering team in Banbury as a Data Engineer. You will be responsible for the vehicle's data architecture, analysis, and acquisition hardware at World Rally Raid Championship events. The role requires proficiency in tools like Motec i2 and a degree in a relevant engineering disci...
Location Icon
Location
United Kingdom , Banbury
Salary Icon
Salary
Not provided
prodrive.com Logo
Prodrive
Expiration Date
Until further notice
Big Data Engineer
Save Icon
Join our team in St. Louis as a Big Data Engineer. You will design and implement optimal data solutions using Hadoop, Spark, and Kafka. Your role involves building ETL processes, managing data infrastructure, and guiding technology design. A relevant degree and 2-4 years of hands-on experience wi...
Location Icon
Location
United States , St. Louis
Salary Icon
Salary
Not provided
protocolinfotech.com Logo
Protocol Infotech
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join Provectus as a Senior Data Engineer to drive AI transformations across diverse industries. Design and build scalable data pipelines using AWS, Python, SQL, and tools like Airflow and Spark. Enjoy long-term B2B collaboration, medical/sports compensation, and educational opportunities in a dyn...
Location Icon
Location
Salary Icon
Salary
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join Provectus as a Senior Data Engineer and build innovative data platforms using AWS, Python, and modern tools like Airflow and Kafka. Design scalable ETL pipelines and data APIs while collaborating with ML engineers on AI-driven solutions. Enjoy a 100% remote role with flexible hours, professi...
Location Icon
Location
Salary Icon
Salary
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary