Filters

Cities

Work Mode

Data Engineer - Pyspark United States, San Francisco Jobs (Hybrid work)

45 Job Offers

Filters

Staff Data Engineer

Lead our data infrastructure strategy as a Staff Data Engineer in New York or San Francisco. Architect scalable pipelines with dbt, Airflow, and cloud warehouses, setting technical direction for the entire organization. Enjoy top benefits like full health insurance, 401k match, and generous stipe...

Location

United States , New York; San Francisco

Salary

170000.00 - 210000.00 USD / Year

Taskrabbit

Expiration Date

Until further notice

Data Engineer II

Join Dedrone in San Francisco as a Data Engineer II. Transform raw sensor data into scalable insights using advanced SQL, Python, and AWS. Build ETL pipelines, analytical models, and Tableau dashboards to empower data-driven decisions. Enjoy competitive benefits including 401k match, comprehensiv...

Location

United States , San Francisco

Salary

101250.00 - 162000.00 USD / Year

Axon

Expiration Date

Until further notice

Staff Data Platform Engineer

Lead the architecture of Vercel's next-gen Data Platform as a Principal Engineer. Design scalable, real-time systems using Kafka, ClickHouse, and modern lakehouse tech. Partner cross-functionally to define data strategy and build for AI/ML. Enjoy competitive compensation, equity, and flexible wor...

Location

United States , San Francisco; New York City

Salary

196000.00 - 294000.00 USD / Year

Vercel

Expiration Date

Until further notice

Senior Data Engineer

Join GoFundMe as a Senior Data Engineer in San Francisco. You will own the normalized data layer, build reliable pipelines, and optimize Snowflake performance. Leverage AI to enhance data modeling and work with a collaborative Analytics Engineering squad. Enjoy competitive pay, comprehensive heal...

Location

United States , San Francisco

Salary

156000.00 - 234000.00 USD / Year

GoFundMe

Expiration Date

Until further notice

Staff Data Platform Engineer

Location

United States , San Francisco; New York City

Salary

196000.00 - 294000.00 USD / Year

Vercel

Expiration Date

Until further notice

1 2 3

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.

Filters

Countries

United States (710)
Canada (1)

Location

Salary

All (710)

Specified salary (644)