Data Engineer - PySpark Jobs in San Francisco, United States

63 Job Offers

Data Engineer
Join as our first Data Engineer in NYC or SF. Architect end-to-end data solutions, build scalable ETL pipelines, and manage our central data lake. We seek 5+ years of experience with Python, SQL, and cloud data stacks. Enjoy unlimited PTO, comprehensive benefits, and a competitive equity package.
Location: United States, New York City; San Francisco
Salary: 190,000 - 250,000 USD / Year
Hebbia (hebbia.ai)
Expiration Date: Until further notice

Frontend Engineer, Growth and Data
Join Hebbia's Growth and Data team as a Frontend Engineer in New York or San Francisco. You will build innovative React/TypeScript interfaces to unlock unique customer value and drive platform growth. Collaborate cross-functionally to own product experiences from ideation to launch. Enjoy top ben...
Location: United States, New York City; San Francisco
Salary: 160,000 - 300,000 USD / Year
Hebbia (hebbia.ai)
Expiration Date: Until further notice

Senior Data Engineer
Join Kiddom's mission to power personalized education with data. As a Senior Data Engineer in San Francisco, you'll design and deploy robust data infrastructure on AWS, using Python, SQL, and Golang. You'll ensure data security and PII compliance while collaborating across teams. Enjoy meaningful...
Location: United States, San Francisco
Salary: 150,000 - 220,000 USD / Year
Kiddom (kiddom.co)
Expiration Date: Until further notice

Data Engineer
Seeking a recent graduate Data Engineer for roles in Starkville, Dallas, San Francisco, or Syracuse. You will design data architecture, optimize ETL pipelines, and work with SQL, NoSQL, Scala, Java, and Python. This entry-level role requires strong data modeling skills and familiarity with Agile ...
Location: United States, Starkville; Dallas; San Francisco; Syracuse
Salary: Not provided
PhasorSoft Group (phasorsoft.com)
Expiration Date: Until further notice

Senior Data Engineer
Join Plaid in San Francisco as a Senior Data Engineer. You will build robust golden datasets and scalable pipelines using SQL, Python, DBT, and Airflow on petabyte-scale data. Drive key projects, ensure data quality, and work with technologies like Redshift and Spark. Enjoy comprehensive benefits...
Location: United States, San Francisco
Salary: 180,000 - 270,000 USD / Year
Plaid (plaid.com)
Expiration Date: Until further notice

Machine Learning Engineer - Data Foundation and AI
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in San Francisco. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience with PyTorch and distributed systems. Enjoy full benefits, equity, and a role...
Location: United States, San Francisco
Salary: 186,000 - 236,400 USD / Year
Plaid (plaid.com)
Expiration Date: Until further notice

Senior Software Engineer - Data Infrastructure
Join Plaid's Data Infrastructure team in San Francisco as a Senior Software Engineer. You will build and scale core data and ML platforms using Spark, Data Warehouses, and orchestration tools. Lead key projects, mentor others, and enable product innovation. We offer comprehensive benefits, equity...
Location: United States, San Francisco
Salary: 180,000 - 270,000 USD / Year
Plaid (plaid.com)
Expiration Date: Until further notice

Staff Machine Learning Engineer - Fraud Data
Join Plaid in San Francisco as a Staff Machine Learning Engineer focused on Fraud Data. You will design scalable ML infrastructure for fraud detection using the world's largest financial dataset. We require 8+ years of experience, including 5+ years building production ML systems with Python, PyT...
Location: United States, San Francisco
Salary: 192,000 - 400,000 USD / Year
Plaid (plaid.com)
Expiration Date: Until further notice

Data Engineer
Join Plaid's Data Engineering team in San Francisco to build robust golden datasets that power data-driven products. You'll leverage SQL, Python, and tools like DBT and Airflow to design pipelines on petabyte-scale data. Enjoy full benefits while solving complex data challenges to create insights...
Location: United States, San Francisco
Salary: 163,200 - 223,200 USD / Year
Plaid (plaid.com)
Expiration Date: Until further notice

Forward Deployed Engineer - Data-as-a-Service
Join Snorkel AI as a Forward Deployed Engineer in NYC, Redwood City, or San Francisco. This customer-facing role involves end-to-end ownership of AI/ML data pipelines, leveraging Python, SQL, and LLM workflows. Partner with top enterprises to deliver high-impact Data-as-a-Service solutions and re...
Location: United States, New York City; Redwood City; San Francisco
Salary: 172,000 - 300,000 USD / Year
Snorkel AI (snorkel.ai)
Expiration Date: Until further notice

Staff Data Engineer
Lead our data infrastructure strategy as a Staff Data Engineer in New York or San Francisco. Architect scalable pipelines with dbt, Airflow, and cloud warehouses, setting technical direction for the entire organization. Enjoy top benefits like full health insurance, 401k match, and generous stipe...
Location: United States, New York; San Francisco
Salary: 170,000 - 210,000 USD / Year
Taskrabbit (taskrabbit.com)
Expiration Date: Until further notice

Data Engineer II
Join Dedrone in San Francisco as a Data Engineer II. Transform raw sensor data into scalable insights using advanced SQL, Python, and AWS. Build ETL pipelines, analytical models, and Tableau dashboards to empower data-driven decisions. Enjoy competitive benefits including 401k match, comprehensiv...
Location: United States, San Francisco
Salary: 101,250 - 162,000 USD / Year
Axon (axon.com)
Expiration Date: Until further notice

Staff Data Platform Engineer
Lead the architecture of Vercel's next-gen Data Platform as a Principal Engineer. Design scalable, real-time systems using Kafka, ClickHouse, and modern lakehouse tech. Partner cross-functionally to define data strategy and build for AI/ML. Enjoy competitive compensation, equity, and flexible wor...
Location: United States, San Francisco; New York City
Salary: 196,000 - 294,000 USD / Year
Vercel (vercel.com)
Expiration Date: Until further notice

Senior Data Engineer
Join GoFundMe as a Senior Data Engineer in San Francisco. You will own the normalized data layer, build reliable pipelines, and optimize Snowflake performance. Leverage AI to enhance data modeling and work with a collaborative Analytics Engineering squad. Enjoy competitive pay, comprehensive heal...
Location: United States, San Francisco
Salary: 156,000 - 234,000 USD / Year
GoFundMe (gofundme.com)
Expiration Date: Until further notice

AI Data Engineer
Shape the future of influencer marketing as an AI Data Engineer in San Francisco. Build scalable data pipelines and autonomous AI agents from raw video to actionable insights. This role offers competitive equity and the chance to own architecture decisions end-to-end in a fast-paced, venture-back...
Location: United States, San Francisco Bay Area
Salary: 200,000 USD / Year
Influur (influur.com)
Expiration Date: Until further notice

Senior Software Engineer, Data Engineering
Join our team as a Senior Data Engineer in San Francisco. You will build and maintain robust data pipelines and software, collaborating with cross-functional teams. We require 5+ years of experience in data/software engineering, proficiency in Python/SQL, and modern data tools. Enjoy top benefits...
Location: United States, San Francisco
Salary: 229,500 - 280,500 USD / Year
Mixpanel (mixpanel.com)
Expiration Date: Until further notice

Robotics Data Infrastructure Engineer
Join Verne as a founding Robotics Data Infrastructure Engineer in San Francisco. Architect and deploy critical AWS and edge data pipelines for real-world robots. You'll manage large-scale multimodal datasets and build MLOps tooling, directly impacting production systems. Requires strong Python, A...
Location: United States, San Francisco
Salary: 110,000 - 175,000 USD / Year
YC Work at a Startup (workatastartup.com)
Expiration Date: Until further notice

Senior Data Engineer
Join Crusoe Energy as a Senior Data Engineer in San Francisco. Architect the foundational data platform powering AI and cloud operations. We seek expertise in Python, distributed systems, SQL, and data infrastructure. Enjoy competitive pay, equity, comprehensive health benefits, and a 401(k) match.
Location: United States, San Francisco
Salary: Not provided
Crusoe (crusoe.ai)
Expiration Date: Until further notice

Data Engineer
Join Suno's founding team as a Data Engineer in a key US tech hub. Design and scale core data foundations using SQL, Python, and modern tools like Airflow and Snowflake. Enjoy equity, unlimited PTO, and a culture passionate about music and engineering excellence.
Location: United States, Boston; NYC; Los Angeles; San Francisco
Salary: 170,000 - 240,000 USD / Year
Suno (suno.ai)
Expiration Date: Until further notice

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark builds the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals transform raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with big data technologies to solve complex data challenges at scale, this is your domain.

Typical responsibilities in this profession span the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work is writing efficient, scalable code in PySpark, the Python API for Apache Spark, to run complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate pipelines that ingest data from diverse sources, such as databases, APIs, and log files, into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Data quality and reliability are paramount: they implement data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They also optimize the performance and cost of these systems by tuning Spark jobs and automating deployment through CI/CD and Infrastructure as Code (IaC) practices.

Data Engineer - PySpark jobs call for a specific skill set. Mastery of Python and PySpark is non-negotiable, since PySpark is the primary tool for distributed data processing. Strong SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued.

Beyond technical skills, successful candidates bring strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and the collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and help establish data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the Data Engineer - PySpark jobs available and take the next step in your career.