Filters

Cities

Work Mode

Data Engineer - Pyspark United States, New York City Jobs (On-site work)

7 Job Offers

Filters

New

Founding Software Engineer - Data Platform

Join Arlo as a Founding Software Engineer for our Data Platform in NYC. Build scalable data infrastructure and ETL pipelines for a revolutionary ML-driven health insurance system. We seek a backend/data engineer with expertise in Python, Spark, and cloud platforms. Enjoy high ownership, equity, a...

Location

United States , New York City

Salary

200000.00 - 220000.00 USD / Year

Arlo

Expiration Date

Until further notice

New

Data Engineer

Join Profound in NYC as a Data Engineer to own and scale our AI data platform. You will build and optimize pipelines using Python, SQL, dbt, Dagster, and AWS. Support ML workflows and ensure data quality for our product and data science teams. Competitive compensation, equity, and benefits offered.

Location

United States , New York City

Salary

140000.00 - 260000.00 USD / Year

Profound

Expiration Date

Until further notice

New

Software Engineer, Data Platform

Join Profound in NYC as a Software Engineer for our Data Platform. Design and scale core infrastructure using Snowflake, ClickHouse, and AWS to empower data science and ML teams. You will build tools for data quality, governance, and MLOps in a fast-paced, impact-driven environment. This role inc...

Location

United States , New York City

Salary

140000.00 - 260000.00 USD / Year

Profound

Expiration Date

Until further notice

Backend Engineer, Growth and Data

Join Hebbia's Growth and Data team as a Backend Engineer in New York City or San Francisco. You will architect high-scale backend systems, APIs, and infrastructure using Python/Java/Go and AWS. Build solutions for universal indexing and performance optimization while enjoying top benefits like un...

Location

United States , New York City; San Francisco

Salary

160000.00 - 300000.00 USD / Year

Hebbia

Expiration Date

Until further notice

Data Engineer

Join as our first Data Engineer in NYC or SF. Architect end-to-end data solutions, build scalable ETL pipelines, and manage our central data lake. We seek 5+ years of experience with Python, SQL, and cloud data stacks. Enjoy unlimited PTO, comprehensive benefits, and a competitive equity package.

Location

United States , New York City; San Francisco

Salary

190000.00 - 250000.00 USD / Year

Hebbia

Expiration Date

Until further notice

Frontend Engineer, Growth and Data

Join Hebbia's Growth and Data team as a Frontend Engineer in New York or San Francisco. You will build innovative React/TypeScript interfaces to unlock unique customer value and drive platform growth. Collaborate cross-functionally to own product experiences from ideation to launch. Enjoy top ben...

Location

United States , New York City; San Francisco

Salary

160000.00 - 300000.00 USD / Year

Hebbia

Expiration Date

Until further notice

Data Engineer Co-op Intern

Join Amazon as a Data Engineer Co-op Intern in a full-time, in-office role. Design automated data pipelines, optimize data warehouses, and utilize SQL and Python. This 12-week internship is for students in a US co-op program, with multiple location options across the United States.

Location

United States , Seattle; Bellevue; Redmond; San Francisco; Sunnyvale; Santa Clara; DC; MD; VA; Austin; New York City; Minneapolis

Salary

101300.00 - 160000.00 USD / Year

Amazon Pforzheim GmbH

Expiration Date

Until further notice

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.

Filters

Location

Salary

All (132)

Specified salary (116)