Data Engineer - PySpark Jobs in New York, United States

59 Job Offers

Data Engineer
Join Figma's mission to make design accessible as a Data Engineer in San Francisco or New York. You will own and scale data pipelines using SQL, Python, and tools like Snowflake and dbt. Partner with Data Science and business teams to build foundational datasets that drive growth, supported by ex...
Location
United States, San Francisco; New York
Salary
164000.00 - 338000.00 USD / Year
Figma
Expiration Date
Until further notice
Software Engineer, Data Infrastructure
Join Figma's Data Infrastructure team in San Francisco or New York. Design and build scalable distributed data systems using technologies like Spark, Kafka, and Snowflake. You'll empower AI, analytics, and business intelligence across the company. Enjoy top-tier benefits, equity, and a culture of...
Location
United States, San Francisco; New York
Salary
149000.00 - 350000.00 USD / Year
Figma
Expiration Date
Until further notice
Backend Engineer, Growth and Data
Join Hebbia's Growth and Data team as a Backend Engineer in New York City or San Francisco. You will architect high-scale backend systems, APIs, and infrastructure using Python/Java/Go and AWS. Build solutions for universal indexing and performance optimization while enjoying top benefits like un...
Location
United States, New York City; San Francisco
Salary
160000.00 - 300000.00 USD / Year
Hebbia
Expiration Date
Until further notice
Data Engineer
Join as our first Data Engineer in NYC or SF. Architect end-to-end data solutions, build scalable ETL pipelines, and manage our central data lake. We seek 5+ years of experience with Python, SQL, and cloud data stacks. Enjoy unlimited PTO, comprehensive benefits, and a competitive equity package.
Location
United States, New York City; San Francisco
Salary
190000.00 - 250000.00 USD / Year
Hebbia
Expiration Date
Until further notice
Frontend Engineer, Growth and Data
Join Hebbia's Growth and Data team as a Frontend Engineer in New York or San Francisco. You will build innovative React/TypeScript interfaces to unlock unique customer value and drive platform growth. Collaborate cross-functionally to own product experiences from ideation to launch. Enjoy top ben...
Location
United States, New York City; San Francisco
Salary
160000.00 - 300000.00 USD / Year
Hebbia
Expiration Date
Until further notice
Data Infrastructure Engineer
Join a dynamic AI startup in New York or DC as a Data Infrastructure Engineer. Design secure APIs, deploy ML models at scale, and build critical data platforms using Python/Go. Enjoy a fast-paced environment with competitive salary and equity.
Location
United States, New York or DC
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Data Infrastructure Engineer
Join a pioneering AI startup as a Data Infrastructure Engineer in New York or DC. You will design secure data middleware and deploy ML services at scale, using Python/Go. This hybrid role offers equity and a chance to shape the core data platform in a fast-paced, innovative environment.
Location
United States, New York or DC
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Data Infrastructure Engineer
Join a dynamic NYC/DC startup as a Data Infrastructure Engineer. Build cutting-edge AI platforms, deploy LLMs at scale, and manage the full data stack using Python/Go. Thrive in a fast-paced environment with equity ownership.
Location
United States, New York or DC
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Data Infrastructure Engineer
Join a dynamic NYC/DC startup as a Data Infrastructure Engineer. Design secure APIs, deploy ML models at scale, and manage production data using Python/Go/C. Gain equity while building cutting-edge AI platforms in a fast-paced, collaborative environment.
Location
United States, New York or DC
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Lead Data Integration Specialist / Senior Full Stack Engineer
Lead the design of a scalable data integration platform for a dynamic identity risk solutions company. Utilize your 10+ years of full-stack expertise in Python/Django and TypeScript/React to build and mentor. Enjoy a flexible New York-based role with unlimited PTO and significant growth potential.
Location
United States, New York
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Data Infrastructure Engineer
Join a venture-backed AI & national security startup as a Data Infrastructure Engineer. Design and deploy secure, scalable infrastructure for ML applications in New York City. Work on mission-critical systems requiring expertise in Python/Go, data pipelines, and ML Ops. Receive significant equity...
Location
United States, New York City Metropolitan Area
Salary
Not provided
Orbis Consultants
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in New York. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience, proficiency in Python/PyTorch, and expertise in distributed systems and MLOps. En...
Location
United States, New York
Salary
186000.00 - 236400.00 USD / Year
Plaid
Expiration Date
Until further notice
Staff Machine Learning Engineer - Fraud Data
Join Plaid's Fraud Data team in New York as a Staff Machine Learning Engineer. Design and build scalable ML infrastructure for cutting-edge fraud detection, leveraging the world's largest financial dataset. Lead the evolution of model deployment and monitoring with Python, PyTorch, and Spark. Req...
Location
United States, New York
Salary
192000.00 - 400000.00 USD / Year
Plaid
Expiration Date
Until further notice
Senior Data Engineer
Join Rearc as a Senior Data Engineer in New York. Leverage your 8+ years of expertise in cloud data architecture, ETL, and Python to build scalable data pipelines and warehouses. Drive technical excellence using Spark, Databricks, and modern frameworks while enjoying comprehensive health benefits...
Location
United States, New York
Salary
160000.00 - 200000.00 USD / Year
Rearc
Expiration Date
Until further notice
Forward Deployed Engineer - Data-as-a-Service
Join Snorkel AI as a Forward Deployed Engineer in NYC, Redwood City, or San Francisco. This customer-facing role involves end-to-end ownership of AI/ML data pipelines, leveraging Python, SQL, and LLM workflows. Partner with top enterprises to deliver high-impact Data-as-a-Service solutions and re...
Location
United States, New York City; Redwood City; San Francisco
Salary
172000.00 - 300000.00 USD / Year
Snorkel AI
Expiration Date
Until further notice
Senior Data Engineer - Platform Enablement
Join SoundCloud's Platform Enablement team as a Senior Data Engineer. You will design scalable data pipelines using SQL, Apache Airflow, and GCP/AWS. This remote East Coast role offers a fast-paced environment, impactful projects, and comprehensive benefits including generous PTO and parental leave.
Location
United States, New York; Atlanta; East Coast
Salary
160000.00 - 210000.00 USD / Year
SoundCloud
Expiration Date
Until further notice
Staff Data Engineer
Lead our data infrastructure strategy as a Staff Data Engineer in New York or San Francisco. Architect scalable pipelines with dbt, Airflow, and cloud warehouses, setting technical direction for the entire organization. Enjoy top benefits like full health insurance, 401k match, and generous stipe...
Location
United States, New York; San Francisco
Salary
170000.00 - 210000.00 USD / Year
Taskrabbit
Expiration Date
Until further notice
Data and BI Engineer
Join our data-driven team as a Data & BI Engineer in New York. Design and optimize scalable ETL pipelines, develop Power BI dashboards, and leverage Snowflake and Python. We seek 3+ years' experience in data warehousing, BI tools, and cloud platforms. Enjoy weekly pay, 401K matching, and comprehe...
Location
United States, New York
Salary
110000.00 - 120000.00 USD / Year
American Food & Vending
Expiration Date
Until further notice
Full Stack Developer - Data Engineering & GenAI Applications
Join our team in New York as a Full Stack Developer, specializing in Data Engineering and GenAI applications. You will build scalable solutions using Python, React, and Azure, integrating AI APIs to automate enterprise workflows. This role offers a chance to lead technical roadmaps and work on cu...
Location
United States, New York
Salary
Not provided
Phaidon International
Expiration Date
Until further notice
Staff Data Platform Engineer
Lead the architecture of Vercel's next-gen Data Platform as a Principal Engineer. Design scalable, real-time systems using Kafka, ClickHouse, and modern lakehouse tech. Partner cross-functionally to define data strategy and build for AI/ML. Enjoy competitive compensation, equity, and flexible wor...
Location
United States, San Francisco; New York City
Salary
196000.00 - 294000.00 USD / Year
Vercel
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals transform raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with big data technologies to solve complex data challenges at scale, this is your domain.

Typical responsibilities in this profession span the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code in PySpark, the Python API for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources (such as databases, APIs, and log files) into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount: they implement data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They are also tasked with optimizing the performance and cost of these data systems, tuning Spark jobs for efficiency, and automating deployment through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific skill set is required. Mastery of Python and PySpark is non-negotiable, as PySpark is the primary tool for distributed data processing. Strong knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical skills, successful candidates have strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and the collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and help establish data engineering best practices and standards across an organization.

If you are ready to build the future of data, explore the Data Engineer - PySpark jobs available and take the next step in your career.