CrawlJobs Logo
Briefcase Icon
Category Icon

Filters

×
Cities

Data Engineer - Pyspark United States Jobs (On-site work)

178 Job Offers

Filters
Data Engineer SME
Save Icon
Seeking a Senior Data Engineer with an active TS/SCI clearance in Chantilly. You will design and implement complex ETL pipelines and data solutions within secure, classified government environments. This role requires expertise in Python, SQL, Elasticsearch, and data pipeline technologies like Ap...
Location Icon
Location
United States , Chantilly
Salary Icon
Salary
Not provided
anavationllc.com Logo
AnaVation
Expiration Date
Until further notice
Senior Manager, Data Engineering
Save Icon
Lead and build a world-class data engineering team in San Jose. You will design scalable cloud infrastructure (AWS/GCP/Azure) for data warehousing, ETL, and AI deployment. This senior role requires 6+ years of experience, including 3+ years leading teams and expertise in data governance.
Location Icon
Location
United States , San Jose
Salary Icon
Salary
240840.00 - 307600.00 USD / Year
archer.com Logo
Archer Aviation
Expiration Date
Until further notice
Data Engineer - Regulatory Reporting
Save Icon
Seeking an experienced Data Engineer to automate regulatory reporting in New York. You will design scalable Python/SQL solutions, processing large datasets in Snowflake to generate compliant reports. This tech lead role partners with Finance and Compliance teams, requiring strong communication an...
Location Icon
Location
United States , New York
Salary Icon
Salary
140000.00 - 190000.00 USD / Year
clearstreet.io Logo
Clear Street
Expiration Date
Until further notice
Software Engineer - Market Data
Save Icon
Join our New York team as a Software Engineer specializing in Market Data. Design and optimize high-performance, low-latency data pipelines for equities, options, and futures. Leverage your 8+ years of experience in distributed systems and real-time data processing. We offer competitive compensat...
Location Icon
Location
United States , New York
Salary Icon
Salary
200000.00 - 250000.00 USD / Year
clearstreet.io Logo
Clear Street
Expiration Date
Until further notice
Backend Software Engineer - Reference Data Services
Save Icon
Join Clear Street in NYC as a Backend Software Engineer on the FACT Team. Design and build highly scalable Golang services for critical reference data products. You'll tackle complex distributed systems challenges, mentor teammates, and enjoy top-tier benefits like equity and comprehensive insura...
Location Icon
Location
United States , New York
Salary Icon
Salary
200000.00 - 250000.00 USD / Year
clearstreet.io Logo
Clear Street
Expiration Date
Until further notice
Data Engineer
Save Icon
Seeking a skilled Data Engineer in Plano to build and manage CDC pipelines for data lake hydration. You will design ETL processes using Apache Spark for both streaming and batch data, transforming raw data into analytics-ready formats. This role requires strong Java, Python (PySpark), and AWS exp...
Location Icon
Location
United States , Plano
Salary Icon
Salary
Not provided
enormousenterprise.com Logo
Enormous Enterprise
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join Figure, an AI Robotics pioneer, as a Senior Data Engineer in San Jose. Develop and optimize data pipelines to extract insights from robotic system logs, using Python and modern data stacks. Your work will directly impact the performance of our humanoid robot and inform critical engineering d...
Location Icon
Location
United States , San Jose
Salary Icon
Salary
140000.00 - 350000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Staff Data Engineer
Save Icon
Lead the data platform powering our humanoid robot fleet in San Jose. As a Staff Data Engineer, you'll architect scalable systems for telemetry, analytics, and release validation. Ideal candidates have deep expertise in Python, modern data stacks, and building platforms for robotics or autonomous...
Location Icon
Location
United States , San Jose
Salary Icon
Salary
150000.00 - 250000.00 USD / Year
figure.ai Logo
Figure
Expiration Date
Until further notice
Senior Software Engineer, Data Products
Save Icon
Join our Los Angeles team as a Senior Software Engineer for Data Products. You will design scalable AI solutions for live/VOD streaming, leveraging generative AI, LLMs, and multi-cloud platforms (AWS, GCP). Apply your expertise in distributed systems, PyTorch/TensorFlow, and vector databases to i...
Location Icon
Location
United States , Los Angeles
Salary Icon
Salary
143000.00 - 180000.00 USD / Year
foxcorporation.com Logo
Fox Corporation
Expiration Date
Until further notice
Senior Software Engineer, Data Products
Save Icon
Join our Los Angeles team as a Senior Software Engineer for Data Products. You will design scalable AI solutions for live/VOD streaming, leveraging generative AI, LLMs, and multi-cloud platforms like AWS/GCP. Apply your expertise in distributed systems, TensorFlow/PyTorch, and vector databases to...
Location Icon
Location
United States , Los Angeles
Salary Icon
Salary
143000.00 - 180000.00 USD / Year
foxnews.com Logo
Fox News Media
Expiration Date
Until further notice
Backend Engineer, Growth and Data
Save Icon
Join Hebbia's Growth and Data team as a Backend Engineer in New York City or San Francisco. You will architect high-scale backend systems, APIs, and infrastructure using Python/Java/Go and AWS. Build solutions for universal indexing and performance optimization while enjoying top benefits like un...
Location Icon
Location
United States , New York City; San Francisco
Salary Icon
Salary
160000.00 - 300000.00 USD / Year
hebbia.ai Logo
Hebbia
Expiration Date
Until further notice
Data Engineer
Save Icon
Join as our first Data Engineer in NYC or SF. Architect end-to-end data solutions, build scalable ETL pipelines, and manage our central data lake. We seek 5+ years of experience with Python, SQL, and cloud data stacks. Enjoy unlimited PTO, comprehensive benefits, and a competitive equity package.
Location Icon
Location
United States , New York City; San Francisco
Salary Icon
Salary
190000.00 - 250000.00 USD / Year
hebbia.ai Logo
Hebbia
Expiration Date
Until further notice
Frontend Engineer, Growth and Data
Save Icon
Join Hebbia's Growth and Data team as a Frontend Engineer in New York or San Francisco. You will build innovative React/TypeScript interfaces to unlock unique customer value and drive platform growth. Collaborate cross-functionally to own product experiences from ideation to launch. Enjoy top ben...
Location Icon
Location
United States , New York City; San Francisco
Salary Icon
Salary
160000.00 - 300000.00 USD / Year
hebbia.ai Logo
Hebbia
Expiration Date
Until further notice
Data Engineer
Save Icon
Seeking a Data Engineer to design scalable database architecture using Python, Spark, and Hadoop. You will develop SQL scripts, stored procedures, and analyze data with Tableau and Power BI. This role is based in Piscataway, NJ, with opportunities at various US locations. A Master's degree in a r...
Location Icon
Location
United States , Piscataway, NJ and various unanticipated locations throughout the U.S.
Salary Icon
Salary
Not provided
itlize.com Logo
Itlize Global
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in San Francisco. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience with PyTorch and distributed systems. Enjoy full benefits, equity, and a role...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Senior Software Engineer - Data Infrastructure
Save Icon
Join Plaid's Data Infrastructure team in San Francisco as a Senior Software Engineer. You will build and scale core data and ML platforms using Spark, Data Warehouses, and orchestration tools. Lead key projects, mentor others, and enable product innovation. We offer comprehensive benefits, equity...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Machine Learning Engineer - Data Foundation and AI
Save Icon
Join Plaid's Data Foundation & AI team as a Machine Learning Engineer in New York. Design, build, and scale advanced ML/AI systems that power products for millions. You'll need 1-3 years of production ML experience, proficiency in Python/PyTorch, and expertise in distributed systems and MLOps. En...
Location Icon
Location
United States , New York
Salary Icon
Salary
186000.00 - 236400.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Data Engineer
Save Icon
Join Plaid's Data Engineering team in San Francisco to build robust golden datasets that power data-driven products. You'll leverage SQL, Python, and tools like DBT and Airflow to design pipelines on petabyte-scale data. Enjoy full benefits while solving complex data challenges to create insights...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
163200.00 - 223200.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Big Data Engineer
Save Icon
Join our team in St. Louis as a Big Data Engineer. You will design and implement optimal data solutions using Hadoop, Spark, and Kafka. Your role involves building ETL processes, managing data infrastructure, and guiding technology design. A relevant degree and 2-4 years of hands-on experience wi...
Location Icon
Location
United States , St. Louis
Salary Icon
Salary
Not provided
protocolinfotech.com Logo
Protocol Infotech
Expiration Date
Until further notice
Sr Data Engineer
Save Icon
Join our team in Irving, Texas, as a Senior Data Engineer. You will architect large-scale data solutions, optimize pipelines, and leverage Azure Synapse, Data Factory, and Hadoop. This role requires 6+ years of expertise in ETL, data modeling, and cloud technologies to drive data reliability and ...
Location Icon
Location
United States , Irving
Salary Icon
Salary
Not provided
rigusinc.com Logo
Resource Informatics Group
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary