Data Engineer - PySpark United States Jobs (Remote work)

186 Job Offers

Data Engineer

Seeking an experienced Data Engineer in Worcester, USA. This role requires 5-10 years of expertise in data modeling, technical design, and Stored Procedures. Key technical skills include SQL, Snowflake, Databricks, Alteryx, Power BI, and Azure. Join a team leveraging modern data platforms for imp...

Location

United States, Worcester

Salary

55.00 - 65.00 USD / Hour

Beacon Hill

Expiration Date

Until further notice

Healthcare Data Engineer

Seeking a Healthcare Data Engineer in Irvine to develop innovative data solutions. Requires 5+ years with healthcare tech (HL7/FHIR), OOP (C#, Java, Python), and SQL/NoSQL. You'll design features using agile methodologies to meet critical caregiver and business needs.

Location

United States, Irvine

Salary

68904.00 - 143550.00 USD / Year

NTT DATA

Expiration Date

Until further notice

Data Engineer

Join SQA Group as a Data Engineer to build scalable data foundations and modern platforms. You will design pipelines, manage big data tools (Hadoop, Spark, Kafka), and utilize cloud services (AWS, Azure). This US-based role offers the chance to unlock the true power of data for impactful client p...

Location

United States

Salary

Not provided

SQA Group

Expiration Date

Until further notice

Informatica QA/ Data Engineer

Join a major data migration project in the United States as a QA/Data Engineer. This unique role blends data engineering with quality assurance to ensure a successful transfer. You will be responsible for moving and validating data at scale. Apply your data engineering expertise to this critical ...

Location

United States

Salary

75.00 USD / Hour

Signify Technology

Expiration Date

Until further notice

Software Engineer, Backend & Data

Join Epic's global team as a Software Engineer, Backend & Data. This fully remote US role focuses on building scalable backend systems and data infrastructure, including EDW design and pipelines. We seek a candidate proficient in SQL, Python/Scala/Java, and big data tech like Spark. Enable data-d...

Location

United States

Salary

160000.00 - 200000.00 USD / Year

Epic Kids

Expiration Date

Until further notice

GenAI Data Automation Engineer

Join our team in Atlanta as a GenAI Data Automation Engineer. Design and build innovative, AI-driven data pipelines across AWS and Azure hybrid environments. Leverage your expertise in LLMs, Python, Spark, and cloud services to operationalize Generative AI solutions. We offer comprehensive benefi...

Location

United States, Atlanta

Salary

Not provided

Robert Half

Expiration Date

Until further notice

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark plays a pivotal role in the modern data ecosystem, constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain.

In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python API for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources (such as databases, APIs, and log files) into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount: they implement robust data validation, monitoring, and observability frameworks so that data is accurate, timely, and trusted. They are also tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices.

To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as PySpark is the primary tool for distributed data processing. Strong knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement for managing complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization.

If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.