CrawlJobs Logo
Briefcase Icon
Category Icon

Data Engineer - Pyspark Jobs (Remote work)

451 Job Offers

Filters
Data Engineer
Save Icon
Join iCapital's Lisbon team as a Data Engineer to build and scale the core data infrastructure powering our FinTech platform. You will design high-performance pipelines using Python, SQL, and modern cloud tools (AWS, Snowflake, dbt). This role offers a competitive package with equity, bonus, and ...
Location Icon
Location
Portugal , Lisbon
Salary Icon
Salary
Not provided
icapital.com Logo
iCapital Network
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join Socket's Data Platform team as a Senior Data Engineer. Design and build scalable infrastructure handling billions of records for real-time analytics. Utilize Node.js, TypeScript, Kafka, and ClickHouse in this high-impact, remote-first role. Enjoy competitive salary, equity, comprehensive ben...
Location Icon
Location
United States
Salary Icon
Salary
Not provided
socket.dev Logo
Socket
Expiration Date
Until further notice
Manager, Data Engineering
Save Icon
Lead the data engineering team and architect the semantic layer for a hyper-growth company. This strategic role requires deep expertise in dimensional modeling, dbt, and SQL to build reliable, performance-optimized data products. Enjoy a remote-first culture, competitive compensation with equity,...
Location Icon
Location
United States
Salary Icon
Salary
199680.00 - 249600.00 USD / Year
meetdandy.com Logo
Dandy
Expiration Date
Until further notice
Data Engineer
Save Icon
Join MediaRadar as a Data Engineer to build and manage scalable data pipelines on Azure Databricks. You will design ETL/ELT workflows, process large-scale datasets, and work with Delta Lake and Apache Spark. This US-based role offers comprehensive benefits including medical insurance, 401k match,...
Location Icon
Location
United States
Salary Icon
Salary
Not provided
mediaradar.com Logo
MediaRadar, Inc.
Expiration Date
Until further notice
Senior Azure Data Engineer
Save Icon
Join Parexel, a leading CRO, as a Senior Azure Data Engineer. Design and build data pipelines using Azure Data Factory, Databricks, and SQL within a medallion architecture. This remote Canada role requires 7+ years in software development and 2+ in cloud, with strong Azure expertise. Enjoy compre...
Location Icon
Location
Canada , Remote
Salary Icon
Salary
Not provided
parexel.com Logo
Parexel
Expiration Date
Until further notice
Data Migration Engineer
Save Icon
Lead complex data migration projects as a Data Migration Engineer at MoeGo. You will own the full lifecycle, from client discovery to cutover, using Python, Pandas, and advanced SQL. This autonomous, client-facing role requires deep technical skills to cleanse, transform, and validate large datas...
Location Icon
Location
United States
Salary Icon
Salary
Not provided
moego.pet Logo
MoeGo
Expiration Date
Until further notice
Data Engineer
Save Icon
Join a dynamic team building scalable data infrastructure for high-growth startups. As a Data Engineer, you'll design and maintain robust ETL/ELT pipelines using TypeScript in a fully remote European role. You'll ensure data quality and reliability while enjoying high autonomy and flexible hours.
Location Icon
Location
Salary Icon
Salary
80.00 USD / Hour
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Software Engineer for Code Reviewing LLM Data Training
Save Icon
Join our team as a Senior Software Engineer specializing in Swift code review. You will audit AI-generated code evaluations, ensuring quality, functionality, and adherence to guidelines. This remote role requires 5-7+ years of Swift expertise and strong QA skills. Weekly pay provided.
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Software Engineer for Code Reviewing LLM Data Training
Save Icon
Join our team as a Senior Software Engineer specializing in code review. Utilize your 5-7+ years of Java expertise to audit AI-generated code evaluations, ensuring quality, functionality, and adherence to guidelines. This remote role is pivotal for training advanced LLMs, requiring strong analyti...
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Software Engineer for Code Reviewing LLM Data Training
Save Icon
Seeking a Senior Software Engineer with deep R expertise to review AI-generated code evaluations. You will ensure annotation quality, validate code functionality, and provide expert feedback. This remote role requires 5-7+ years of R development and strong QA skills. Join a cutting-edge project t...
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Software Engineer for Code Reviewing LLM Data Training
Save Icon
Join our team as a Senior Software Engineer specializing in code review. Utilize your 5-7+ years of C/C++ expertise to audit AI-generated code evaluations, ensuring quality, functionality, and security. You will validate code, provide feedback, and maintain high annotation standards within struct...
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Software Engineer for Code Reviewing LLM Data Training
Save Icon
Join our team as a Senior Software Engineer specializing in Kotlin code review. You will ensure the quality of AI-generated code evaluations by auditing annotator work for correctness and adherence to guidelines. This role requires deep Kotlin expertise, QA experience, and excellent communication...
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Sr. Software Engineer for Data Training
Save Icon
Join a groundbreaking AI research project as a Senior Software Engineer. Utilize your 2-4 years of frontend expertise with React, Angular, or similar frameworks to shape user interactions. This fully remote role values strong UI/UX design, CSS proficiency, and excellent communication. Apply your ...
Location Icon
Location
Salary Icon
Salary
Not provided
g2i.co Logo
G2i Inc.
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Join our London team as a Senior Data Engineer. Design and build scalable data pipelines using Python, Airflow, and Kafka. Collaborate cross-functionally to shape our data architecture and drive product success in a high-growth, global environment.
Location Icon
Location
United Kingdom , London
Salary Icon
Salary
Not provided
userlane.com Logo
Userlane GmbH
Expiration Date
Until further notice
Ai and data engineer
Save Icon
Join our team as an AI and Data Engineer to design and deploy cutting-edge AI solutions and data integrations. You will develop machine learning models, RAG agents, and robust data pipelines using Snowflake, Python, and SQL. This role requires 8+ years of enterprise experience and offers the oppo...
Location Icon
Location
United States
Salary Icon
Salary
Not provided
capstonec.com Logo
Capstone IT Staffing
Expiration Date
Until further notice
Engineering Manager - Data Platform & Analytics
Save Icon
Lead the data platform and analytics teams at Fluent in Toronto. Manage Databricks strategy, real-time pipelines, and BI dashboards to drive business insights. Requires 6+ years in data engineering/analytics with deep Databricks, Spark, and team leadership expertise. Enjoy competitive compensatio...
Location Icon
Location
Canada , Toronto
Salary Icon
Salary
160000.00 - 225000.00 CAD / Year
fluentco.com Logo
Fluent, Inc
Expiration Date
Until further notice
Data Engineering Lead
Save Icon
Lead our Data Engineering team in the US, building scalable data pipelines for our vendor universe. You'll leverage Databricks, Airflow, PySpark, and SQL to deliver robust solutions in a fast-paced environment. This hands-on leadership role offers flexible hours, generous 401K match, equity, and ...
Location Icon
Location
United States
Salary Icon
Salary
215000.00 - 240000.00 USD / Year
yipitdata.com Logo
YipitData
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Location Icon
Location
Canada
Salary Icon
Salary
145000.00 - 154000.00 USD / Year
waveapps.com Logo
Wave
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Location Icon
Location
United States , Washington, D.C.
Salary Icon
Salary
Not provided
trilogyfederal.com Logo
Trilogy Federal
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Location Icon
Location
United States , Washington, D.C.
Salary Icon
Salary
130000.00 - 150000.00 USD / Year
trilogyfederal.com Logo
Trilogy Federal
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary