CrawlJobs Logo
Briefcase Icon
Category Icon

Data Engineer - Pyspark Jobs

1548 Job Offers

Filters
Data Engineering Manager, Analytics
Save Icon
Lead our Data Engineering team in Bellevue, building scalable data architecture and BI solutions that directly impact company growth. You will manage a team, design data models, and deliver insights using cutting-edge tech on rich datasets. Requires 8+ years in BI/Data Warehousing, 2+ years manag...
Location Icon
Location
United States , Bellevue
Salary Icon
Salary
177000.00 - 247000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Data Scientist / Engineer Intern
Save Icon
Join our Data Management & Analytics team in Genève for a pivotal internship. You will help build a centralized clinical Data Lake on Azure, developing ingestion pipelines and supporting AI initiatives. Ideal for a Master's student skilled in Python, SQL, and Spark, with a passion for data archit...
Location Icon
Location
Switzerland , Genève
Salary Icon
Salary
Not provided
teoxane.com Logo
Teoxane
Expiration Date
Until further notice
Manufacturing Program Manager - Data Center Design, Engineering, & Construction
Save Icon
Lead the manufacturing program for Meta's data center construction, ensuring scalable capacity delivery. You'll manage supply chain optimization, manufacturing-informed design, and integration with onsite builds. This role requires 10+ years in construction/manufacturing program management and le...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
150000.00 - 209000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Area schedule lead - data center design, engineering and construction
Save Icon
Lead our contingent project schedulers in data center design, engineering, and construction. This strategic role requires 10+ years of construction scheduling expertise with Primavera P6. You will ensure schedule health, manage claims, and guide site teams in Menlo Park. A bonus, equity, and bene...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
123000.00 - 176000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Research Engineer, Media Data Research - MSL FAIR
Save Icon
Join Meta's FAIR team as a Research Engineer focused on Media Data Research. You will architect scalable data curation systems for cutting-edge Large Language and Media Models. This role requires expertise in LLM/LMM, multimodal data, and a strong software engineering background. Based in Menlo P...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Data Engineering Manager, Analytics
Save Icon
Lead our Data Engineering team in Menlo Park, building scalable data architecture that drives product strategy and user satisfaction. You will manage BI, data warehousing, and a team to deliver impactful data models, pipelines, and dashboards. Requires 8+ years in BI/Data Warehousing, 2+ years ma...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
177000.00 - 247000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Data Engineering Leader, Analytics (Instagram)
Save Icon
Lead data engineering for Instagram's analytics, impacting over 1 billion users. Build and scale a high-performing team to design data architecture, models, and pipelines. Leverage 9+ years in BI/Data Warehousing and leadership in Menlo Park. Enjoy a role with bonus, equity, and cutting-edge tech...
Location Icon
Location
United States , Menlo Park
Salary Icon
Salary
210000.00 - 281000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Senior Data Engineer
Save Icon
Seeking a Senior Data Engineer in Cluj to develop Python ETL pipelines in Azure Synapse. Requires 5+ years' experience with cloud architecture, PySpark, and Azure. Enjoy flexible remote/hybrid work, private health insurance, and sponsored certifications. Join a collaborative team on impactful BI ...
Location Icon
Location
Romania , Cluj
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.

Filters

×
Countries
Category
Location
Work Mode
Salary