CrawlJobs Logo
Briefcase Icon
Category Icon

Python and Pyspark Developer United States Jobs

51 Job Offers

Filters
Senior ML Ops Engineer
Save Icon
Location Icon
Location
United States , Philadelphia
Salary Icon
Salary
95300.00 - 158800.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Read More
Arrow Right
Senior Data Engineer
Save Icon
Location Icon
Location
United States
Salary Icon
Salary
Not provided
resonate.com Logo
Resonate
Expiration Date
Until further notice
Read More
Arrow Right
Solutions Director, Analytics & AI
Save Icon
Location Icon
Location
United States
Salary Icon
Salary
202100.00 - 355410.00 USD / Year
rackspace.com Logo
Rackspace
Expiration Date
Until further notice
Read More
Arrow Right
Data Architect
Save Icon
Location Icon
Location
United States , King of Prussia
Salary Icon
Salary
145000.00 - 155000.00 USD / Year
bhsg.com Logo
Beacon Hill
Expiration Date
Until further notice
Read More
Arrow Right
Data Architect
Save Icon
Location Icon
Location
United States , King of Prussia
Salary Icon
Salary
145000.00 - 155000.00 USD / Year
bhsg.com Logo
Beacon Hill
Expiration Date
Until further notice
Read More
Arrow Right
Data Engineer, Enterprise Data, Analytics and Innovation
Save Icon
Location Icon
Location
United States
Salary Icon
Salary
110000.00 - 125000.00 USD / Year
vaniamgroup.com Logo
Vaniam Group
Expiration Date
Until further notice
Read More
Arrow Right
Data Scientist Specialist
Save Icon
Location Icon
Location
United States , McLean
Salary Icon
Salary
Not provided
apexsystems.com Logo
Apex Systems
Expiration Date
Until further notice
Read More
Arrow Right
Associate Data Engineer
Save Icon
Location Icon
Location
United States , Irving, TX
Salary Icon
Salary
122949.00 USD / Year
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
Until further notice
Read More
Arrow Right
Senior Machine Learning Engineer
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
207000.00 - 244000.00 USD / Year
https://checkr.com Logo
Checkr
Expiration Date
Until further notice
Read More
Arrow Right
Manager, AI/ML
Save Icon
Location Icon
Location
United States , Bellevue; Overland Park; Frisco
Salary Icon
Salary
Not provided
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Read More
Arrow Right
Manager - AI Observability
Save Icon
Location Icon
Location
United States , Bellevue; Atlanta; Overland Park; Frisco
Salary Icon
Salary
155300.00 - 280100.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Read More
Arrow Right
Embark on a rewarding career path by exploring Python and PySpark Developer jobs, a pivotal role at the intersection of data engineering, big data analytics, and software development. Professionals in this field are the architects of large-scale data processing systems, leveraging the powerful combination of Python's versatility and PySpark's distributed computing capabilities. This role is central to modern data-driven organizations, enabling them to transform vast, unstructured data into actionable insights and intelligence. The core mission of a Python and PySpark Developer is to design, build, and maintain robust, scalable, and efficient data pipelines and applications. A typical day involves a range of responsibilities focused on handling big data. Developers are primarily tasked with writing and optimizing complex PySpark code for distributed data processing, which allows for the efficient handling of datasets far too large for a single machine. They design and implement ETL (Extract, Transform, Load) processes, pulling data from diverse sources like data lakes, databases, and streaming platforms. A significant part of the role involves data transformation—cleansing, aggregating, and enriching raw data to make it suitable for analysis, reporting, and feeding machine learning models. These professionals also build and maintain real-time data streaming pipelines using technologies like Apache Kafka. Furthermore, they are responsible for performance tuning of Spark applications to minimize processing time and resource consumption, collaborating closely with data scientists, analysts, and other engineering teams to understand data requirements and deliver reliable data products. To succeed in Python and PySpark Developer jobs, a specific and advanced skill set is required. Mastery of the Python programming language is non-negotiable, with a deep understanding of its libraries for data manipulation such as Pandas. Profound expertise in Apache Spark and its Python API, PySpark, is essential, including a solid grasp of Spark's core concepts like RDDs, DataFrames, and the Catalyst optimizer. Experience with big data ecosystems, including file formats like Parquet and ORC, and cluster resource managers like YARN, is highly typical. Knowledge of SQL and database principles is fundamental, as is experience with distributed messaging systems like Kafka for real-time data ingestion. Beyond technical prowess, these roles often require strong problem-solving abilities, the capacity to debug complex distributed system issues, and effective communication skills to act as a subject-matter expert. A background in software engineering principles, version control with Git, and an understanding of cloud platforms like AWS, Azure, or GCP are common requirements for these positions. If you are passionate about big data and possess these skills, pursuing Python and PySpark Developer jobs can be a highly fulfilling career choice, offering the opportunity to work on cutting-edge technology that powers business decision-making.

Filters

×
Countries
Category
Location
Work Mode
Salary