Explore a world of opportunity in Big Data PySpark Developer jobs, a critical and growing field at the intersection of data engineering and advanced analytics. Professionals in this role architect large-scale data processing systems, using Apache Spark and Python to transform vast, complex datasets into structured, actionable information. These developers are in high demand across industries from finance and healthcare to e-commerce and technology, as organizations increasingly rely on data-driven decision-making. A career in this domain offers the chance to work with cutting-edge technology and to solve some of the most challenging data problems facing modern enterprises.

A Big Data PySpark Developer is primarily responsible for the end-to-end development and maintenance of big data applications. A typical day involves designing, building, and optimizing robust data pipelines that efficiently process terabytes or even petabytes of data. Common responsibilities include writing and tuning PySpark code for extract, transform, and load (ETL) processes; performing data analysis to identify trends and anomalies; and ensuring the overall health and performance of the big data ecosystem. These developers also analyze applications for vulnerabilities and security issues, conduct rigorous testing and debugging, and resolve performance bottlenecks. Senior professionals in these jobs often act as subject matter experts, advising stakeholders, mentoring junior analysts, and leading medium- to large-scale development projects.

To excel in Big Data PySpark Developer jobs, a specific and robust skill set is required. Mastery of the PySpark framework is non-negotiable, encompassing a deep understanding of Spark SQL, DataFrames, and RDDs for both batch and stream processing. Strong Python programming proficiency is essential, along with intermediate to advanced SQL for querying databases. Expertise in the broader Hadoop ecosystem, particularly components such as HDFS, Hive, and YARN, is a standard expectation. Beyond technical prowess, successful candidates bring strong analytical and problem-solving skills for troubleshooting complex technical challenges, and experience with development tools such as Git for version control, JIRA for project tracking, and Jupyter notebooks for interactive development is commonplace. Because these roles are collaborative and often involve working with global teams, excellent written and verbal communication skills, the ability to work under pressure, and a results-oriented mindset are crucial for securing and thriving in these high-impact jobs.
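To give a concrete sense of the day-to-day ETL work described above, the following is a minimal sketch of a batch PySpark job that reads raw data, cleans and aggregates it with DataFrames and Spark SQL, and writes the result back to HDFS. The dataset, file paths, and column names (orders.csv, customer_id, and so on) are hypothetical placeholders for illustration, not part of any specific role.

```python
# Minimal PySpark ETL sketch. All paths, table names, and columns are
# hypothetical placeholders used purely for illustration.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("orders_etl").getOrCreate()

# Extract: read raw CSV data from HDFS into a DataFrame
raw = spark.read.csv("hdfs:///data/raw/orders.csv", header=True, inferSchema=True)

# Transform: drop incomplete rows, normalize types, and derive a revenue column
clean = (
    raw.dropna(subset=["order_id", "customer_id"])
       .withColumn("order_ts", F.to_timestamp("order_ts"))
       .withColumn("revenue", F.col("quantity") * F.col("unit_price"))
)

# Use Spark SQL to aggregate revenue per customer per day
clean.createOrReplaceTempView("orders")
daily_revenue = spark.sql("""
    SELECT customer_id,
           to_date(order_ts) AS order_date,
           SUM(revenue)      AS daily_revenue
    FROM orders
    GROUP BY customer_id, to_date(order_ts)
""")

# Load: write the result as partitioned Parquet for downstream consumers
daily_revenue.write.mode("overwrite").partitionBy("order_date").parquet(
    "hdfs:///data/curated/daily_revenue"
)

spark.stop()
```

Real production pipelines layer on schema enforcement, partition tuning, monitoring, and orchestration, but the extract, transform, and load shape shown here is the core pattern these developers build and optimize every day.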