Filters

Cities

Work Mode

Data Engineer - Pyspark India Jobs (On-site work)

69 Job Offers

Filters

Big Data Engineer

Seeking a Big Data Engineer in Chennai to develop and enhance application systems. This role requires 3-5 years' experience with Hadoop, Python, PySpark, and SQL. You will analyze complex issues, implement solutions, and write code within a global technology framework. Apply your expertise in dat...

Location

India , Chennai

Salary

Not provided

Citi

Expiration Date

Until further notice

Quality Engineer – AI & Data Platforms

Join our team as a Quality Engineer for AI & Data Platforms in Bangalore or Coimbatore. You will validate AI workflows, automate tests with Selenium/Pytest, and ensure system performance on Kubernetes/OpenShift. This role requires strong QA experience and collaboration with cross-functional Agile...

Location

India , Bangalore; Coimbatore

Salary

Not provided

Soliton

Expiration Date

Until further notice

Data Engineer

Join Amgen as a Data Engineer in Hyderabad. Design and optimize scalable data pipelines on AWS/Databricks using Python/Scala. Apply your 5+ years' experience in ETL, data modeling, and Agile practices within a collaborative team. Drive data strategy with modern tools and best practices.

Location

India , Hyderabad

Salary

Not provided

Amgen

Expiration Date

Until further notice

Sr. Associate Data Engineer

Join Amgen as a Senior Associate Data Engineer in Hyderabad. Design and optimize scalable data pipelines using Python/Scala on AWS and Databricks. Apply your 5+ years' experience in ETL, data modeling, and Agile practices within a collaborative team. Drive data strategy and leverage cutting-edge ...

Location

India , Hyderabad

Salary

Not provided

Amgen

Expiration Date

Until further notice

Senior Data Engineer

Seeking a Senior Data Engineer in Hyderabad to design and optimize scalable data pipelines on AWS and Databricks. You will leverage Python/Scala and ETL orchestration within a SAFE Agile team to ensure data integrity and performance. This role requires 6+ years' experience, mastery of modern data...

Location

India , Hyderabad

Salary

Not provided

Amgen

Expiration Date

Until further notice

Associate Data Engineer

Join our RunOps team in Hyderabad as an Associate Data Engineer, providing 24/7 support for scalable data solutions. You will maintain complex data pipelines using Databricks, AWS, Python, and SQL within a Scaled Agile (SAFe) framework. This role is ideal for a problem-solver passionate about big...

Location

India , Hyderabad

Salary

Not provided

Amgen

Expiration Date

Until further notice

Sr Data Engineer

Join our team in Hyderabad as a Senior Data Engineer. You will design and build scalable data pipelines using Python, Spark, and cloud platforms like AWS. Leverage your expertise in ETL processes and big data technologies to drive actionable business insights. We offer competitive total rewards a...

Location

India , Hyderabad

Salary

Not provided

Amgen

Expiration Date

Until further notice

Big Data Engineer

Join Citi in Pune as a Big Data Engineer. This role requires 0-2 years of experience in application development and programming. You will analyze systems, implement solutions, and write code using various programming languages. Utilize your knowledge of business processes and industry standards t...

Location

India , Pune

Salary

Not provided

Citi

Expiration Date

Until further notice

Data Engineer

Join Citi in Chennai as a Data Engineer (Engineering Analyst 2). Utilize your 3-5 years of engineering experience in a complex global environment. You will design, monitor, and improve systems, ensuring performance and quality standards. This role offers a chance to work on innovative solutions w...

Location

India , Chennai

Salary

Not provided

Citi

Expiration Date

Until further notice

1 2 3 4

Are you a data architect with a passion for building robust, scalable systems? Your search for Data Engineer - PySpark jobs ends here. A Data Engineer specializing in PySpark is a pivotal role in the modern data ecosystem, responsible for constructing the foundational data infrastructure that powers analytics, machine learning, and business intelligence. These professionals are the master builders of the data world, transforming raw, unstructured data into clean, reliable, and accessible information for data scientists, analysts, and business stakeholders. If you are seeking jobs where you can work with cutting-edge big data technologies to solve complex data challenges at scale, this is your domain. In this profession, typical responsibilities revolve around the entire data pipeline lifecycle. Data Engineers design, develop, test, and maintain large-scale data processing systems. A core part of their daily work involves writing efficient, scalable code using PySpark, the Python library for Apache Spark, to perform complex ETL (Extract, Transform, Load) or ELT processes. They build and orchestrate data pipelines that ingest data from diverse sources—such as databases, APIs, and log files—into data warehouses like Snowflake or data lakes on cloud platforms like AWS, Azure, and GCP. Ensuring data quality and reliability is paramount; they implement robust data validation, monitoring, and observability frameworks to guarantee that data is accurate, timely, and trusted. Furthermore, they are tasked with optimizing the performance and cost of these data systems, fine-tuning Spark jobs for maximum efficiency, and automating deployment processes through CI/CD and Infrastructure as Code (IaC) practices. To excel in Data Engineer - PySpark jobs, a specific and powerful skill set is required. Mastery of Python and PySpark is non-negotiable, as it is the primary tool for distributed data processing. Profound knowledge of SQL is essential for data manipulation and querying. Experience with workflow orchestration tools like Apache Airflow is a common requirement to manage complex pipeline dependencies. A deep understanding of cloud data solutions (AWS, GCP, Azure) and platforms like Databricks is highly valued. Beyond technical prowess, successful candidates possess strong problem-solving abilities to debug and optimize data flows, a keen eye for system design and architecture, and excellent collaboration skills to work with cross-functional teams, including data scientists and business analysts. They are often expected to mentor junior engineers and contribute to establishing data engineering best practices and standards across an organization. If you are ready to build the future of data, explore the vast array of Data Engineer - PySpark jobs available and take the next step in your impactful career.