Lead Data Engineer

India, Kochi · Job Posted December 08, 2025

Job Description

We are seeking an experienced Senior Data Engineer to lead the development of a scalable data ingestion framework while ensuring high data quality and validation. The successful candidate will also be responsible for designing and implementing robust APIs for seamless data integration. This role is ideal for someone with deep expertise in building and managing big data pipelines using modern AWS-based technologies, and who is passionate about driving quality and efficiency in data processing systems.

Job Responsibility

  • Architect, develop, and maintain an end-to-end data ingestion framework for extracting, transforming, and loading data from diverse sources
  • Use AWS services (Glue, Lambda, EMR, ECS, EC2, Step Functions) to build scalable, resilient, automated data pipelines
  • Develop and implement automated data quality checks, validation routines, and error-handling mechanisms
  • Establish comprehensive monitoring, logging, and alerting systems for data quality issues
  • Architect and develop secure, high-performance APIs for data services integration
  • Create thorough API documentation and establish standards for security, versioning, and performance
  • Work with business stakeholders, data scientists, and operations teams to understand requirements
  • Participate in sprint planning, code reviews, and agile ceremonies
  • Contribute to CI/CD pipeline development using GitLab

Requirements

  • 5+ years of experience in data engineering with a focus on analytical platform development
  • Proficiency in Python and/or PySpark
  • Strong SQL skills for ETL processes and large-scale data manipulation
  • Extensive AWS experience (Glue, Lambda, Step Functions, S3)
  • Familiarity with big data systems (AWS EMR, Apache Spark, Apache Iceberg)
  • Database experience with DynamoDB, Aurora, Postgres, or Redshift
  • Proven experience designing and implementing RESTful APIs
  • Hands-on CI/CD pipeline experience (preferably GitLab)
  • Agile development methodology experience
  • Strong problem-solving abilities and attention to detail
  • Excellent communication and interpersonal skills
  • Ability to work independently and collaboratively
  • Capacity to quickly learn and adapt to new technologies
  • 10+ years of total experience

Nice to have

  • Bachelor’s/Master’s in Computer Science, Data Engineering or related field (preferred)
  • Experience with additional AWS services (Kinesis, Firehose, SQS)
  • Familiarity with data lakehouse architectures and modern data quality frameworks
  • Experience with proactive data quality management in multi-cluster environments

Similar Jobs for Lead Data Engineer

8 matching positions

Lead Data Engineer

Do you love building and pioneering in the technology space? Do you enjoy solvin...
Location: India, Bengaluru
Salary: Not provided
Capital One
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or Engineering
  • At least 9 years of experience in application development (Internship experience does not apply)
  • At least 3 years of experience in big data technologies
  • At least 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Have a high bar for quality, perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
Employment type: Full-time

Lead Data Engineer

Within COO Technology, Wells Fargo is seeking a Lead Data Engineer to help shape...
Location: United States, Iselin; Charlotte; Irving
Salary: Not provided
Wells Fargo
Expiration Date: June 08, 2026
Requirements
  • 5+ years of Database Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of data management experience within Public Cloud (GCP, AWS, Azure)
  • 5+ years of hands-on experience with Python or Java, plus Spark SQL, for building data pipelines, libraries, and automation tooling
  • 5+ years with orchestration tools (Cloud Composer/Airflow) and CI/CD (Cloud Build, Git‑based workflows) for data workloads
Job Responsibility
  • Design and implement scalable, secure data platforms on Google Cloud using managed services (BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Storage, Composer).
  • Build reusable frameworks and tooling (ingestion, transformation, quality, orchestration) that can be adopted by multiple product and domain teams.
  • Enable self‑service data consumption and governance by standardizing patterns, templates, and platform capabilities rather than one‑off pipelines.
  • Design logical and physical data platform architectures leveraging BigQuery, Dataflow/Apache Beam, Dataproc/Spark, Pub/Sub, and Cloud Storage.
  • Define and implement standardized ingestion, transformation, and serving patterns (batch and streaming) as reusable blueprints.
  • Optimize cost, performance, and reliability of GCP data workloads (partitioning, clustering, storage classes, autoscaling strategies).
  • Build opinionated data ingestion frameworks (e.g., config‑driven pipelines, connectors, schema handling, error handling) on top of Dataflow, Dataproc, or Composer.
  • Develop shared transformation libraries in Python/SQL/Beam (e.g., common SCD patterns, data quality checks, masking/tokenization routines).
  • Provide orchestration capabilities via Cloud Composer or Cloud Workflows with reusable DAGs/templates and CI/CD integration.
  • Implement robust data modeling (dimensional, data vault, or canonical models) and semantic layers in BigQuery and related tools.
What we offer
  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
Employment type: Full-time

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...
Location: India, Pune
Salary: Not provided
Rapid7
Expiration Date: Until further notice
Requirements
  • Ability to thrive in a fast-paced hybrid organization
  • Comfort working in a highly agile, intensely iterative environment
  • Demonstrated capacity to clearly and concisely communicate complex business activities, technical requirements, and recommendations
  • 8+ years of experience in data engineering, analytics, or business intelligence
  • 8+ years of experience designing, implementing, operating, and extending enterprise dimensional data models
  • 3+ years of experience building reports and dashboards in Tableau and/or similar data visualization tools
  • Experience in DBT modeling and understanding modular, performant models
  • Solid understanding of Snowflake, SQL, and data warehouse management
  • Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
  • Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)
Job Responsibility
  • Implement data modeling best practices to enhance data accessibility and reporting capabilities
  • Ensure data integrity, security, and compliance with industry standards and regulations
  • Document plans and results in user stories, issues, PRs, and the team’s handbook, following the tradition of documentation first
  • Implement the Corp Data philosophy in everything you do
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
  • Maintain and advocate for these standards through code review
  • Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
  • Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
  • Design, develop, and extend DBT code repository to extend the Enterprise Dimensional Warehouse capabilities and infrastructure
  • Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms
Employment type: Full-time

Lead Data Engineer

Lead Data Engineer (Finance Tech) Do you love building and pioneering in the te...
Location: United States, Richmond, Virginia; McLean, Virginia; Cambridge, Massachusetts
Salary: 179400.00 - 225100.00 USD / Year
Capital One
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development
  • At least 2 years of experience in big data technologies
  • At least 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
  • Experience leveraging interactive AI tooling to accelerate productivity, utilizing capabilities beyond basic code completion
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
  • Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)
  • Comprehensive, competitive, and inclusive set of health, financial and other benefits
Employment type: Full-time

Lead Data Engineer

Join us as a Lead Data Engineer. At Barclays, we don’t just adapt to the future,...
Location: India, Pune
Salary: Not provided
Barclays
Expiration Date: Until further notice
Requirements
  • Strong SQL skills for complex queries, optimization, and performance tuning
  • Knowledge of batch and real-time data processing patterns, including an understanding of streaming technologies (such as Kafka)
  • Experience with Spark for distributed data processing
  • Experience building and maintaining robust ETL/ELT pipelines
  • Data modeling expertise (dimensional modeling, star/snowflake schemas)
  • Hands-on experience with data quality frameworks and validation
  • Proven experience in designing and implementing scalable data solutions on public cloud
  • Deep knowledge of data services such as AWS S3, Redshift, and Athena
  • Understanding of security, IAM, VPC, and cost optimization
  • Proficient in Python or Java for data processing
Job Responsibility
  • Design, develop and improve software, utilising various engineering methodologies, that provides business, platform, and technology capabilities for our customers and colleagues
  • Building and maintenance of data architecture pipelines that enable the transfer and processing of durable, complete and consistent data
  • Design and implementation of data warehouses and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
  • Development of processing and analysis algorithms fit for the intended data complexity and volumes
  • Collaboration with data scientists to build and deploy machine learning models
Employment type: Full-time

Lead Data Engineer

Lead Data Engineer Do you love building and pioneering in the technology space?...
Location: United States, San Francisco
Salary: 215200.00 - 245600.00 USD / Year
Capital One
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development (Internship experience does not apply)
  • At least 2 years of experience in big data technologies
  • At least 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
  • Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)
  • Comprehensive, competitive, and inclusive set of health, financial and other benefits
Employment type: Full-time

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...
Location: India, Pune
Salary: Not provided
Rapid7
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in data engineering, analytics, or business intelligence
  • 8+ years of experience designing, implementing, operating, and extending enterprise dimensional data models
  • 3+ years of experience building reports and dashboards in Tableau and/or similar data visualization tools
  • Experience in DBT modeling and understanding modular, performant models
  • Solid understanding of Snowflake, SQL, and data warehouse management
  • Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
  • Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)
  • Hands-on experience building and operating cloud infrastructure on AWS
  • Experience managing software installations, upgrades, and configuration in production environments
  • Ability to manage data governance, security, and compliance requirements (SOC 2, GDPR, etc.)
Job Responsibility
  • Implement data modeling best practices to enhance data accessibility and reporting capabilities
  • Ensure data integrity, security, and compliance with industry standards and regulations
  • Document plans and results in user stories, issues, PRs, and the team’s handbook, following the tradition of documentation first
  • Implement the Corp Data philosophy in everything you do
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
  • Maintain and advocate for these standards through code review
  • Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
  • Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
  • Design, develop, and extend DBT code repository to extend the Enterprise Dimensional Warehouse capabilities and infrastructure
  • Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms
Employment type: Full-time

Lead Data Engineer

Lead Data Engineer Do you love building and pioneering in the technology space?...
Location: United States, Richmond; McLean; Cambridge
Salary: 179400.00 - 225100.00 USD / Year
Capital One
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development (Internship experience does not apply)
  • At least 2 years of experience in big data technologies
  • At least 1 year of experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
  • Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)
  • Health, financial and other benefits
Employment type: Full-time