CrawlJobs Logo

Lead Data Engineer

United States, Houston · Job Posted February 20, 2026
Apply Position
Job Link Share

Job Description

We are looking for an experienced Lead Data Engineer to oversee the design, implementation, and management of advanced data infrastructure in Houston, Texas. This role requires expertise in architecting scalable solutions, optimizing data pipelines, and ensuring data quality to support analytics, machine learning, and real-time processing. The ideal candidate will have a deep understanding of Lakehouse architecture and Medallion design principles to deliver robust and governed data solutions.

Job Responsibility

  • Develop and implement scalable data pipelines to ingest, process, and store large datasets using tools such as Apache Spark, Hadoop, and Kafka
  • Utilize cloud platforms like AWS or Azure to manage data storage and processing, leveraging services such as S3, Lambda, and Azure Data Lake
  • Design and operationalize data architecture following Medallion patterns to ensure data usability and quality across Bronze, Silver, and Gold layers
  • Build and optimize data models and storage solutions, including Databricks Lakehouses, to support analytical and operational needs
  • Automate data workflows using tools like Apache Airflow and Fivetran to streamline integration and improve efficiency
  • Lead initiatives to establish best practices in data management, facilitating knowledge sharing and collaboration across technical and business teams
  • Collaborate with data scientists to provide infrastructure and tools for complex analytical models, using programming languages like Python or R
  • Implement and enforce data governance policies, including encryption, masking, and access controls, within cloud environments
  • Monitor and troubleshoot data pipelines for performance issues, applying tuning techniques to enhance throughput and reliability
  • Stay updated with emerging technologies in data engineering and advocate for improvements to the organization's data systems

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field with 10+ years of experience in data engineering, or a Master’s degree with 5+ years of relevant experience
  • Proven expertise in designing and implementing Medallion Architecture within a Databricks Lakehouse environment
  • Proficiency in big data technologies such as Apache Spark, Hadoop, and Kafka
  • Extensive experience with cloud platforms like AWS and Azure, including integration of storage and compute services
  • Strong programming skills in Python, Java, or Scala, with hands-on experience in data modeling and stored procedures
  • Knowledge of tools and platforms like Apache Airflow, Databricks, and Dataiku
  • Familiarity with ETL processes and machine learning model deployment
  • Excellent problem-solving skills and ability to optimize data systems for performance and scalability

What we offer

  • medical
  • vision
  • dental
  • life and disability insurance
  • company 401(k) plan

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Lead Data Engineer

8 matching positions

Lead Data Engineer

Do you love building and pioneering in the technology space? Do you enjoy solvin...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or Engineering
  • At least 9 years of experience in application development (Internship experience does not apply)
  • At least 3 years of experience in big data technologies
  • At least 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Have a high bar for quality, perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
  • Fulltime
Read More
Arrow Right
New

Lead Data Engineer

Within COO Technology, Wells Fargo is seeking a Lead Data Engineer to help shape...
Location
Location
United States , Iselin; Charlotte; Irving
Salary
Salary:
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
June 08, 2026
Flip Icon
Requirements
Requirements
  • 5+ years of Database Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of data management experience within Public Cloud (GCP, AWS, Azure)
  • 5+ years of hands on experience of Python or Java, plus Spark SQL for building data pipelines, libraries, and automation tooling.
  • 5+ years with orchestration tools (Cloud Composer/Airflow) and CI/CD (Cloud Build, Git‑based workflows) for data workloads
Job Responsibility
Job Responsibility
  • Design and implement scalable, secure data platforms on Google Cloud using managed services (BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Storage, Composer).)
  • Build reusable frameworks and tooling (ingestion, transformation, quality, orchestration) that can be adopted by multiple product and domain teams.
  • Enable self‑service data consumption and governance by standardizing patterns, templates, and platform capabilities rather than one‑off pipelines.
  • Design logical and physical data platform architectures leveraging BigQuery, Dataflow/Apache Beam, Dataproc/Spark, Pub/Sub, and Cloud Storage.
  • Define and implement standardized ingestion, transformation, and serving patterns (batch and streaming) as reusable blueprints.
  • Optimize cost, performance, and reliability of GCP data workloads (partitioning, clustering, storage classes, autoscaling strategies).
  • Build opinionated data ingestion frameworks (e.g., config‑driven pipelines, connectors, schema handling, error handling) on top of Dataflow, Dataproc, or Composer.
  • Develop shared transformation libraries in Python/SQL/Beam (e.g., common SCD patterns, data quality checks, masking/tokenization routines).
  • Provide orchestration capabilities via Cloud Composer or Cloud Workflows with reusable DAGs/templates and CI/CD integration.
  • Implement robust data modeling (dimensional, data vault, or canonical models) and semantic layers in BigQuery and related tools.
What we offer
What we offer
  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...
Location
Location
India , Pune
Salary
Salary:
Not provided
rapid7.com Logo
Rapid7
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ability to thrive in a fast-paced hybrid organization
  • Comfort working in a highly agile, intensely iterative environment
  • Demonstrated capacity to clearly and concisely communicate complex business activities, technical requirements, and recommendations
  • 8+ years of experience in data engineering, analytics, or business intelligence
  • 8+ years experience designing, implementing, operating, and extending enterprise dimensional data models
  • 3+ years experience building reports and dashboards in Tableau and/or other similar data visualization tools
  • Experience in DBT modeling and understanding modular, performant models
  • Solid understanding of Snowflake, SQL, and data warehouse management
  • Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
  • Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)
Job Responsibility
Job Responsibility
  • Implement data modeling best practices to enhance data accessibility and reporting capabilities
  • Ensure data integrity, security, and compliance with industry standards and regulations
  • Document plans and results in user-stories, issues, PRs, the team’s handbook - following the tradition of documentation first
  • Implement the Corp Data philosophy in everything you do
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
  • Maintain and advocate for these standards through code review
  • Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
  • Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
  • Design, develop, and extend DBT code repository to extend the Enterprise Dimensional Warehouse capabilities and infrastructure
  • Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Lead Data Engineer (Finance Tech) Do you love building and pioneering in the te...
Location
Location
United States , Richmond, Virginia; McLean, Virginia; Cambridge, Massachusetts
Salary
Salary:
179400.00 - 225100.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development
  • At least 2 years of experience in big data technologies
  • At least 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
  • Experience leveraging interactive AI tooling to accelerate productivity, utilizing capabilities beyond basic code completion
Job Responsibility
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Join us as a Lead Data Engineer. At Barclays, we don’t just adapt to the future,...
Location
Location
India , Pune
Salary
Salary:
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong SQL skills for complex queries, optimization, and performance tuning
  • Knowledge of batch and real-time data processing patterns including understanding with streaming technologies (like Kafka)
  • Experience with Spark for distributed data processing
  • Experience building and maintaining robust ETL/ELT pipelines
  • Data modeling expertise (dimensional modeling, star/snowflake schemas)
  • Hands-on experience with data quality frameworks and validation
  • Proven Experience in designing and implementing scalable data solutions on public cloud
  • Deep knowledge of data services like: AWS S3, Redshift, Athena
  • Understanding of security, IAM, VPC, and cost optimization
  • Proficient in Python or Java for data processing
Job Responsibility
Job Responsibility
  • Design, develop and improve software, utilising various engineering methodologies, that provides business, platform, and technology capabilities for our customers and colleagues
  • Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data
  • Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
  • Development of processing and analysis algorithms fit for the intended data complexity and volumes
  • Collaboration with data scientist to build and deploy machine learning models
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Lead Data Engineer Do you love building and pioneering in the technology space?...
Location
Location
United States , San Francisco
Salary
Salary:
215200.00 - 245600.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development (Internship experience does not apply)
  • At least 2 years of experience in big data technologies
  • At least 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
What we offer
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Rapid7 is seeking a Data Engineer, Data Engineering & Analytics to join a high-p...
Location
Location
India , Pune
Salary
Salary:
Not provided
rapid7.com Logo
Rapid7
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in data engineering, analytics, or business intelligence
  • 8+ years experience designing, implementing, operating, and extending enterprise dimensional data models
  • 3+ years experience building reports and dashboards in Tableau and/or other similar data visualization tools
  • Experience in DBT modeling and understanding modular, performant models
  • Solid understanding of Snowflake, SQL, and data warehouse management
  • Understanding of ETL/ELT processes, data pipelines, and cloud-based data architectures
  • Familiarity with modern data stacks (DBT, Airflow, Fivetran, Matillion, or similar tools)
  • Hands-on experience building and operating cloud infrastructure on AWS
  • Experience managing software installations, upgrades, and configuration in production environments
  • Ability to manage data governance, security, and compliance requirements (SOC 2, GDPR, etc.)
Job Responsibility
Job Responsibility
  • Implement data modeling best practices to enhance data accessibility and reporting capabilities
  • Ensure data integrity, security, and compliance with industry standards and regulations
  • Document plans and results in user-stories, issues, PRs, the team’s handbook - following the tradition of documentation first
  • Implement the Corp Data philosophy in everything you do
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment
  • Maintain and advocate for these standards through code review
  • Collaborate with IT and DevOps teams to optimize cloud infrastructure and data governance policies
  • Manage and enhance the existing Tableau reporting suite, ensuring self-service analytics and actionable insights for stakeholders
  • Design, develop, and extend DBT code repository to extend the Enterprise Dimensional Warehouse capabilities and infrastructure
  • Develop and maintain a single source of truth for business metrics, ensuring consistency across reporting platforms
  • Fulltime
Read More
Arrow Right

Lead Data Engineer

Lead Data Engineer Do you love building and pioneering in the technology space?...
Location
Location
United States , Richmond; McLean; Cambridge
Salary
Salary:
179400.00 - 225100.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree
  • At least 4 years of experience in application development (Internship experience does not apply)
  • At least 2 years of experience in big data technologies
  • At least 1 year experience with cloud computing (AWS, Microsoft Azure, Google Cloud)
Job Responsibility
Job Responsibility
  • Collaborate with and across Agile teams to design, develop, test, implement, and support technical solutions in full-stack development tools and technologies
  • Work with a team of developers with deep experience in machine learning, distributed microservices, and full stack systems
  • Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake
  • Share your passion for staying on top of tech trends, experimenting with and learning new technologies, participating in internal & external technology communities, and mentoring other members of the engineering community
  • Collaborate with digital product managers, and deliver robust cloud-based solutions that drive powerful experiences to help millions of Americans achieve financial empowerment
  • Perform unit tests and conduct reviews with other team members to make sure your code is rigorously designed, elegantly coded, and effectively tuned for performance
What we offer
What we offer
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Health, financial and other benefits
  • Fulltime
Read More
Arrow Right