Databricks Engineer - GCP Cloud

India, Bangalore South · Job Posted April 20, 2026

Job Description

We are looking for a bright and dynamic engineer, motivated and able to work independently as well as in partnership with IT and Business teams spread across the globe. The candidate needs to be an exceptionally strong Python and SQL programmer with hands-on experience in GCP-native data technologies including BigQuery, Dataproc, Cloud Composer, and Datastream. Besides technical skills, we are looking for a candidate with a strong sense of ownership and the ability to work in a diverse, cross-functional team spanning Engineering, Research, DataOps, and Compliance.

Job Responsibility

  • Build and maintain scalable, distributed, fault-tolerant data pipelines on GCP, including BigQuery-based lakehouse layers and Dataproc-driven Delta Lake workflows
  • Actively participate in meetings with various stakeholders across data engineering, compliance, and business teams globally
  • Understand market data processing and transformation needs
  • Build pipelines to acquire, normalise, transform, and release large volumes of financial data through the OMDP data factory
  • Design and implement bitemporal data models (valid-time + system-time) on BigQuery to support certified, regulatory-grade time-series datasets
  • Build, use, and maintain software testing frameworks (unit / non-regression / user acceptance) for data pipelines and transformation logic
  • Take complete ownership of solutions and assigned tasks, including ingestion pipelines, QA workflows, correction management, and audit trail implementation
  • Work in a collaborative manner with other team members and contribute to shared platform services rather than vertical-specific implementations
  • Have business acumen to understand financial concepts around reference data related to equities and other asset classes
  • Support teams across data and technology in implementing AI solutions and integrating their services with MSCI's data science products and platforms, including AI-assisted ingestion, anomaly detection, and semantic search over the lakehouse using Vertex AI

Requirements

  • 6-8 years of experience in data engineering
  • Proficient in Python programming — data pipeline development, transformation logic, and automation scripts
  • Proficient in data query and analysis using SQL, with strong hands-on experience in BigQuery — partitioning, clustering, materialised views, and time-series query patterns at scale
  • Hands-on experience building and scheduling pipelines using Cloud Composer (Apache Airflow) — DAG authoring, SLA alerting, retry logic, and dependency management
  • Working knowledge of Dataproc (Apache Spark) — batch ingestion, Delta Lake merge operations, and incremental data processing
  • Proficient in AI-assisted development tools such as GitHub Copilot, Cursor, or others for accelerating code generation and enhancing developer productivity
  • Code versioning and collaboration using Git — branching strategies, pull request workflows, and pipeline-as-code practices
  • Familiarity with REST APIs — consuming external data vendor APIs and building service-layer integrations
  • Familiarity with GCP cloud technologies — Cloud Storage, Pub/Sub, Datastream, Cloud Monitoring, IAM, and VPC Service Controls

Nice to have

  • Basic knowledge of data manipulation and analysis libraries — pandas, PySpark, or equivalent
  • Basic knowledge of columnar storage, SQL-based querying, and time-series analytics (ClickHouse or equivalent)
  • Familiarity with Dataplex for data discovery, lineage, policy tagging, and data quality rule management
  • Understanding of Change Data Capture (CDC) patterns using Datastream for replicating transactional data into BigQuery
  • Understanding of bitemporal data modeling concepts (valid-time and system-time) and the challenges of implementing them within BigQuery's append-optimised design
  • Understanding of financial reference data — equities, fixed income identifiers, corporate actions, or index composition data
  • Familiarity with BigQuery cost management — slot reservations, query cost controls, and workload isolation using reservations and assignments
  • Exposure to CI/CD pipelines and infrastructure-as-code using Terraform for data platform deployments on GCP
  • Prior experience or projects involving LLMs and Agentic AI — particularly using Vertex AI for AI-assisted data quality, anomaly detection, semantic search, or natural language querying over structured datasets — is a strong plus
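
For readers unfamiliar with the bitemporal modelling mentioned above, the following is a minimal sketch of the idea in plain Python, not a description of the employer's actual implementation. It assumes an append-only table in which corrections are stored as new rows rather than updates; the record layout, the `PriceVersion` class, and the `as_of` helper are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional


# Hypothetical record layout: every field name here is illustrative only.
@dataclass(frozen=True)
class PriceVersion:
    security_id: str
    price: float
    valid_from: date   # valid time: start of the period the fact describes
    valid_to: date     # valid time: exclusive end of that period
    recorded_at: date  # system time: when this row was appended


def as_of(
    rows: Iterable[PriceVersion],
    security_id: str,
    valid_on: date,
    known_on: date,
) -> Optional[PriceVersion]:
    """Return the version valid on `valid_on` as it was known on `known_on`."""
    candidates = [
        r
        for r in rows
        if r.security_id == security_id
        and r.valid_from <= valid_on < r.valid_to
        and r.recorded_at <= known_on
    ]
    # Corrections are appended, never updated in place, so among the rows
    # already recorded by `known_on` the most recently recorded one wins.
    return max(candidates, key=lambda r: r.recorded_at, default=None)


rows = [
    PriceVersion("ABC", 100.0, date(2024, 1, 1), date(2024, 2, 1), date(2024, 1, 1)),
    # A later correction covering the same valid-time period.
    PriceVersion("ABC", 101.5, date(2024, 1, 1), date(2024, 2, 1), date(2024, 1, 10)),
]

before_fix = as_of(rows, "ABC", date(2024, 1, 15), known_on=date(2024, 1, 5))
after_fix = as_of(rows, "ABC", date(2024, 1, 15), known_on=date(2024, 1, 20))
```

Querying with an earlier `known_on` reproduces what a report would have shown before the correction arrived, which is the property an audit trail typically relies on. In a warehouse the same filter would usually be expressed in SQL over partitioned tables; that translation is left out here.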

Similar Jobs for Databricks Engineer - GCP Cloud

8 matching positions

Cloud Engineer - GCP & Databricks

As a Cloud Engineer, you are passionate about experience innovation and eager to...
Location: India, Bengaluru
Salary: Not provided
Company: Valtech
Expiration Date: Until further notice
Requirements
  • 6+ years of hands-on professional experience in cloud data engineering or GCP platform roles
  • Bachelor's degree in Computer Science, Engineering, Information Systems, or equivalent practical experience
  • BigQuery with advanced SQL, partitioning, clustering, and cost optimization
  • Cloud Storage, Cloud Functions, Cloud Run
  • Dataflow (Apache Beam) for batch and streaming pipelines
  • Cloud Composer / Airflow for orchestration
  • Pub/Sub for event-driven architectures
  • Vertex AI exposure for model serving or pipelines
  • IAM, VPC, organization policies, and security governance
  • Terraform for infrastructure as code
Job Responsibility
  • Design GCP-native architectures using BigQuery, Dataflow, Cloud Composer (Airflow), Pub/Sub, Cloud Storage, Vertex AI, and Cloud Run
  • Build and maintain batch and streaming data pipelines using medallion architecture (Bronze, Silver, Gold)
  • Implement infrastructure as code using Terraform
  • Manage deployments through CI/CD pipelines such as Cloud Build
  • Define and enforce GCP landing zone standards including IAM, VPC, Shared VPC, Private Service Connect, and organization policies
  • Build end-to-end Databricks Lakehouse solutions on GCP
  • Design Delta Lake tables with proper governance using Unity Catalog
  • Develop and optimise PySpark and SQL workloads for large-scale transformations
  • Configure Databricks clusters, job scheduling, autoscaling, and cost controls
  • Implement Databricks Workflows and Asset Bundles for orchestration and CI/CD
What we offer
  • Flexibility, with remote and hybrid work options (country-dependent)
  • Career advancement, with international mobility and professional development programs
  • Learning and development, with access to cutting-edge tools, training and industry experts
  • Fulltime

Databricks Engineer

We’re working with multiple organisations across Auckland and Wellington seeking...
Location: New Zealand, Wellington
Salary: Not provided
Company: 84 Recruitment Limited
Expiration Date: Until further notice
Requirements
  • Strong hands-on experience with Databricks
  • Proven background in ETL/ELT pipeline development
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Solid understanding of modern data architecture and best practices
Job Responsibility
  • Design, build, and maintain ETL/ELT pipelines on Databricks
  • Implement and optimise Medallion Architecture data layers
  • Ensure data quality, governance, and performance across pipelines
  • Collaborate with stakeholders to deliver scalable data solutions

Databricks Engineer

Databricks Engineer (Contract) – Auckland & Wellington & Christchurch. We’re wor...
Location: New Zealand, Christchurch
Salary: Not provided
Company: 84 Recruitment Limited
Expiration Date: Until further notice
Requirements
  • Strong hands-on experience with Databricks or similar cloud platform
  • Proven background in ETL/ELT pipeline development
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Solid understanding of modern data architecture and best practices
Job Responsibility
  • Design, build, and maintain ETL/ELT pipelines on Databricks
  • Implement and optimise Medallion Architecture data layers
  • Ensure data quality, governance, and performance across pipelines
  • Collaborate with stakeholders to deliver scalable data solutions

Databricks Engineer

Databricks Engineer (Contract) – Auckland & Wellington & Christchurch. We’re wor...
Location: New Zealand, Auckland; Wellington; Christchurch
Salary: Not provided
Company: 84 Recruitment Limited
Expiration Date: Until further notice
Requirements
  • Strong hands-on experience with Databricks or similar cloud platform
  • Proven background in ETL/ELT pipeline development
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Solid understanding of modern data architecture and best practices
Job Responsibility
  • Design, build, and maintain ETL/ELT pipelines on Databricks
  • Implement and optimise Medallion Architecture data layers
  • Ensure data quality, governance, and performance across pipelines
  • Collaborate with stakeholders to deliver scalable data solutions

Senior Cloud Engineer

The Senior Cloud Engineer will design, build, and support cloud infrastructure a...
Location: United States, Austin
Salary: Not provided
Company: Robert Half
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Information Technology, or similar field
  • 7+ years of experience in enterprise IT environments
  • 5+ years of experience with public cloud platforms (e.g., AWS, Azure, GCP)
  • 3+ years of experience with IaC languages (e.g., Terraform, JSON, Python, JavaScript)
  • Knowledge of cloud configuration (e.g., virtual networks, IAM, SQL databases, NoSQL, Kubernetes, Databricks)
  • Experience utilizing tools for ticketing, project management, and documentation (e.g., Jira, Confluence, Bitbucket)
  • Experience operating within a DevOps framework using CI/CD pipelines
  • Experience working within an Agile/Scrum environment
  • Proven ability to communicate effectively with technical and non-technical audiences
Job Responsibility
  • Build, update, and troubleshoot cloud infrastructure across large, complex environments
  • Develop and automate infrastructure using Terraform and other IaC tools
  • Create reusable, modular, and orchestrated code to deploy resources consistently and efficiently
  • Analyze technical requirements and propose meaningful, actionable solutions
  • Research and implement new tools, patterns, and cloud-native technologies
  • Continuously identify opportunities to improve processes, tooling, and technical approaches
What we offer
  • Healthcare (medical, dental, and vision plans)
  • 401(k) and retirement plans
  • Commuter benefits
  • Employee and vendor discounts
  • Employee Assistance Program (EAP)

Cloud Infrastructure Engineer

A lot of cloud roles are about scale. This one is about scale and trust. I'm wor...
Location: United States
Salary: 165000.00 - 250000.00 USD / Year
Company: Iceberg Cyber Security
Expiration Date: Until further notice
Requirements
  • Strong production cloud engineering experience (Azure / AWS / GCP)
  • Hands-on experience with CI/CD, containers, and IaC
  • Solid grounding in cloud security, IAM, and data protection
  • Experience running production systems with reliability in mind
  • Exposure to high-volume data or document-heavy environments
  • Comfortable working in regulated or security-conscious settings
  • 5+ years in production cloud infrastructure roles
  • Experience supporting AI or data-intensive systems in production
  • Strong understanding of cloud, security, and platform engineering fundamentals
  • Ability to operate independently in complex environments
Job Responsibility
  • Build and operate cloud infrastructure for AI-driven systems
  • Design CI/CD pipelines and containerized platforms (Docker, Kubernetes)
  • Own infrastructure as code (Terraform / Bicep)
  • Manage Databricks environments in Azure for large-scale processing
  • Implement monitoring, logging, and operational tooling
  • Support multi-cloud environments (Azure, AWS, GCP)
  • Help scale AI systems from prototype to production
  • Manage API usage, cost control, and governance across AI providers
  • Ensure secure handling of highly sensitive client data
  • Fulltime

Databricks Platform Engineer

Bright Vision Technologies is looking for a skilled Databricks Platform Engineer...
Location: United States, Bridgewater
Salary: Not provided
Company: Bright Vision Technologies
Expiration Date: Until further notice
Requirements
  • Databricks Lakehouse Platform
  • Apache Spark
  • Delta Lake
  • Databricks Workflows
  • Databricks SQL
  • Cloud Platforms (AWS / Azure / GCP)
  • Data Engineering Pipelines
  • ETL/ELT
  • Python
  • Scala
What we offer
  • H-1B sponsorship for the 2026 quota
  • Equal employment opportunity
  • Inclusive work environment
  • Fulltime

Senior Databricks Data Engineer

To develop, implement, and optimize complex Data Warehouse (DWH) and Data Lakeho...
Location: Romania, Bucharest
Salary: Not provided
Company: Inetum
Expiration Date: Until further notice
Requirements
  • Proven, expert-level experience with the entire Databricks ecosystem (Workspace, Cluster Management, Notebooks, Databricks SQL)
  • In-depth knowledge of Spark architecture (RDD, DataFrames, Spark SQL) and advanced optimization techniques
  • Expertise in implementing and managing Delta Lake (ACID properties, Time Travel, Merge, Optimize, Vacuum)
  • Advanced/expert-level proficiency in Python (with PySpark) and/or Scala (with Spark)
  • Advanced/expert-level skills in SQL and Data Modeling (Dimensional, 3NF, Data Vault)
  • Solid experience with a major Cloud platform (AWS, Azure, or GCP), especially with storage services (S3, ADLS Gen2, GCS) and networking
  • Bachelor’s degree in Computer Science, Engineering, Mathematics, or a relevant technical field
  • Minimum of 5 years of experience in Data Engineering, including at least 3 years working with Databricks and Spark at scale
Job Responsibility
  • Design and implement robust, scalable, and high-performance ETL/ELT data pipelines using PySpark/Scala and Databricks SQL on the Databricks platform
  • Implement and optimize the Medallion architecture (Bronze, Silver, Gold) using Delta Lake
  • Design and implement real-time/near-real-time data processing solutions using Spark Structured Streaming and Delta Live Tables (DLT)
  • Implement Unity Catalog for centralized data governance, fine-grained security (row/column-level security), and data lineage
  • Develop and manage complex workflows using Databricks Workflows (Jobs) or external tools (Azure Data Factory, Airflow) to automate pipelines
  • Integrate Databricks pipelines into CI/CD processes using tools like Git, Databricks Repos, and Bundles
  • Work closely with Data Scientists, Analysts, and Architects to deliver optimal technical solutions
  • Provide technical guidance and mentorship to junior developers
What we offer
  • Full access to foreign language learning platform
  • Personalized access to tech learning platforms
  • Tailored workshops and trainings to sustain your growth
  • Medical insurance
  • Meal tickets
  • Monthly budget to allocate on flexible benefit platform
  • Access to 7 Card services
  • Wellbeing activities and gatherings
  • Fulltime