This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a highly skilled Senior Databricks Engineer to contribute to the engineering, modernization, and continuous evolution of data processing platform on Databricks on AWS. While supporting the transition from the legacy Cloudera Hadoop platform to Databricks on AWS, this role will continue to play a key part in enhancing performance, simplifying pipelines, and delivering new capabilities on the Databricks platform over the long term. The ideal candidate is a strong hands‑on Spark engineer with solid design experience, capable of contributing to architectural decisions while leading complex implementation and optimization efforts.
Job Responsibility
Platform Engineering & Modernization: Refactor and modernize existing Spark pipelines to Databricks native architectures
Eliminate legacy Hadoop dependencies and adopt cloud native AWS patterns
Enhance and extend existing processing logic using optimized Spark (JavaSpark / PySpark) on Databricks
Databricks Native Development: Build and optimize solutions using Databricks features, including Delta Lake, Databricks Workflows for orchestration and Auto scaling and job clusters
Design & Solution Engineering: Contribute to low and mid level architecture and design
Translate high level architecture into detailed technical designs
Define data models, pipeline patterns, and reusable components
Ensure solutions are scalable, maintainable, and production ready
Performance Optimization & Simplification: Analyze, improve Spark job performance and simplify complex or over engineered pipelines into standardized, efficient patterns
Engineering Standards & Best Practices: Follow and contribute to Databricks and Spark engineering standards
Write clean, modular, and testable code
Contribute to shared frameworks, reusable libraries, and quality standards
Collaboration & Stakeholder Engagement: Work closely with senior architects, platform teams, and DevOps engineers
Provide technical inputs, troubleshooting support, and implementation guidance
Participate in design discussions and technical decision making
Testing & Quality Assurance: Develop unit, integration, and data validation tests
Support production releases and post deployment validation
Requirements
10+ years in data engineering or distributed systems
Strong expertise in Apache Spark (JavaSpark / PySpark), Databricks on AWS, and Delta Lake
Experience with AWS services and large‑scale distributed data processing
Experience modernizing or refactoring legacy data platforms into cloud‑based architectures
Strong background in Spark performance tuning and large‑scale batch optimization
Ability to translate architecture into implementable designs
Understanding of data modeling and pipeline orchestration patterns
Strong problem‑solving mindset for complex distributed systems
Comfortable working in time‑bound, high‑impact environments
Proactive, accountable, and collaborative
Clear communication skills across global teams
Bachelor’s degree/University degree or equivalent experience