This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At Cloudera, our Data Services Pillar is the heart of data innovation. We don’t just work with technology; we build it. Our mission is to empower data practitioners by creating seamless, enterprise-grade experiences for data engineering, warehousing, streaming, operational databases, and AI. Cloudera is looking for an exceptional and passionate software engineer to join the Data Warehouse engineering team. The technology stack includes popular open source query engines - Apache Hive, Impala, Trino, and table formats like Apache Iceberg. Thus, there is ample scope for collaboration and contribution to open source. This is an exciting opportunity to work on products that handle complex SQL query workloads on public or private clouds as part of the Cloudera Data Platform (CDP).
Job Responsibility:
Work on large-scale, distributed systems to help drive Hive innovation and build additional components around it to enhance the Hive ecosystem
Have an exciting opportunity to work on products that handle complex SQL query workloads on public or private clouds as part of Cloudera Data Platform (CDP)
Design and develop features for parallel and distributed query engines to help drive innovation in CDP
Focus on query optimization, performance and scalability of SQL queries
Write design documentation for key features and capabilities
Improve code quality through writing tests, automation, and code reviews
Understand the customer’s workload and provide effective technical solutions
Requirements:
Bachelor’s or Master’s degree in Computer Science or equivalent, and 6 years of experience
Experience with query optimization using tools like Apache Calcite
Clean coding habits, attention to detail, and a focus on quality
Hands-on programmer with strong data structures and algorithms skills. Java experience is desired
Good understanding of database internals, query processing and SQL query optimization
Strong oral and written communication skills
Ability to work effectively on cross-functional projects
Nice to have:
Experience with contributing to any of the open-source Apache projects like Hive, Impala, Calcite or an RDBMS
Experience with the Hadoop ecosystem and file formats like Parquet, ORC
Experience with public cloud infrastructures such as Microsoft Azure, Amazon Web Services and Google Cloud Platform