At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone. The Storage team is responsible for primary storage and storage access layers, which are core to the platform. Its members created and wrote most of the HDFS code and made a huge impact on the big data and cloud computing industry.
Apache Ozone provides a massively scalable distributed object store with a distributed file system interface. Ozone is designed to scale to tens of billions of files and blocks and to overcome the limitations of the Hadoop Distributed File System (HDFS), namely handling millions of small files and managing a huge number of datanodes. Ozone is one of the fastest-growing products inside CDP in terms of customer adoption and expansion revenue.
Job Responsibilities:
Directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation)
Regularly contribute code and design docs to the Apache open-source community
Support enterprise customers running big data analytics and ML/AI pipelines at a scale of hundreds of petabytes
Partner with product managers and cross-functional teams as a part of the Cloudera Data Platform ecosystem in understanding requirements and turning them into a solid design and implementation, and facilitating integration and adoption
Responsible for leading and collaborating with a talented group of engineers working on a feature and mentoring junior engineers
Requirements:
Bachelor's degree with 6+ years, or Master's degree with 4-6 years, of relevant industry experience required
Strong backend engineering skill set with expertise in Java, or strong C++ skills with intermediate Java expertise
Passionate about programming. Clean coding habits, attention to detail, and focus on quality
Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability
Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems
Hands-on programmer with strong data structures and algorithms skillset
Strong oral and written communication skills
Nice to have:
Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables
Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations
Strong understanding of the Apache Big Data ecosystem and 3+ years of experience in systems software, including file systems
Recognized contributions to open source projects
Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus
Good understanding of storage development, the RAFT replication framework, or equivalent distributed consensus frameworks