Cloud Reliability Engineer Job at Hewlett Packard Enterprise (Bangalore)

Job Description

This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office. Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Responsibility

Work on tools and technologies that involve monitoring, automating, and improving systems through software engineering principles applied to IT operations
Harness the power of data and metrics to make evidence-based improvements that enhance the way we operate COM
Build and maintain comprehensive metrics collection systems
Collaborate and partner with feature development partner teams on best practices to ensure we have global team visibility of our application's health, SLIs and SLOs
Use data to gain insight into the COM stack for the purpose of improving performance, reliability, and cost effectiveness
Build out robust documentation and runbook standards that our teams use to improve our incident response effectiveness
Implement and maintain security controls and practices to protect systems from unauthorized access and attacks

Requirements

Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
Typically 2-8 years’ experience
Development experience with Python, Go or Java (or C#, C++, C) or similar programming languages
Good understanding of REST APIs and the fundamentals of successful design and testing of a REST API
Have an enthusiastic, go-for-it attitude
Have an urge to collaborate and communicate asynchronously
Good understanding of distributed systems, event driven programming paradigms and designing for scale and performance
Ability to troubleshoot complex issues with curiosity, flexibility, creativity and a sense of ownership and accountability
Strong communication skills and ability to work in a distributed team
Highly desirable one or more of: Grafana, Prometheus, AWS, Kubernetes, Terraform

Nice to have

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Release Management, Security-First Mindset, User Experience (UX)

What we offer

Health & Wellbeing
Personal & Professional Development
Unconditional Inclusion

Hewlett Packard Enterprise - All Job Offers

Select Country

Cloud Reliability Engineer

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Cloud Reliability Engineer

Cloud Engineer / Site Reliability Engineer (SRE)

Senior Site Reliability Engineer Cloud Platform

Principal Site Reliability Engineer (Sovereign Cloud)

Sr Principal Site Reliability Engineer (Sovereign Cloud)

Sr Principal Site Reliability Engineer (Sovereign Cloud)

Principal Site Reliability Engineer (Sovereign Cloud)

Principal Site Reliability Engineer (Sovereign Cloud)

Senior Site Reliability Engineer (SRE) – Cloud & Distributed Systems

Our AI answers in your language