CrawlJobs Logo

Site Reliability Engineer

https://www.hpe.com/ Logo

Hewlett Packard Enterprise

Location Icon

Location:
India, Bangalore

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

This role involves enabling SRE support and monitoring for HPE Networking SASE products. Primary responsibilities include ensuring high availability of cloud-based applications, designing and maintaining scalable infrastructure, collaborating with development teams, and driving automation for deployment and monitoring. The candidate will also contribute to incident management and performance optimization.

Job Responsibility:

  • Enable SRE support and monitoring for HPE Networking SASE products to ensure that applications are running as per their requirements
  • Create strategies to detect issues, address those issues, and design systems to troubleshoot automatically using tools like Prometheus, Grafana, or Datadog
  • Ensure high availability and performance of cloud-based applications and services
  • Design, implement, and maintain scalable infrastructure using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
  • Collaborate with development teams to improve application performance and reliability from design through production
  • Gain insights from the data fetched from monitoring tools to enhance the product's performance
  • Drive automation for deployment, monitoring, scaling, and incident response
  • Manage and optimize Kubernetes clusters and containerized applications
  • Define and implement SLOs/SLIs and continuously improve observability and monitoring practices
  • Lead and participate in incident management and root cause analysis to prevent recurrence

Requirements:

  • Bachelor's or Master’s degree in Computer Science, Information Systems, or equivalent
  • 4-7 years of overall experience in DevOps or SRE
  • 5+ years programming experience in Python is a must
  • 5+ years of experience in developing Cloud native applications using Kubernetes, Helm, or Docker container environments is a must
  • Expertise in automation and CI-CD pipeline tools like Terraform, Ansible, Jenkins, and/or Git is a must
  • Expertise in monitoring tools like Grafana, Datadog, or Prometheus is a must
  • Experience in developing, deploying, and maintaining applications for Public Cloud environments (AWS, Azure, GCP, etc)
  • Knowledge of networking protocols and concepts such as routing, TCP/IP, BGP, OSPF/ISIS, NetFlow, SNMP, and Internet Traffic Engineering techniques
  • Good communication skills, written and verbal, along with ability to communicate complex procedures
  • A desire to constantly grow and learn new skills

Nice to have:

  • Cloud Architectures
  • Cross Domain Knowledge
  • Design Thinking
  • Development Fundamentals
  • DevOps
  • Distributed Computing
  • Microservices Fluency
  • Full Stack Development
  • Release Management
  • Security-First Mindset
  • User Experience (UX)
What we offer:
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Specific programs catered to helping you reach career goals
  • Flexibility to manage work and personal needs
  • Inclusive environment that celebrates individual uniqueness

Additional Information:

Job Posted:
June 01, 2025

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.