This role involves enabling SRE support and monitoring for HPE Networking SASE products. Primary responsibilities include ensuring high availability of cloud-based applications, designing and maintaining scalable infrastructure, collaborating with development teams, and driving automation for deployment and monitoring. The candidate will also contribute to incident management and performance optimization.
Job Responsibilities:
Enable SRE support and monitoring for HPE Networking SASE products to ensure that applications run according to their requirements
Create strategies to detect and address issues, and design systems that troubleshoot automatically, using tools like Prometheus, Grafana, or Datadog
Ensure high availability and performance of cloud-based applications and services
Design, implement, and maintain scalable infrastructure using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation
Collaborate with development teams to improve application performance and reliability from design through production
Use data from monitoring tools to gain insights that improve the product's performance
Drive automation for deployment, monitoring, scaling, and incident response
Manage and optimize Kubernetes clusters and containerized applications
Define and implement SLOs/SLIs and continuously improve observability and monitoring practices
Lead and participate in incident management and root cause analysis to prevent recurrence
Requirements:
Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
4-7 years of overall experience in DevOps or SRE
5+ years programming experience in Python is a must
5+ years of experience in developing Cloud native applications using Kubernetes, Helm, or Docker container environments is a must
Expertise in automation and CI/CD pipeline tools like Terraform, Ansible, Jenkins, and/or Git is a must
Expertise in monitoring tools like Grafana, Datadog, or Prometheus is a must
Experience in developing, deploying, and maintaining applications for Public Cloud environments (AWS, Azure, GCP, etc.)
Knowledge of networking protocols and concepts such as routing, TCP/IP, BGP, OSPF/IS-IS, NetFlow, SNMP, and Internet Traffic Engineering techniques
Good written and verbal communication skills, along with the ability to explain complex procedures
A desire to constantly grow and learn new skills
Nice to have:
Cloud Architectures
Cross Domain Knowledge
Design Thinking
Development Fundamentals
DevOps
Distributed Computing
Microservices Fluency
Full Stack Development
Release Management
Security-First Mindset
User Experience (UX)
What we offer:
Comprehensive suite of benefits that supports physical, financial, and emotional wellbeing
Specific programs tailored to help you reach your career goals
Flexibility to manage work and personal needs
Inclusive environment that celebrates individual uniqueness