CrawlJobs Logo

Cloud Engineer II SRE

https://www.hpe.com/ Logo

Hewlett Packard Enterprise

Location Icon

Location:
India

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Cloud Engineer II SRE role at Hewlett Packard Enterprise, part of the 24X7 operations group managing applications, monitoring alerts, maintaining uptime, developing automated systems, leveraging CI/CD & Git Ops, patching security vulnerabilities, and managing public cloud infrastructure.

Job Responsibility:

  • Part of the 24X7 operations group (working in shifts) managing an application or multiple applications
  • Monitor & remediate alerts and maintain uptime
  • Develops and maintains automated systems to improve operational efficiency and ensure compliance with security policies
  • Executes automation and debugs issues as required
  • Leverage CI/CD & Git Ops for managing the application platform
  • Patching security vulnerabilities
  • Manage public cloud infrastructure
  • Shares and reviews innovative technical ideas with peers, high-level technical contributors, and managers
  • Analyses incidents / problems to develop and implement solutions to complex application problems, system administration issues, or network concerns

Requirements:

  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Master's desirable
  • 3-5 years' experience
  • Strong Experience in Ubuntu & K8s platforms
  • Experience in programming skills in Scripting / Python / Golang/ Ansible/ Terraform
  • Strong experience in DevOps practices like continuous integration/continuous deployment (CI/CD)
  • Knowledge on Git Ops model
  • Working experience in cloud platforms, especially AWS
  • Ability to quickly learn new skills and technologies
  • Strong system debugging skills
  • Knowledge on security-related activities like patching, CVE
  • Good written and verbal communication skills

Nice to have:

  • Cloud Architectures
  • Cross Domain Knowledge
  • Design Thinking
  • Development Fundamentals
  • DevOps
  • Distributed Computing
  • Microservices Fluency
  • Full Stack Development
  • Release Management
  • Security-First Mindset
  • User Experience (UX)
What we offer:
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion

Additional Information:

Job Posted:
November 04, 2025

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Cloud Engineer II SRE

Cloud Engineer II - SRE

Cloud Engineer II - SRE role at Hewlett Packard Enterprise, part of the 24X7 ope...
Location
Location
India
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Master's desirable
  • Typically 3-5 years' experience
  • Strong Experience in Ubuntu & K8s platforms
  • Experience in programming skills in Scripting/Python/Golang/Ansible/Terraform
  • Strong experience in DevOps practices like continuous integration/continuous deployment (CI/CD)
  • Knowledge on Git Ops model
  • Working experience in cloud platforms, especially AWS
  • Ability to quickly learn new skills and technologies
  • Strong system debugging skills
Job Responsibility
Job Responsibility
  • Part of the 24X7 operations group working in shifts managing an application or multiple applications
  • Monitor & remediate alerts and maintain uptime
  • Develops and maintains automated systems to improve operational efficiency and ensure compliance with security policies
  • Executes automation and debugs issues as required
  • Leverage CI/CD & Git Ops for managing the application platform
  • Patching security vulnerabilities
  • Manage public cloud infrastructure
  • Shares and reviews innovative technical ideas with peers, high-level technical contributors, and managers
  • Analyses incidents/problems to develop and implement solutions to complex application problems, system administration issues, or network concerns
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Career development programs
  • Fulltime
Read More
Arrow Right

Cloud Engineer II - SRE

Cloud Engineer II - SRE role at Hewlett Packard Enterprise, part of the 24X7 ope...
Location
Location
India
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Master's desirable
  • Typically 3-5 years' experience
  • Strong Experience in Ubuntu & K8s platforms
  • Experience in programming skills in Scripting / Python / Golang/ Ansible/ Terraform
  • Strong experience in DevOps practices like continuous integration/continuous deployment (CI/CD)
  • Knowledge on Git Ops model
  • Working experience in cloud platforms, especially AWS
  • Ability to quickly learn new skills and technologies
  • Strong system debugging skills
Job Responsibility
Job Responsibility
  • Part of the 24X7 operations group working in shifts managing an application or multiple applications
  • Monitor & remediate alerts and maintain uptime
  • Develops and maintains automated systems to improve operational efficiency and ensure compliance with security policies
  • Executes automation and debugs issues as required
  • Leverage CI/CD & Git Ops for managing the application platform
  • Patching security vulnerabilities
  • Manage public cloud infrastructure
  • Shares and reviews innovative technical ideas with peers
  • Analyses incidents / problems to develop and implement solutions to complex application problems
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Azure Cloud SRE - Security Specialist II

Quzara seeks a highly skilled Senior Azure Cloud SRE - Platform with a focus on ...
Location
Location
United States
Salary
Salary:
Not provided
quzara.com Logo
Quzara
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor of Science in Computer Science or related field
  • 7+ years of experience in Information Assurance, Cloud Infrastructure, and Security Operations
  • Hands-on expertise with Terraform for IaC, Chef for configuration management, Azure Image Builder for image creation, and Qualys for vulnerability management
  • Advanced certifications preferred: AZ-500 (Azure Security Engineer), MS-500 (Microsoft Security Administrator)
  • Strong scripting skills in PowerShell, Python, or similar languages
  • Proven experience leading technical teams and working in DevSecOps and cloud security environments
  • Strong communication skills
  • adept at collaborating with various stakeholders
  • Leadership or mentoring experience is advantageous
Job Responsibility
Job Responsibility
  • Infrastructure as Code (IaC) with Terraform: Design, implement, and maintain secure and scalable cloud infrastructure using Terraform to automate deployments and manage cloud resources effectively
  • Configuration Management with Chef: Manage and automate system configurations, ensuring consistent security baselines and compliance across environments using Chef
  • Image Management: Create, maintain, and deploy golden images using Azure Image Builder to standardize secure, up-to-date machine images across cloud environments
  • Security & Vulnerability Management: Leverage Qualys to continuously scan for vulnerabilities, track remediation efforts, and ensure compliance with security standards
  • Technical Leadership: Serve as the technical leader of the SRE team, mentoring team members, driving best practices in cloud security, and providing strategic direction on infrastructure initiatives
  • Team Collaboration & Guidance: Lead technical discussions, provide expertise on complex security challenges, and guide the team in implementing secure, scalable, and high-performing cloud solutions
  • Automation & Scripting: Develop automation scripts and workflows to improve security processes, configuration management, and cloud infrastructure deployments
  • Collaboration & Security Best Practices: Collaborate with cross-functional teams to integrate security into infrastructure, CI/CD pipelines, and daily operations, ensuring adherence to security policies and frameworks
What we offer
What we offer
  • Inclusive work environment committed to innovation and teamwork
  • Fulltime
Read More
Arrow Right

Senior Security Operations Engineer II

As a Senior Security Operations Engineer, you’ll play a key role in ensuring the...
Location
Location
United States , Scottsdale
Salary
Salary:
Not provided
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in operations, site reliability, or infrastructure engineering roles
  • Strong experience securing and managing cloud environments (e.g., AWS, Azure) and containerized workloads
  • Deep understanding of Linux systems, networking, distributed systems, and their associated security controls
  • Proficiency in automation, scripting, and security tooling integration to streamline operations and enforcement
  • Experience with security monitoring, alerting, SIEM platforms, and observability tools
  • Solid grasp of CI/CD practices with integrated security testing and compliance checks
  • Experience managing Kubernetes clusters and running containerized workloads in production
  • Experience with deploying and administrating any of the following: scalable cloud native secrets solutions such as AWS KMS, Azure KeyVault
  • PKI solutions such as EJBCA, Smallstep, Venafi
  • or vaulting solutions such as Hashicorp Vault
Job Responsibility
Job Responsibility
  • Implementing and improving automated security checks in CI/CD pipelines to prevent vulnerabilities from reaching production
  • Writing, reviewing, and maintaining security-focused infrastructure-as-code for scalable and compliant deployments
  • Investigating security incidents, performing root cause analysis, and implementing long-term mitigation strategies
  • Collaborating with developers to develop new features, services, and infrastructure requirements
  • Enhancing security observability through improved log collection, metrics, and alerting configurations
  • Maintaining and improving security runbooks, incident response playbooks, and internal security tooling for operational efficiency
  • Resolve security/infrastructure incidents by participating in high impact/high visibility incidents as a participant and ideally as an incident commander
  • Maintain and secure critical infrastructure components such as PKI (Public Key Infrastructure) and IAM ( Identity & Access Management) systems, ensuring reliability, scalability, and compliance with organizational and industry security standards
  • Build and maintain secure, reliable, and scalable infrastructure that protects core services and sensitive data
  • Troubleshoot and resolve complex operational and system-level issues across environments
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior Security Operations Engineer II

As a Senior Security Operations Engineer, you’ll play a key role in ensuring the...
Location
Location
United States , Scottsdale
Salary
Salary:
Not provided
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in operations, site reliability, or infrastructure engineering roles
  • Strong experience securing and managing cloud environments (e.g., AWS, Azure) and containerized workloads
  • Deep understanding of Linux systems, networking, distributed systems, and their associated security controls
  • Proficiency in automation, scripting, and security tooling integration to streamline operations and enforcement
  • Experience with security monitoring, alerting, SIEM platforms, and observability tools
  • Solid grasp of CI/CD practices with integrated security testing and compliance checks
  • Experience managing Kubernetes clusters and running containerized workloads in production
  • Experience with deploying and administrating any of the following: scalable cloud native secrets solutions such as AWS KMS, Azure KeyVault
  • PKI solutions such as EJBCA, Smallstep, Venafi
  • or vaulting solutions such as Hashicorp Vault
Job Responsibility
Job Responsibility
  • Implementing and improving automated security checks in CI/CD pipelines to prevent vulnerabilities from reaching production
  • Writing, reviewing, and maintaining security-focused infrastructure-as-code for scalable and compliant deployments
  • Investigating security incidents, performing root cause analysis, and implementing long-term mitigation strategies
  • Collaborating with developers to develop new features, services, and infrastructure requirements
  • Enhancing security observability through improved log collection, metrics, and alerting configurations
  • Maintaining and improving security runbooks, incident response playbooks, and internal security tooling for operational efficiency
  • Resolve security/infrastructure incidents by participating in high impact/high visibility incidents as a participant and ideally as an incident commander
  • Maintain and secure critical infrastructure components such as PKI (Public Key Infrastructure) and IAM ( Identity & Access Management) systems, ensuring reliability, scalability, and compliance with organizational and industry security standards
  • Build and maintain secure, reliable, and scalable infrastructure that protects core services and sensitive data
  • Troubleshoot and resolve complex operational and system-level issues across environments
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior Software Engineer II - Platform Infrastructure

We are seeking a Senior Software Engineer II to architect, build, and operate se...
Location
Location
Canada
Salary
Salary:
179200.00 - 210600.00 CAD / Year
confluent.io Logo
Confluent
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience in software engineering, SRE, or security engineering roles, with significant experience operating security platform services
  • Strong backend software development experience (Go, Java, Rust, Python)
  • Expertise with distributed systems, cloud infrastructure (AWS, GCP, Azure), Kubernetes, service mesh, and container orchestration
  • Strong understanding of security domains: IAM, OAuth2, OIDC, PKI, secrets management, policy engines, audit pipelines, zero trust architecture
  • Experience building highly reliable, observable, and resilient production systems
  • Operational expertise: SLOs, SLIs, error budgets, on-call leadership, incident management
  • Strong collaboration skills to drive alignment across engineering, security, and compliance stakeholders
  • Excellent communication skills with ability to influence technical and business leaders
  • BS, MS, or PhD in computer science or a related field, or equivalent work experience
Job Responsibility
Job Responsibility
  • Architect, design, and develop platform services with a strong focus on scalability, security, and developer experience
  • Lead operational design for reliability: build comprehensive observability, monitoring, and incident response automation into security-critical services
  • Build automation and tooling to drive self-healing systems, proactive risk detection, failure recovery, and continuous resilience testing
  • Collaborate with compliance, governance, and risk teams to translate regulatory and policy requirements into scalable technical controls
  • Lead technical design reviews, security architecture reviews, and incident postmortems for platform-level incidents
  • Mentor engineers across multiple disciplines on both security and operational best practices
  • Own end-to-end delivery of services: from initial design and development through deployment, production hardening, and lifecycle maintenance
What we offer
What we offer
  • Remote-First Work
  • Robust Insurance Benefits
  • Flexible Time Away
  • The Best Teammates
  • Experience Ambassadors
  • Open and Honest Culture
  • Well-Being and Growth
  • Offers Equity
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer II

We are seeking a skilled Site Reliability Engineer (SRE) to join our team and he...
Location
Location
United States , Alpharetta
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in an SRE, DevOps, or Cloud Infrastructure role
  • Strong hands-on experience with Microsoft Azure
  • Infrastructure-as-Code experience using Terraform and Terragrunt
  • Experience designing and managing cloud-native environments
  • Proficiency with Kubernetes (preferably AKS)
  • Experience supporting containerized workloads and orchestration patterns
  • Exposure to Databricks environments is required
  • Experience with GitHub Actions / GitHub Workflows
  • Hands-on experience with ArgoCD and GitOps-based deployment strategies
  • Solid understanding of Grafana
Job Responsibility
Job Responsibility
  • Design, implement, and manage Azure cloud infrastructure using Terraform and Terragrunt
  • Maintain, operate, and optimize Kubernetes clusters on Azure Kubernetes Service (AKS)
  • Build and manage CI/CD pipelines using GitHub Actions / GitHub Workflows
  • Implement GitOps-based deployments using ArgoCD
  • Enhance system reliability by implementing monitoring, alerting, and observability solutions using Grafana
  • Automate operational tasks to reduce toil and improve team efficiency
  • Participate in on-call rotations, incident response, root cause analysis, and post-mortems
  • Partner with development teams to improve application performance, scalability, and resilience
  • Implement and promote SRE best practices, including: Service Level Indicators (SLIs)
  • Service Level Objectives (SLOs)
What we offer
What we offer
  • medical, vision, dental, and life and disability insurance
  • eligible to enroll in our company 401(k) plan
Read More
Arrow Right

Site Reliability Engineer II

Site Reliability Engineer II - (Microsoft 365 Enterprise + Cloud). We are lookin...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Mid-level years of software development: automation-related experience is most valued
  • Scripting languages such as bash, python, and PowerShell, or compiled languages such as C, C# are most relevant, but others are acceptable
  • Awareness of, and ability to reason about, modern software & systems architectures, including load-balancing, queueing, caching, distributed systems failure modes, microservices, and so on
  • Associated troubleshooting skills, including the ability to follow RPC (Remote Procedure Call) call-chains across arbitrary network steps
  • Consequent understanding of monitoring in distributed systems
  • Deep understanding of operating system level concepts such as processes, memory allocation, and the network stack
  • understanding of how applications are affected by the above, and ability to debug same
  • Experience with working in a team, including coordinating large projects, communicating well, and exercising initiative when presented with problems
  • Practical experience running large scale online systems is always an advantage
Job Responsibility
Job Responsibility
  • Researches and maintains deep knowledge of industry trends as well as advances in large-scale distributed systems and cloud technologies
  • identifies opportunities to create, implement, and/or optimally utilize new tools, technologies, and/or processes to solve ambiguous problems and improve product availability, reliability, efficiency, observability, and/or performance
  • Drives the adoption of innovative solutions across engineering teams working with related products within an organization
  • Apply advanced statistical and machine learning techniques to analyze large datasets and extract meaningful insights
  • Experience working with all service aspects of high throughput and multi-tenant services, ability to understand and design workflows carefully, properly handle errors, write clean and well-factored code with good tests and good maintainability
  • Engages with product engineering teams by partaking in code/design reviews, participating in on-call rotations and incident responses throughout product development and operations cycles
  • leverages end-to-end technical expertise on underlying systems/platforms and insights from engagements with product engineering teams and telemetry analyses to propose scalable improvements in code and designs with attention to customer/business objectives and incident prevention
  • Develops code, scripts, systems, or platforms that automate moderately complex but repetitive operations processes (e.g., monitoring, alerting, deploying products and updates, debugging) at scale
  • reviews existing automation code and scripts to evaluate reusability, extendibility, and scalability within an organization
  • Analyzes data from telemetry pipelines and monitoring tools that detail operations metrics (e.g., availability, reliability, performance, efficiency) of systems, platforms, or products operating at scale
  • Fulltime
Read More
Arrow Right