Kubernetes Escalations Engineer

Hewlett Packard Enterprise

Location:
India, Bangalore

Contract Type:
Not provided

Salary:
Not provided

Job Description:

In the HPE Hybrid Cloud, we lead the innovation agenda and technology roadmap for all of HPE. This includes managing the design, development, and support of our next-generation turnkey Private Cloud for Artificial Intelligence (PCAI) platform. Working with customers, we help them reimagine their information technology needs and deliver a simple, consumable solution that drives their business results.

Job Responsibility:

  • Guide and support both internal and external customers with PCAI platform solutions and workloads
  • Apply in-depth professional knowledge and innovative ideas to solve complex problems
  • Provide technical leadership for significant project/program work
  • Lead or participate in cross-functional initiatives and contribute to mentorship and knowledge sharing across the organization
  • Provide front-line escalation support for the PCAI core software stack, Artificial Intelligence Enterprise (AIE)
  • Collaborate with core engineering, COE, and support counterparts, as well as development partners
  • Support co-development and co-creation of joint solutions with partners like Deloitte, Accenture and other GSIs

Requirements:

  • Bachelor's or master's degree in computer science, engineering, information systems, or a closely related quantitative discipline
  • Typically 10-15 years' experience
  • Strong Kubernetes skills, with any or all of the CKA, CKAD, and CKS certifications
  • Strong knowledge of platforms for containerized workloads, such as RHEL and Rocky Linux
  • Strong knowledge of the modern analytics stack and tools like Apache Spark, Jupyter Notebooks, TensorFlow, and RStudio
  • Strong knowledge of data scientists' and data engineers' workloads, workflows, and related tools
  • Desired programming skills in Python, Java, Golang
  • Strong Linux sysadmin skills
  • TCP/IP skills (must be able to troubleshoot using tcpdump and Wireshark)
  • Proficient in cloud-based security concepts such as identity and access management, firewalls, VPNs, and intrusion prevention systems (IPS)
  • Excellent written and verbal communication skills.
What we offer:
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment

Additional Information:

Job Posted:
January 10, 2026

Work Type:
Hybrid work

Similar Jobs for Kubernetes Escalations Engineer

Dev Escalation Engineer

Dev Escalation Engineer role at Aruba (HPE Company), a leading provider of next-...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Typically 4-7 years' experience
  • Strong programming skills in Python, Java, Golang, or JavaScript
  • Good understanding of distributed systems, event-driven programming paradigms, and designing for scale and performance
  • Experience with cloud-native applications, developer tools, managed services, and next-generation databases
  • Knowledge of DevOps practices like CI/CD, infrastructure as code, containerization, and orchestration using Kubernetes
  • Good written and verbal communication skills, and agility in a changing environment
Job Responsibility:
  • Analyse feature specifications and determine required coding, testing, and integration activities
  • Design and develop moderate to complex cloud application modules per feature specifications adhering to security policies
  • Identify, debug, and create solutions for issues with code and its integration into the application architecture
  • Develop and execute comprehensive test plans for features adhering to performance, scale, usability, and security requirements
  • Deploy cloud-based systems and applications code using continuous integration/deployment (CI/CD) pipelines to automate cloud applications' management, scaling, and deployment
  • Contribute towards innovation and integration of new technologies into projects
  • Analyze science, engineering, business, and other data processing problems to develop and implement solutions to complex application problems, system administration issues, or network concerns
What we offer:
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime

Solutions Engineering Lead

We are hiring a Solutions Engineering Team Lead for the East region to scale and...
Location:
United States, Boston
Salary:
220000.00 - 300000.00 USD / Year
Coralogix
Expiration Date:
Until further notice
Requirements:
  • 7+ years in customer-facing technical roles (Sales Engineering, Solutions Architecture or similar)
  • 3+ years leading or managing pre-sales technical teams with a record of coaching success
  • Experience supporting or owning team-level quotas within a sales organization
  • Hands-on expertise with the following: Kibana, Grafana, Datadog, New Relic, Splunk, Honeycomb, Jaeger, OpenSearch
  • Proficiency crafting PromQL, Lucene and SQL queries for troubleshooting and dashboards
  • Deep knowledge of cloud services central to observability: AWS (EKS, Fargate, Lambda, CloudFormation, CloudWatch Logs and Metrics), Azure Monitor, and equivalents in Google Operations Suite
  • Working knowledge of OpenTelemetry, modern DevOps and container platforms (Kubernetes, Docker)
  • Strong ability to communicate with engineers and C-level audiences alike
Job Responsibility:
  • Own regional SE performance in partnership with Account Executives, ensuring quota attainment and deal velocity
  • Hire, onboard and mentor Solutions Engineers, setting clear KPIs and career paths
  • Maintain a strong personal presence with customers, modeling technical excellence and closing strategic opportunities
  • Improve processes for discovery, POC execution, documentation and knowledge sharing
  • Collaborate with Product, Support and Customer Success to shorten feedback loops and accelerate adoption
  • Architect and deploy reference designs for logs, metrics, traces, SIEM and Kubernetes monitoring across AWS, Azure and GCP
  • Lead white-board deep-dive sessions on ingestion pipelines, index-free querying and cost-optimized retention strategies
  • Provide escalation support during POCs: troubleshoot complex issues, analyze logs and traces, craft PromQL, Lucene or Dataprime queries, and isolate root causes
  • Track technical success metrics such as POC win rate, onboarding time-to-value and validation scorecards, converting data insights into process improvements
  • Contribute code or scripts (Python, Go or Java) for custom exporters, automation and synthetic monitoring
What we offer:
  • Comprehensive and inclusive employee benefits for healthcare, dental, and mental health
  • 401(k) plan and match
  • Paid sick time and paid time off
  • Fulltime

Senior Production Engineer - Application Support Lead - Futures Engineering

Senior Application Support Lead to oversee the support operations for our enterp...
Location:
United States, Chicago
Salary:
155000.00 - 185000.00 USD / Year
Clear Street
Expiration Date:
Until further notice
Requirements:
  • 5–8 years of experience in application support
  • At least 2 years in a leadership or senior technical role, ideally in financial services or fintech
  • Knowledge of Java and ReactJS, with experience debugging and analyzing application logs
  • Hands-on experience with Kubernetes and Docker for deployment troubleshooting
  • Familiarity with monitoring tools (e.g., Datadog) and services such as PagerDuty
  • Experience with ticketing systems (e.g., Jira)
  • Deep understanding of cleared derivatives, futures, or back-office operations in financial markets
  • Proven ability to lead and motivate a support team
  • Strong decision-making and problem-solving skills in high-pressure environments
  • Excellent communication and interpersonal skills
Job Responsibility:
  • Provide advanced troubleshooting for complex application issues, including Java/ReactJS code-level analysis, database queries, and Kubernetes/Docker environment diagnostics
  • Manage a team of application support analysts, providing mentorship, training, and performance evaluations
  • Oversee the triage, prioritization, and resolution of support tickets, ensuring SLAs are met
  • Lead complex configuration tasks, such as system integrations and custom module deployments
  • Act as the primary point of escalation for major incidents, coordinating with infrastructure, development, and client teams
  • Develop and implement support processes, including automated monitoring, knowledge base enhancements, and proactive issue detection
  • Liaise with clients, product managers, and senior leadership to provide updates on support metrics, system performance, and improvement initiatives
  • Utilize advanced monitoring tools to proactively identify performance bottlenecks and coordinate with DevOps to optimize Kubernetes/Docker deployments
  • Create and maintain comprehensive technical documentation and deliver training to support staff and end-users
  • Contribute to the roadmap for support operations, aligning with business goals and client needs
What we offer:
  • Competitive compensation packages
  • Company equity
  • 401k matching
  • Gender neutral parental leave
  • Full medical, dental and vision insurance
  • Lunch stipends
  • Fully stocked kitchens
  • Happy hours
  • Fulltime

Solutions Engineering Lead

Coralogix is a modern full-stack observability platform that transforms how busi...
Location:
United States, Dallas
Salary:
Not provided
Coralogix
Expiration Date:
Until further notice
Requirements:
  • 7+ years in customer-facing technical roles (Sales Engineering, Solutions Architecture or similar)
  • 3+ years leading or managing pre-sales technical teams with a record of coaching success
  • Experience supporting or owning team-level quotas within a sales organization
  • Hands-on expertise with the following: Kibana, Grafana, Datadog, New Relic, Splunk, Honeycomb, Jaeger, OpenSearch
  • Proficiency crafting PromQL, Lucene and SQL queries for troubleshooting and dashboards
  • Deep knowledge of cloud services central to observability: AWS (EKS, Fargate, Lambda, CloudFormation, CloudWatch Logs and Metrics), Azure Monitor, and equivalents in Google Operations Suite
  • Working knowledge of OpenTelemetry, modern DevOps and container platforms (Kubernetes, Docker)
  • Strong ability to communicate with engineers and C-level audiences alike
  • Familiarity with structured sales methodologies such as MEDDPICC or Command of the Message (a plus)
Job Responsibility:
  • Own regional SE performance in partnership with Account Executives, ensuring quota attainment and deal velocity
  • Hire, onboard and mentor Solutions Engineers, setting clear KPIs and career paths
  • Maintain a strong personal presence with customers, modeling technical excellence and closing strategic opportunities
  • Improve processes for discovery, POC execution, documentation and knowledge sharing
  • Collaborate with Product, Support and Customer Success to shorten feedback loops and accelerate adoption
  • Architect and deploy reference designs for logs, metrics, traces, SIEM and Kubernetes monitoring across AWS, Azure and GCP
  • Lead white-board deep-dive sessions on ingestion pipelines, index-free querying and cost-optimized retention strategies
  • Provide escalation support during POCs: troubleshoot complex issues, analyze logs and traces, craft PromQL, Lucene or Dataprime queries, and isolate root causes
  • Track technical success metrics such as POC win rate, onboarding time-to-value and validation scorecards, converting data insights into process improvements
  • Contribute code or scripts (Python, Go or Java) for custom exporters, automation and synthetic monitoring
What we offer:
  • Comprehensive and inclusive employee benefits for healthcare, dental, and mental health
  • A 401(k) plan and match
  • Paid sick time
  • Paid time off
  • Fulltime

HPC SW Cloud Engineer

HPC SW Cloud Engineer role focused on designing, implementing, and maintaining H...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years' experience
  • Linux OS
  • Golang programming
  • Python programming
  • Docker container engine
  • Podman container engine
  • Kubernetes container orchestration
  • GitHub version control
  • GitLab version control
Job Responsibility:
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage issues, which requires a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve complex internal and customer-reported issues
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
  • Implement and enforce security best practices within Kubernetes-hosted software environments
  • Ensure compliance with industry standards and regulations related to cloud infrastructure
What we offer:
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime

HPC SW Cloud Engineer

HPC SW Cloud Engineer role focused on designing, implementing, and maintaining H...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years' experience
  • Linux OS
  • Golang programming
  • Python programming
  • Docker container engine
  • Podman container engine
  • Kubernetes container orchestration
  • GitHub version control
  • GitLab version control
Job Responsibility:
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage issues, which requires a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve complex internal and customer-reported issues
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
  • Implement and enforce security best practices within Kubernetes-hosted software environments
  • Ensure compliance with industry standards and regulations related to cloud infrastructure
What we offer:
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime

DevOps Engineer

Join our international and innovative Infrastructure-DevOps Team that tackles re...
Location:
North Macedonia, Skopje
Salary:
Not provided
Hornetsecurity
Expiration Date:
Until further notice
Requirements:
  • Experience as a DevOps engineer, SRE, systems engineer, or in a related role
  • Ideally a BS degree in computer science, engineering, or a related subject
  • Eagerness to learn new technologies and improve existing processes
  • Basic knowledge of Linux, Kubernetes, Helm charts and ArgoCD
  • Experience with Bash scripting and Ansible
Job Responsibility:
  • End-to-end deployment of products, including setting up infrastructure from scratch
  • Help improve the performance and scalability of our infrastructure
  • Actively contribute to the next generation of fully automated infrastructure and the industrialization of our organization
  • Handle escalated incidents and IT operational issues, providing technical support to internal teams via ticketing systems, on-call rotations, and incident response
What we offer:
  • Room for innovation and autonomy
  • Temporary Employee Exchange Program to work at global office locations
  • Flexible working hours and the option to work from home
  • Permanent contracts
  • Team events like Laser Tag, Office Movie Nights, Foodie Fridays
  • FitKit subscription and private insurance
  • Referral Bonus of 1500€ for each successful referral
  • Fulltime

Senior Site Reliability Engineer

Affirm is reinventing credit to make it more honest and friendly, giving consume...
Location:
Spain
Salary:
85000.00 - 115000.00 EUR / Year
Affirm
Expiration Date:
Until further notice
Requirements:
  • 4+ years of experience designing, developing and launching backend systems at scale using scripting and development languages like Bash, Python or Kotlin
  • A track record of developing highly available distributed systems using technologies like AWS, MySQL and Kubernetes
  • Meaningful experience contributing to or driving parts of the incident lifecycle process, enabling actionable insights that improve the quality culture, reliability, resilience, and system performance
  • 4+ years working in a Site Reliability or Production Engineering team
  • Experience defining a technical plan for the delivery of a significant feature or system component with an elegant, simple and extensible design
  • Experience making impactful changes in a large code base, having developed a suite of tools and practices that enable you and your team to do so safely
  • Strong verbal and written communication skills that support effective collaboration with our global engineering team
  • On-call rotation: participation in an on-call rotation is a requirement for this role
Job Responsibility:
  • You will be responsible for owning and delivering quarterly goals for your team, leading engineers on your team through ambiguity to solve open-ended problems, and ensuring that everyone is supported throughout delivery
  • You will support your peers and stakeholders in the product development lifecycle by collaborating with infrastructure, product management, developer experience & analytics by participating in ideation, articulating technical constraints, and partnering on decisions that properly consider risks and trade-offs
  • You will proactively identify technical solutions and operational processes that strengthen incident readiness, response, and post-incident analysis
  • You will support the operations and availability of your team’s artifacts by creating and monitoring metrics, escalating when needed, and supporting “keep the lights on” & on-call efforts
  • You will foster a culture of quality and ownership on your team by setting or improving code review and design standards for your team, and advocating for them beyond your team through your writing and tech talks
  • You will help develop talent on your team by providing feedback and guidance, and leading by example
What we offer:
  • Flexible Spending Wallets for tech, food and lifestyle
  • Away Days - wellness days to take off work and recharge
  • Learning & Development programs
  • Parental benefit
  • Employee Resource & Community Groups
  • Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
  • Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
  • Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
  • ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
  • Fulltime