HPC SW Cloud Engineer

India, Bangalore · Job Posted September 13, 2025

Job Description

HPC SW Cloud Engineer role focused on designing, implementing, and maintaining the HPC CSM manageability platform hosted on Kubernetes infrastructure. The position requires expertise in cloud-native technologies, Kubernetes, automation, and DevOps practices, along with a good understanding of security for cloud-native applications.

Job Responsibility

  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage complex internal and customer-reported issues, drawing on a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve them
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
  • Implement and enforce security best practices within Kubernetes-hosted software environments
  • Ensure compliance with industry standards and regulations related to cloud infrastructure
  • Provide escalated support for complex technical issues
  • Conduct root cause analysis for incidents and implement preventive measures
  • Mentor junior team members and actively participate in knowledge-sharing activities

Requirements

  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of experience
  • OS: Linux
  • Programming: Go (Golang), Python
  • Container engines: Docker, Podman
  • Container orchestration: Kubernetes
  • Version control: GitHub, GitLab
  • Declarative: Ansible, YAML, HCL
  • Package managers: Helm, RPM
  • CI/CD: Jenkins, GitHub Actions
  • Cloud Architectures knowledge
  • Cross Domain Knowledge
  • Design Thinking
  • Development Fundamentals
  • DevOps
  • Distributed Computing
  • Microservices Fluency
  • Full Stack Development
  • Security-First Mindset
  • Solutions Design
  • Testing & Automation
  • User Experience (UX)

What we offer

  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment

Similar Jobs for HPC SW Cloud Engineer

8 matching positions

HPC SW Cloud Engineer

HPC SW Cloud Engineer role focused on designing, implementing, and maintaining H...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of experience
  • Linux OS
  • Go Lang programming
  • Python programming
  • Docker container engine
  • Podman container engine
  • Kubernetes container orchestration
  • GitHub version control
  • GitLab version control
Job Responsibility
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage complex internal and customer-reported issues, drawing on a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve them
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
  • Implement and enforce security best practices within Kubernetes-hosted software environments
  • Ensure compliance with industry standards and regulations related to cloud infrastructure
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime

HPC SW Cloud QA Engineer

HPC SW Cloud QA Engineer role focused on designing, implementing, and maintainin...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Strong understanding of Linux (RHEL, SLES, Ubuntu) system administration
  • Experience with Kubernetes, containers (Docker/Podman), and networking fundamentals
  • Proficiency in scripting languages (Python, Bash) for automation
  • Familiarity with HPC architectures, job schedulers (Slurm, PBS Pro), and workload management concepts
  • Experience with test automation frameworks (e.g., pytest, Robot Framework, Jenkins CI/CD)
  • Hands-on experience in system-level testing, API testing, and performance validation
  • Familiarity with Git, Jira, Confluence, and defect tracking workflows
  • Experience with monitoring and log analysis tools (Grafana, Prometheus, ELK stack) is a plus
  • Good understanding of security on Cloud Native applications
Job Responsibility
  • Design, implement, and execute comprehensive test plans for the CSM platform
  • Develop automated test suites using Python, Bash, and CI/CD frameworks
  • Integrate automated testing into the development pipeline
  • Identify, document, and track defects
  • Provide clear, reproducible test cases and logs
  • Perform stress testing and scale testing on large HPC clusters
  • Monitor and analyze system metrics to assess stability under load
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime

HPC SW Cloud Network Engineer - Expert

High Performance Computing, AI and Labs is a critical element of HPE. We are foc...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • 10+ years of experience
  • BE/B.Tech in CS or equivalent degree
  • OS: Linux
  • CNI: Cilium, Weave
  • Debugging: DNS, CNI Troubleshooting
  • Programming: Go Lang, Python
  • Container Engines: Docker, Podman
  • Container Orchestration: Kubernetes
  • Version Control: GitHub, GitLab
  • Declarative: Ansible, YAML, HCL
Job Responsibility
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Core networking skills across the OSI model and TCP/IP stack
  • Kubernetes networking, covering CNI, ingress and egress, and security
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage complex internal and customer-reported issues, drawing on a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve them
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime

HPC SW Cloud Network Engineer

We are looking for an experienced cloud development engineer to work on our HPC ...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of experience
  • OS: Linux
  • CNI: Cilium, Weave
  • Debugging: DNS, CNI Troubleshooting
  • Programming: Go Lang, Python
  • Container Engines: Docker, Podman
  • Container Orchestration: Kubernetes
  • Version Control: GitHub, GitLab
  • Declarative: Ansible, YAML, HCL
Job Responsibility
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure as code methodologies to automate the provisioning and management of cloud resources
  • Core networking skills across the OSI model and TCP/IP stack
  • Kubernetes networking, covering CNI, ingress and egress, and security
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage complex internal and customer-reported issues, drawing on a strong blend of technical depth, investigative skills, and cross-team coordination to quickly assess, prioritize, and resolve them
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime

HPC SW Cloud QA Engineer

We are looking for an experienced cloud development engineer to work on our HPC ...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree preferred, or an Associate degree in a technical field with 8-12 years of working experience in related fields
  • Strong understanding of Linux (RHEL, SLES, Ubuntu) system administration
  • Experience with Kubernetes, containers (Docker/Podman), and networking fundamentals
  • Proficiency in scripting languages (Python, Bash) for automation
  • Familiarity with HPC architectures, job schedulers (Slurm, PBS Pro), and workload management concepts
  • Experience with test automation frameworks (e.g., pytest, Robot Framework, Jenkins CI/CD)
  • Hands-on experience in system-level testing, API testing, and performance validation
  • Familiarity with Git, Jira, Confluence, and defect tracking workflows
  • Experience with monitoring and log analysis tools (Grafana, Prometheus, ELK stack) is a plus
Job Responsibility
  • Design, implement, and execute comprehensive test plans for the CSM platform, including functional, regression, integration, and performance testing
  • Validate HPC system management capabilities such as node provisioning, monitoring, workload orchestration, and system upgrades
  • Develop automated test suites using Python, Bash, and CI/CD frameworks to ensure rapid and repeatable test execution
  • Integrate automated testing into the development pipeline to support continuous delivery
  • Identify, document, and track defects, and work with engineering teams to resolve issues
  • Perform stress testing and scale testing on large HPC clusters
  • Monitor and analyze system metrics to assess stability under load
What we offer
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Specific programs catered to helping reach career goals
  • Unconditional inclusion
  • Flexibility to manage work and personal needs
  • Fulltime

HPC SW Cloud Network Engineer

High Performance Computing, AI and Labs focuses on delivering innovative solutio...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of experience
  • OS: Linux
  • CNI: Cilium, Weave
  • Debugging: DNS, CNI Troubleshooting
  • Programming: Go Lang, Python
  • Container Engines: Docker, Podman
  • Container Orchestration: Kubernetes
  • Version Control: GitHub, GitLab
  • Declarative: Ansible, YAML, HCL
Job Responsibility
  • Design and implement Kubernetes-hosted microservices to support scalable and resilient cloud-based applications
  • Implement infrastructure-as-code methodologies to automate the provisioning and management of cloud resources
  • Core networking skills across the OSI model and TCP/IP stack
  • Kubernetes networking, covering CNI, ingress and egress, and security
  • Utilize tools such as Terraform or Ansible for declarative infrastructure definition
  • Collaborate with cross-functional teams to define and implement best practices for cloud-based services
  • Triage complex internal and customer-reported issues
  • Expertise in container orchestration using Kubernetes, including deploying, scaling, and managing containerized applications
  • Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance processes
  • Implement CI/CD pipelines to facilitate continuous integration and delivery
What we offer
  • Health & Wellbeing: comprehensive suite of benefits supporting physical, financial, and emotional wellbeing
  • Personal & Professional Development: programs to achieve career goals
  • Unconditional Inclusion: embracing individual uniqueness and background flexibility
  • Fulltime

Principal AI Factory Solution Product Manager

Product Manager - AI Factory Solution
Location: United States, Spring
Salary: 152000.00 - 349000.00 USD / Year
Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: July 27, 2026
Requirements
  • Bachelor's degree in Computer Science, Engineering, Business, or a related field
  • MBA or advanced degree preferred
  • 10+ years of product management experience, with at least 5 years focused on AI/ML products or solutions
  • Demonstrated ability to build large-scale AI solutions that bring together hardware, software and services into a cohesive offering
  • Strong understanding of AI technologies, including AI/ML lifecycle (training, tuning, inferencing), large language models, computer vision, and cloud-based AI platforms (e.g., AWS SageMaker, Microsoft AzureML, Google AI)
  • Proven track record of launching successful AI products, with experience in agile methodologies and tools like Jira
  • Background in High Performance Computing (HPC) and experience blending it with AI workloads will be an advantage
  • Excellent analytical skills, with proficiency in data analysis and market testing
  • Outstanding communication and stakeholder management abilities, capable of presenting to technical and non-technical audiences up to the senior executive/SVP levels
  • Ability to thrive in a startup-like fast-paced, innovative environment with strong problem-solving skills
Job Responsibility
  • Define and drive the overall AI factory at-scale and sovereign solution vision, roadmap, and features, while closely aligning with customer needs and HPE strategic goals
  • Define and drive the key software components necessary for the solution, which may be a mix of HPE developed, commercial and community IP
  • Conduct market research, competitive analysis, and customer interviews to identify AI factory opportunities and validate solution ideas and software features in a quick turn manner
  • Collaborate with engineers, product managers and presales architects to translate requirements into technical specifications and prototypes
  • Oversee the software integration and end-to-end solution lifecycle, from feature ideation and MVP development to launch, iteration, and scaling
  • Monitor solution performance using KPIs like full-stack wins, product mix, customer satisfaction, and iterate offering based on data insights
  • Work with legal, finance, pricing and supply chain to set up and manage resale contracts for commercial SW
  • Partner with sales and marketing to develop go-to-market strategies, pricing models, support strategies and customer enablement materials
  • Ensure solution complies with ethical AI standards while ensuring highest level of data privacy and sovereignty (e.g., GDPR, CCPA)
  • Stay abreast of AI trends, such as generative models, agentic AI, and industry applications
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime

Senior Product Manager - AI Infrastructure and Solutions

The Senior Product Manager, AI Infrastructure and Solutions owns the system-leve...
Location: United States, Santa Clara
Salary: 188000.00 - 282000.00 USD / Year
AMD (amd.com)
Expiration Date: Until further notice
Requirements
  • Senior-level, cross-functional, large enterprise experience in Product Management, Solutions Product Management, Technical Product Management, Systems Engineering, or related roles in: data center infrastructure; AI/ML platforms; HPC systems; enterprise solutions; integrated hardware + software products
  • Strong working knowledge of several of the following: platform/system architecture (power/thermals, topology, I/O, networking, memory, RAS); firmware/driver and software stack dependencies; benchmarking methodology and reproducibility
  • Demonstrated experience defining and launching system-level solutions requiring coordination across hardware, firmware, drivers, software frameworks, and partner ecosystems
  • Excellent communications, collaboration, and relationship building across functions and organizations, including experience presenting to executives
Job Responsibility
  • Own system-level product definition (HW + SW)
  • Author and maintain System Requirements Specifications (SRS) for solutions spanning hardware platform, firmware/driver, software stack, compatibility, manageability, and lifecycle expectations
  • Define assumptions, constraints, non-goals, and acceptance criteria
  • Translate customer/segment needs into prioritized requirements
  • Consolidate and normalize customer, Sales, OEM/ODM partner, and internal stakeholder inputs into a structured, prioritized requirements backlog with decision rationale
  • Ensure requirements are workload-grounded and aligned to segment outcomes
  • Lead cross-functional dependency and integration planning
  • Identify dependencies across silicon/platform, firmware/drivers, ROCm/software stack, frameworks, OEM systems, and cloud images
  • Surface integration risks early, drive tradeoff discussions, and maintain decision logs
  • Set performance targets and benchmarking expectations (with TME/Eng)
What we offer
  • Benefits offered are described: AMD benefits at a glance
  • Fulltime