HPC Linux System Administrator
High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers’ digital transformation, enabling them to tackle their complex, data-intensive workloads. By combining deep expertise with the development of the world’s most cutting-edge, high-performance supercomputers, we are defining the next era of computing and delivering valuable insight and innovation. Join us and redefine what’s next for you.
Job Responsibilities:
Be hands-on: develop a solid understanding of the Linux system and be able to test it
Manage and maintain HPC clusters, including installation, configuration, and optimization of compute and management nodes
Administer Linux/Unix-based systems, ensuring high availability, performance, and security
Perform system imaging, software provisioning, and configuration management using tools such as Ansible
Conduct hardware troubleshooting and coordinate with vendors or internal teams for hardware repairs and replacements
Oversee lab systems used for development, testing, and release validation in HPC environments
Manage storage systems (NFS, Lustre, GPFS, RAID) and ensure efficient data flow across the HPC environment
Monitor system performance, perform regular health checks, and implement preventive maintenance measures
Apply OS, firmware, and security updates to maintain system stability and compliance
Develop and maintain automation scripts (using Bash, Python, or Ansible) to improve operational efficiency
Document system configurations, maintenance procedures, and troubleshooting guides
Collaborate with cross-functional teams across geographies to resolve issues, plan upgrades, and support project activities
Provide guidance and mentoring to less-experienced staff members
Requirements:
Bachelor's or Master's engineering degree in Computer Science or Information Systems
Typically 4-8 years of experience
Strong proficiency in Linux/Unix administration (installation, configuration, tuning, troubleshooting)