HPC Engineer Job at WhiteBlue (Chennai)

Job Responsibility

Design, implementation & support of high-performance compute clusters
Solid knowledge on HPC systems, including CPU/GPU architecture, scalable/robust storage, high-bandwidth inter-connects, and a knowledge of cloud based computing architectures
Apply their attention to detail to generate HW BOMs for the HCP Clusters, provide vendor management and oversee HW release activities
Use their strong skills with the Linux OS to configure appropriate operating systems for the HPC system
Understand and assemble the project specifications and performance requirements at the subsystem and system levels
Adhere and drive to project timelines to insure program achievements complete on time
Support design and release of new products to manufacturing and ultimately the customer, providing quality golden images, procedures, scripts and documentation to the manufacturing team and customer support team
Validated in-depth and flavor agnostic knowledge of Linux systems (SuSE, RedHat, Rocky, Ubuntu)
Experience of crafting and maintaining robust storage
Strong HPC HW knowledge especially in the server, GPU, networking, Storage, BIOS & BMC arenas
Experience in System-D, Net boot/PXE, Linux HA
Strong understanding of TCP/IP fundamentals and knowledge of protocols, DNS, DHCP, HTTP, LDAP, SMTP
Ability to code and develop Shell and Python scripts
Experience with one or more of the listed Configuration Mgmt utilities. (Salt, Chef, Puppet etc)

Requirements

Experience in designing, implementing, and supporting high-performance computing (HPC) clusters with strong knowledge of CPU/GPU architecture, scalable storage, interconnects, and cloud-based systems
Solid knowledge on HPC systems, including CPU/GPU architecture, scalable/robust storage, high-bandwidth inter-connects, and a knowledge of cloud based computing architectures
Apply their attention to detail to generate HW BOMs for the HCP Clusters, provide vendor management and oversee HW release activities
Use their strong skills with the Linux OS to configure appropriate operating systems for the HPC system
Understand and assemble the project specifications and performance requirements at the subsystem and system levels
Adhere and drive to project timelines to insure program achievements complete on time
Support design and release of new products to manufacturing and ultimately the customer, providing quality golden images, procedures, scripts and documentation to the manufacturing team and customer support team
Validated in-depth and flavor agnostic knowledge of Linux systems (SuSE, RedHat, Rocky, Ubuntu)
Experience of crafting and maintaining robust storage
Strong HPC HW knowledge especially in the server, GPU, networking, Storage, BIOS & BMC arenas
Experience in System-D, Net boot/PXE, Linux HA
Strong understanding of TCP/IP fundamentals and knowledge of protocols, DNS, DHCP, HTTP, LDAP, SMTP
Ability to code and develop Shell and Python scripts
Experience with one or more of the listed Configuration Mgmt utilities. (Salt, Chef, Puppet etc)
8-10 years

WhiteBlue - All Job Offers

Select Country

HPC Engineer

Job Responsibility

Requirements

Looking for more opportunities?

HPC Engineer

HPC Engineer

Hpc Engineer

Senior Distributed Systems Engineer (HPC Platform)

Senior Field Application Engineer - HPC

Staff Flight Sciences Software and HPC Engineer

Member of Technical Staff, Site Reliability Engineer (HPC)

Machine Learning Engineer – HPC

Product Development Engineer-hpc

Our AI answers in your language