As a System Administrator on our project, you’ll support the operation, maintenance, and design of information technology resources within a laboratory for High Performance Computing (HPC) technology and data analytics for a research organization.
Job Responsibilities:
Support the operation, maintenance, and design of information technology resources within a laboratory for High Performance Computing (HPC) technology and data analytics for a research organization
Maintain responsibility for the installation of new equipment and operating systems to best suit the needs of research to be conducted and the researchers requiring support
Support the review of current infrastructure to provide an overview and prepare plans for improvement and enhancement that will increase efficiency and security within the network
Maintain responsibility for cutting-edge computing systems and the evaluation of their use for research in furtherance of the mission
Identify problem areas and opportunities for improvement in a mission-critical network
Help your team better understand the network by turning metrics into information and explaining their meaning
Requirements:
Experience configuring and managing Linux and Windows operating systems, including supporting day-to-day operations, installing operating system software, troubleshooting, configuring network components and maintaining their integrity, and implementing operating system enhancements
Experience monitoring and maintaining systems, including compute nodes, storage, networking, and software
Experience optimizing system operations and resource utilization and performing system capacity analysis
TS/SCI clearance with a polygraph
Associate's degree in an Information Technology field and 12+ years of experience as a lead systems administrator; Bachelor's degree in an Information Technology field and 8+ years of experience as a lead systems administrator; or 15+ years of experience as a lead systems administrator in lieu of a degree
Nice to have:
Experience in shell scripting and programming in C or Python
Experience with Simple Linux Utility for Resource Management (SLURM)
Experience with system automation, provisioning, storage, or optimization tools such as Ansible, Warewolf, Lustre, or BeeGFS
Experience with Unix
Knowledge of performance monitoring tools such as Grafana or Prometheus
Knowledge of setting up and executing benchmarks in an HPC environment and analyzing results
Knowledge of network engineering
What we offer:
Health, life, disability, financial, and retirement benefits