This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for an experienced IT Manager specializing in platform research systems to join our team in Piscataway, New Jersey. This is a contract position offering an exciting opportunity to lead cutting-edge initiatives in high-performance computing and storage environments. The ideal candidate will possess a strong technical background and proven leadership skills to manage complex IT systems and support research-driven technology needs.
Job Responsibility:
Oversee the administration and engineering of IT systems, including hardware, virtualization technologies, and large-scale environments
Lead a team of IT professionals, providing guidance and support to ensure successful project execution
Manage high-performance computing systems, including Linux at scale, job schedulers, and workload performance optimization
Implement and maintain high-performance research storage solutions, including parallel file systems and disaster recovery processes
Configure and optimize high-speed interconnects, GPUs, and node provisioning within clustered environments
Develop and manage automation and configuration management processes using scripting languages and tools
Ensure security best practices are adhered to, including identity integrations, access controls, and vulnerability management
Collaborate with researchers, central IT teams, and vendors to address technical needs and challenges
Utilize project management skills to plan, execute, and deliver IT initiatives effectively
Support container technologies and software environment management for research applications
Requirements:
At least 7 years of experience in IT systems administration or engineering, with expertise in large-scale or clustered environments
A minimum of 3 years of leadership experience managing IT teams
Proficiency in Linux systems, job scheduling tools, and performance tuning
Hands-on experience with high-performance research storage, including parallel file systems and disaster recovery concepts
Knowledge of high-speed interconnects, GPUs, and node provisioning in clustered systems
Strong skills in automation, configuration management, and scripting for operational support
Familiarity with security practices for shared compute and storage services, including identity integrations and patch management