Cleared Senior/Principal Computing Infrastructure Engineer Job at Sandia National Laboratories (Albuquerque)

Job Description

We are seeking a Computing Infrastructure Engineer to perform the full lifecycle management (analysis, design, development, testing, integration, and maintenance) of multi-user information systems, including servers, storage, virtual and cloud infrastructures, and associated components and subsystems. The engineer maintains the integrity of servers and systems to meet established requirements for service levels, disaster recovery, business continuity, and security, and coordinates with customers, suppliers, and domain experts such as networking, software, and desktop personnel.

Job Responsibilities

Performs the full lifecycle management (analysis, design, development, testing, integration, and maintenance) of multi-user information systems, including servers, storage, virtual and cloud infrastructures, and associated components and subsystems
Maintains the integrity of servers and systems to meet established requirements for service levels, disaster recovery, business continuity, and security
Coordinates and interfaces with customers, suppliers, and domain experts such as networking, software, and desktop personnel
Applies systematic, disciplined, and quantifiable engineering and life cycle management techniques and processes to deliver integrated information systems hardware for mission-enabled or direct mission deployment and production
Performs technical research and development to enable continuing innovation within the infrastructure
Evaluates, recommends, and may select hardware and software technologies for hardware system solutions
Introduces new computer systems and technologies into current configurations for optimum information systems engineering functionality
Collaborates with customers, suppliers, and other domain experts such as networking, software and desktop personnel in the development and integration of hardware systems solutions
Performs hardware systems analysis, monitoring, troubleshooting, repair and recovery, disaster recovery, software installations, operating system and software application upgrades and security patches, performance tuning, hardware upgrades and resource optimization, user account maintenance, and user training

Requirements

A Bachelor's degree in a relevant discipline and five (5) years of directly relevant experience, or an equivalent combination of directly relevant education and engineering or scientific experience that demonstrates the knowledge, skills, and ability to perform independent research and development
Experience with various operating systems (e.g., Linux, Windows), networking protocols, and virtualization technologies
Experience with scripting languages (e.g., Python, Bash) and configuration management tools (e.g., Ansible, Puppet)
Experience with various network hardware components (e.g., Dell/HP servers and desktops, thin clients, zero clients, firewalls)
Active DOE Q clearance

Nice to have

Experience in designing, implementing, and managing computing infrastructure, including servers, storage systems, and network devices
Experience with data center operations and best practices
Understanding of cybersecurity principles and practices, including secure network design, access controls, and vulnerability management
Certifications: Industry certifications such as Certified Information Systems Security Professional (CISSP), Certified Cloud Security Professional (CCSP), or vendor-specific certifications (e.g., AWS Certified Solutions Architect) are highly desirable
Project Management: Familiarity with project management methodologies and tools to effectively plan, execute, and deliver infrastructure projects on time and within budget
Automation and DevOps: Experience with automation tools (e.g., Jenkins, GitLab CI/CD) and knowledge of DevOps principles to streamline infrastructure deployment and management processes
Cloud Expertise: Proficiency in deploying and managing infrastructure in public, private, or hybrid cloud environments. Knowledge of containerization technologies (e.g., Docker, Kubernetes) is a plus
Continuous Learning: Demonstrated commitment to staying updated with emerging technologies, industry trends, and best practices through self-learning, training, or participation in professional communities
Knowledge of NIST 800-53 requirements
Strong analytical and problem-solving skills to identify and resolve infrastructure issues efficiently. Experience with monitoring tools and performance optimization techniques is a plus
Excellent verbal and written communication skills to interact with team members, stakeholders, and vendors. Ability to work collaboratively in a multidisciplinary environment

What we offer

Challenging work with amazing impact that contributes to security, peace, and freedom worldwide
Extraordinary co-workers
Some of the best tools, equipment, and research facilities in the world
Career advancement and enrichment opportunities
Flexible work arrangements for many positions include 9/80 (work 80 hours every two weeks, with every other Friday off) and 4/10 (work 4 ten-hour days each week) compressed workweeks, part-time work, and telecommuting (a mix of onsite work and working from home)
Generous vacation, strong medical and other benefits, competitive 401k, learning opportunities, relocation assistance and amenities aimed at creating a solid work/life balance
