Engineering a system to be more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software development, if you have a passion for making systems better, we need you!
Job Responsibilities:
Lead the development of more robust systems for Booz Allen by building a resilient infrastructure
Build in redundancy, implement monitoring tools, and automate wherever possible
Reduce toil by scripting routine tasks and automating self-repair
Support your team of engineers and act as a subject matter expert for our clients
Requirements:
5+ years of experience creating and maintaining highly reliable and scalable systems to reduce issues and downtime, including design and implementation of physical servers, storage systems, and network infrastructures
5+ years of experience providing technical support for system upgrades, rollouts, and enhancements
3+ years of experience developing and deploying infrastructure solutions
3+ years of experience employing and sustaining VMware v6.x and later, including the design and implementation of virtual data centers
3+ years of experience designing and deploying highly available storage solutions, including SAN and high-capacity storage
Experience with data center design and buildout
Experience transforming large-scale software, data center, or on-premises infrastructure programs to a virtualized architecture
Ability to interact with clients and lead, train, and mentor junior system administrators
Top Secret clearance
Bachelor's degree
Nice to have:
Experience with user management and monitoring tools
Ability to analyze test results
Excellent verbal and written communication skills for working with stakeholders, developers, and operations teams to continuously improve the health, stability, and reliability of the system