This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
This role supports the operation and maintenance of high-traffic, business-critical internet communication systems to ensure continuous availability. It primarily focuses on automating system administration and monitoring to enhance network efficiency and reliability. The role involves conducting tests for redundancy, resilience, and failover to maintain uptime standards. Success is measured by system performance, uptime, and the ability to identify and resolve faults proactively. The work directly impacts organizational stability and the performance of digital infrastructure for customers and internal users.
Job Responsibility:
Develops and maintains monitoring systems that detect symptoms to enable proactive fault identification and performance tuning
Manages platform infrastructure including capacity planning and scaling to support system demands
Analyzes operational data to uncover insights that inform system improvements and reliability
Collaborates with development teams to enhance services through testing and controlled release procedures
Designs, implements, and monitors complex system architectures to ensure high performance and uptime
Also responsible for other duties/projects as assigned by business management as needed
Requirements:
Bachelor's Degree OR combination of education and experience deemed equivalent
Acceptable areas of study include Computer Science, Engineering
Must finish school between December 2026 and June 2027 to be eligible
Less than 2 years - Operating and maintaining high traffic, business critical internet site communications systems
Less than 2 years - Automating the administration and monitoring of network systems
Less than 2 years - Conducting tests for redundancy, resilience, and failover in digital infrastructure
Understanding of high traffic network systems and their operations
Proficiency in automating the administration and monitoring of network systems.
Ability to conduct rigorous tests for redundancy, resilience, and failover.
Proficiency in using DevOps-centric automation tools and technologies for CICD, configuration management, etc.
Ability to use software to improve the availability, scalability, latency, and efficiency of services.
Ability to use dashboards for continuous monitoring and health check of applications, and the underlying infrastructure.
Understanding of how to improve the quality of services using the monitoring feedback for non-production environment.
Ability to develop and prepare data for dashboard views.
At least 18 years of age
Legally authorized to work in the United States
High School Diploma or GED
Must be actively enrolled in a degree program or graduated within the last year
Nice to have:
Master's/Advanced Degree Computer Science, Engineering or Related Field
What we offer:
Relocation may be provided to program participants who reside more than 50 miles from the internship location