This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Lead complex, broad impact initiatives including provision of high level systems consultation for the technology teams
Work as key participant in large scale planning of computer systems and network infrastructure for Systems Operations functional area
Review and analyze complex technical challenges, as well as escalated support issues related to core business solutions that require in depth evaluation of multiple factors, such as alternatives, enhancements, periodic systems reviews, or improvements to existing systems
Make decisions on technical changes and enhancements
Consult with engineering team on change design requiring solid understanding of technical process controls or standards that influence and drive new initiatives
Collaborate and consult with technical peers, colleagues, and mid to more experienced level managers to resolve systems support issues and achieve goals
Requirements:
5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Extensive hands-on experience leading OpenShift Container Platform (OCP) implementations and supporting production-grade Kubernetes environments
Strong expertise in DevOps and Site Reliability Engineering (SRE) practices, including automation, monitoring, and reliability-focused operations
Proven ability to operate and support mission‑critical, enterprise-scale distributed systems with high availability and performance requirements
Deep knowledge of Linux/Unix system administration, supporting both traditional and containerized workloads in production
Experience supporting Java-based enterprise applications and troubleshooting application-level and platform-level issues
Solid background in supporting Oracle databases, Autosys, and enterprise scheduling tools within hybrid environments
Hands-on experience enabling and supporting CI/CD pipelines, deployment automation, and release orchestration on OpenShift
Strong understanding of cloud-native architecture concepts, including containerization, microservices, and immutable infrastructure
Demonstrated experience working within an ITIL-aligned service management framework, including incident, problem, and change management
Advanced skills in incident triage, rapid restoration, and Root Cause Analysis (RCA) with the ability to communicate findings to both technical and executive stakeholders
Experience designing and maintaining monitoring, logging, and observability solutions using tools such as Splunk, Grafana, or equivalent platforms
Ability to build and deliver operational dashboards, executive reports, and SLA/SLO metrics to drive continuous improvement and capacity planning
Strong automation mindset with experience reducing manual effort through scripting, tooling, and preventive maintenance initiatives
Demonstrated leadership skills, including mentoring engineers, serving as an escalation point, and guiding teams through complex production issues
Excellent collaboration, communication, and stakeholder management skills, with the confidence to lead during high-severity production incidents