Job Description:
This is an individual contributor role within the CTO ITSO Application Support organization, with end‑to‑end responsibility for the operational health and reliability of enterprise monitoring, scheduling, and observability platforms. The role owns production support, incident response, platform health, upgrades, and operational readiness, partnering closely with engineering, SRE, and onshore teams to meet defined SLAs, SLOs, and availability targets. The position emphasizes strong ITIL execution combined with SRE principles, including reliability engineering, proactive issue prevention, automation, and reduction of operational toil. This role is part of the CTO Application Support team within ITSO, responsible for run‑the‑business operations of critical enterprise scheduling, monitoring, and observability platforms. The team ensures availability, reliability, performance, and resilience of platforms such as Autosys, Grafana, Splunk, Cribl, ThousandEyes, and other observability and monitoring tools that support production environments across the enterprise. The function operates at the intersection of ITIL‑aligned service operations and an SRE mindset, with a strong focus on operational stability, automation, observability, and continuous service improvement.