This role is responsible for driving enterprise-level improvements in monitoring effectiveness by defining optimization strategies, improving signal quality, and leading cross-functional initiatives for incident reduction. It requires a strong understanding of how diverse monitoring domains operate within production environments, and ownership of how monitoring and alerting functions evolve.
Job Responsibilities
Own and drive enterprise-wide initiatives for improving monitoring effectiveness and reducing alert noise
Define the standards and approach for alert optimization across multiple monitoring domains
Establish a consistent view of how events are generated across infrastructure, application, batch, network, and mainframe environments
Drive alignment across teams to improve monitoring quality and eliminate redundant or low-value alerts
Identify systemic gaps in monitoring design and lead long-term improvements
Leverage data insights and AI-driven approaches to enhance event correlation and signal quality
Mentor and guide teams in building a strong optimization and observability mindset
Requirements
10–16+ years of experience in monitoring, event management, or production environments
Proven experience in driving large-scale improvements in monitoring effectiveness or incident reduction