We are seeking a Monitoring Service Owner – IT Infrastructure to take end-to-end ownership of monitoring services, ensuring their stability, availability, and optimal performance. The role focuses on proactive monitoring, effective alerting, and continuous optimisation, while aligning services with business requirements, compliance standards, and cloud transformation strategies. The successful candidate will act as a technical escalation point, support operational excellence, and drive continuous improvement within infrastructure services.
Job Responsibilities
Own and manage end-to-end monitoring services to ensure high availability, performance, and reliability
Perform root cause analysis (RCA), resolve incidents, and deliver permanent solutions
Coordinate, prepare, and implement service change requests and ensure readiness for CAB reviews
Monitor service performance, generate reports, and prepare service analysis for monthly review meetings
Lead Continuous Service Improvement (CSI) initiatives and deliver operational enhancements
Act as a technical escalation point across service desks, support teams, and SMEs
Maintain and update operational documentation, including user guides and work instructions
Conduct operational assessments for projects and provide input for go-live decisions
Develop automation scripts and solutions to improve efficiency and reduce manual intervention
Collaborate with internal stakeholders to align service delivery with business needs and technical strategy
Monitor ticket quality, coach SMEs, and support capability development across the team
Requirements
Degree in Engineering, Computer Science, or a related field
Extensive hands-on expertise in Azure monitoring and administration
Skilled in Azure Monitor tools, including Log Analytics, metrics, alerts, dashboards, and diagnostic settings
Proficient in Kusto Query Language (KQL) for queries, dashboards, and reporting
Knowledgeable in Infrastructure as Code (IaC) tools such as Terraform or ARM templates
Experienced in SCOM and PowerShell scripting for automation and troubleshooting
ITIL Foundation certification or an equivalent service management qualification
Familiar with telemetry concepts, reporting tools, and performance analysis
A collaborative communicator with strong interpersonal, organisational, and problem-solving skills
Comfortable working in large-scale production environments with a quality- and customer-focused approach
What we offer
Opportunity to lead critical infrastructure services in a global organisation
Exposure to advanced cloud monitoring tools and large-scale IT environments
Collaborative and inclusive team culture focused on innovation and improvement
Continuous learning through real-world service optimisation and transformation initiatives
Engagement in cross-functional projects with high business impact