CrawlJobs Logo

Senior IT Problem & Major Incident Manager

United Kingdom, London Employment contract · Job Posted January 24, 2026
Apply Position
Job Link Share

Job Description

A fantastic opportunity has arisen for a Senior IT Problem & Major Incident Manager to join our London based law firm on a permanent basis.

Job Responsibility

  • Have a sound understanding of the ITIL framework and practical experience in operating ITIL-aligned problem management processes
  • Significant experience in managing major incidents, including stakeholder communication, incident analysis and resolution, and working in stressful situations
  • Mentor Major Incident Managers
  • Conduct Major Incident Reviews
  • Champion for the problem management practice
  • Prevent recurrence of problems through effective problem management
  • Facilitate root cause analysis and methodologies
  • Ensure effective problem resolution through collaboration
  • Ensure accurate problem records and known errors
  • Organise and chair meetings related to the practice
  • Provide problem training and guidance to resolution teams
  • Provide reports and metrics on problem activities
  • Audit problem activities
  • Continuously improve problem processes by identifying, proposing, and implementing improvements

Requirements

  • Experience in the Problem Management practice, 5+ years preferred
  • Experience in Major Incident Management and Critical Situation Management, 7+ years preferred
  • Certified in ITIL 4 Foundation (required)
  • Certified in ITIL 4 Problem Management (preferred)
  • Certified in ITIL 4 Incident Management (preferred)
  • Knowledge of ITIL 4 principles and practices
  • Possess strong analytical and problem-solving skills
  • Possess strong written and verbal communication skills
  • Confident in constructively challenging the status quo and influencing stakeholders to achieve optimal outcomes
  • Proficiency in ITSM tools, ServiceNow a plus

Nice to have

  • Certified in ITIL 4 Problem Management
  • Certified in ITIL 4 Incident Management
  • Proficiency in ITSM tools, ServiceNow a plus

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior IT Problem & Major Incident Manager

8 matching positions

Senior Manager IT Storage Engineering

Lead the strategy, architecture, and delivery of enterprise storage and data pro...
Location
Location
United States , San Jose
Salary
Salary:
180400.00 - 270600.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Leadership and people management skills
  • Ability to mentor and grow high-performing teams
  • Communication skills, ability to translate complex technical concepts into business outcomes
  • Collaborative mindset with strong stakeholder management across Engineering, IT, Security, and Compliance
  • Strategic thinking with a balance of innovation and pragmatic execution
  • Problem-solving orientation with a focus on continuous improvement and operational excellence
  • Customer-focused approach with an emphasis on user experience and service reliability
  • Experience in enterprise storage and data protection (file, block, object)
  • Engineering team management experience
  • Proven expertise in high-performance storage solutions for EDA, HPC, AI/ML workloads
Job Responsibility
Job Responsibility
  • Own and evolve the storage platform roadmap, balancing EDA performance requirements with enterprise resilience, compliance, and cost goals
  • Define and govern reference architectures across file, block, object storage, and data protection (on-prem and cloud)
  • Lead storage design reviews for EDA workflows, including metadata-intensive and high IOPS/low latency workloads (build/test/simulation/regression flows)
  • Establish standard patterns for tiering, archiving, retention, immutability (WORM), and disaster recovery (RPO/RTO, replication, failover)
  • Design solutions supporting AI workloads (including >5TB/s training throughput)
  • Ensure storage segmentation and isolation aligned with performance and security requirements
  • Align all architectures with security, encryption, RBAC, audit logging, and data governance standards
  • Own end-to-end storage and backup service delivery (availability, performance, capacity, recoverability, user experience)
  • Lead major incident management, root cause analysis, and corrective/preventive actions
  • Define and track SLAs/SLOs (latency, throughput, backup success, restore times, replication health)
What we offer
What we offer
  • Benefits offered are described: AMD benefits at a glance
  • Fulltime
Read More
Arrow Right

Senior Incident & Problem Management Leader

The Senior Incident & Problem Management Leader plays a critical role within the...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
coca-colahellenic.com Logo
Coca-Cola HBC
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Information Technology, Computer Science, or related field
  • ITIL v4 certification required
  • additional certifications in IT4IT or SIAM are a plus
  • Proven experience in incident and problem management within a complex, multi-vendor environment
  • Strong understanding of SIAM principles and experience working in a SIAM ecosystem
  • Excellent analytical and problem-solving skills, with a keen eye for detail
  • Strong leadership and communication skills, with the ability to manage multiple stakeholders
  • Experience with IT Service Management (ITSM) tools such as ServiceNow, BMC Remedy, or similar
  • In-depth knowledge of ITIL v4 and IT4IT best practices
  • Strong understanding of incident and problem management processes and techniques
Job Responsibility
Job Responsibility
  • Lead the end-to-end Incident Management process across a complex, multi-vendor SIAM ecosystem
  • Act as senior escalation point for high-impact and business-critical incidents, including escalations from senior management
  • Own and oversee Major Incident Management, ensuring effective war-room leadership, decision-making, and executive-level communication
  • Ensure incidents are managed in line with agreed SLAs, OLAs, and XLAs across all platforms and service providers
  • Drive post-incident reviews for major and recurring incidents, ensuring actionable outcomes and measurable improvements
  • Lead the Problem Management process end-to-end, ensuring problems are identified, prioritized, analyzed, and permanently resolved
  • Ensure high-quality Root Cause Analysis (RCA) and follow-through on corrective and preventive actions
  • Govern the Known Error Database (KEDB) and ensure effective reuse of known errors and workarounds across service teams
  • Proactively identify systemic issues and recurring failure patterns, translating insights into service improvement initiatives
  • Ensure process documentation is reviewed and updated at least every 6 months or upon material change
Read More
Arrow Right

Senior Incident Manager

Microsoft Cloud Infrastructure and Operations (CO+I) is the engine that powers M...
Location
Location
United States , San Antonio; Redmond; Atlanta; Phoenix; Washington DC
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Mechanical Engineering, Electrical Engineering, Aerospace Engineering, Data Science, Cybersecurity, or related field AND 3+ years technical experience in data center or critical environment space OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Shares insights and best practices that can be applied to improve development and operations across related sets of the systems, services, platforms, and/or products
  • Mentors and coaches other engineers to help them identify and propose relevant solutions
  • Collaborates within and across teams by proactively and systematically sharing information with an appropriate level of detail for their audience
  • Overcomes obstacles by resolving conflicts and issues across interdependent teams and engages with partners and stakeholders so issues can be resolved and mutual objectives are met
  • Develops, leverages, and drives sharing of information and knowledge base across teams
  • Leverages advanced technical expertise, judgment, and decision making to coordinate multiple work streams and resources in crisis situations to drive mitigation plan and resolve, reduce, or mitigate the impact of a crisis by engaging necessary teams and escalating to appropriate stakeholders
  • Independently conducts root cause analyses and participates in post-incident reviews based on incidences/crises for the purposes of leading continuous improvement
  • Applies diagnostic expertise
  • Provides guidance to other engineers working to mitigate and resolve issues
  • Communicates customer impact and other relevant information with key stakeholders, leadership, and customers
  • Fulltime
Read More
Arrow Right

Enterprise Major Incident & Problem Management Analyst

The Major Incident and Problem Manager plays a critical role in managing and res...
Location
Location
Lithuania , Vilnius
Salary
Salary:
3129.79 - 4471.10 EUR / Month
cencora.com Logo
Cencora
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's Degree in Computer Science, Information Systems, Business Administration, or related field
  • or equivalent work experience
  • ITIL4 certified is a plus
  • Major Incident Management expertise: strong command of ITIL/ITSM concepts (incident, major incident, problem, change) and service restoration focus
  • Problem Management & RCA awareness: understanding of PMIR/RCA practices (5 Whys methodology) and how to translate incidents into corrective/preventive actions
  • Change/release awareness: ability to identify change-related outages, enforce change controls during recovery, and coordinate rollback/failover decisions
  • Customer/service mindset: focus on restoring critical services, minimizing impact, and supporting business continuity
  • Operational leadership under pressure: ability to take command, structure a response quickly, and drive multiple teams to resolution
  • Technical breadth: working knowledge across infrastructure, networks, cloud, end-user, applications, databases, and integrations (enough to triage and challenge effectively)
  • Crisis communication: Excellent written and verbal communication skills
Job Responsibility
Job Responsibility
  • Own the Major Incident (MI) process end-to-end: declare/confirm major incidents, assign severity/priority, maintain command and control, keep a clear action plan, track owners/timelines, remove blockers, and ensure operational discipline to drive restoration of service
  • Communications Management: provide timely, accurate updates (internal and external as required), manage executive briefings, and set stakeholder expectations
  • Tooling and Documentation: ensure incidents are logged correctly in ITSM tools, with accurate timestamps, actions, and evidence. Track and report MI KPIs, trends, and improvement outcomes
  • Collaborate with Executive Leadership: Work closely with senior management to align incident management strategies with business goals
  • Service Improvement: identify recurring issues, process gaps, and automation opportunities
  • recommend and drive MI process improvements
  • Problem Management Handoff: ensure follow-up work is created, prioritized, and tracked (problem records, corrective actions, risk acceptance where needed)
  • Problem Management: ensure problems are logged, categorized, prioritized, and progressed through to closure. Perform root cause analysis to prevent future incidents
  • After-hours/On-call Participation: Participate in out-of-business-hours support duties as part of an on-call rotation schedule
What we offer
What we offer
  • competitive annual bonus
  • life insurance from Day 1
  • best-in-class health insurance package
  • up to 6 fully paid benefit days a year
  • referral bonus scheme
  • boundless learning opportunities
  • global Employee Assistance Program
  • office location in Quadrum equipped with everything you need for a small break at work and fresh snacks at all times
  • Fulltime
Read More
Arrow Right

Senior Incident and Problem Management Leader

The Senior Incident & Problem Management Leader plays a critical role within the...
Location
Location
Egypt , Cairo
Salary
Salary:
Not provided
coca-colahellenic.com Logo
Coca-Cola HBC
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Information Technology, Computer Science, or related field
  • ITIL v4 certification required
  • additional certifications in IT4IT or SIAM are a plus
  • Proven experience in incident and problem management within a complex, multi-vendor environment
  • Strong understanding of SIAM principles and experience working in a SIAM ecosystem
  • Excellent analytical and problem-solving skills, with a keen eye for detail
  • Strong leadership and communication skills, with the ability to manage multiple stakeholders
  • Experience with IT Service Management (ITSM) tools such as ServiceNow, BMC Remedy, or similar
  • In-depth knowledge of ITIL v4 and IT4IT best practices
  • Strong understanding of incident and problem management processes and techniques
Job Responsibility
Job Responsibility
  • Lead the end-to-end Incident Management process across a complex, multi-vendor SIAM ecosystem
  • Act as senior escalation point for high-impact and business-critical incidents, including escalations from senior management
  • Own and oversee Major Incident Management, ensuring effective war-room leadership, decision-making, and executive-level communication
  • Ensure incidents are managed in line with agreed SLAs, OLAs, and XLAs across all platforms and service providers
  • Drive post-incident reviews for major and recurring incidents, ensuring actionable outcomes and measurable improvements
  • Lead the Problem Management process end-to-end, ensuring problems are identified, prioritized, analyzed, and permanently resolved
  • Ensure high-quality Root Cause Analysis (RCA) and follow-through on corrective and preventive actions
  • Govern the Known Error Database (KEDB) and ensure effective reuse of known errors and workarounds across service teams
  • Proactively identify systemic issues and recurring failure patterns, translating insights into service improvement initiatives
  • Ensure process documentation is reviewed and updated at least every 6 months or upon material change
Read More
Arrow Right

Incident Manager

As an Incident Manager, you will be a part of the Sysco Information Technology S...
Location
Location
United States , Houston
Salary
Salary:
Not provided
sysco.com Logo
Sysco
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in IT Support handling emails, incidents and phone calls supporting technology products
  • 3+ years of experience in playing the role of an Incident Manager and/or leading a support team
  • Superior English language and communication skills - both written and verbal
  • Skills with the ability to articulate technical solutions for both technical and non-technical audiences
  • Highly motivated, self-directed and responsible contributor
  • Ability to direct cross-functional resources through incident life cycle
  • Technical knowledge and experience working with Windows/Linux environments, Cloud environments, SQL, Active Directory, Scripting, etc.
  • Network troubleshooting knowledge including LAN/WAN, DHCP, TCP/IP, Firewalls, and Routing Cloud Services/Platforms
  • Monitoring tools such as Dynatrace, New Relic, DataDog, Nagios and Splunk
  • Bachelor's degree in computer science, information technology or equivalent
Job Responsibility
Job Responsibility
  • Lead, own and drive incident management bridge calls and chats with support teams, application development teams, infrastructure teams’ business teams, and senior leadership with the purpose of remediating major incidents
  • Establishing strong command and control of an incident, establishing clear accountability/ownership and methodical evaluation of complex issue scenarios
  • Applying technical/environmental knowledge and experience to develop and drive appropriate tasks, forming paths to resolution
  • Effectively verbalize, document, communicate and facilitate the identification, handling, status reporting, solution options and alternatives, change implementation of various incidents and problems reactively and proactively identified in the IT environment, and effectively bring them to resolution and/or closure
  • Understand the Service Levels, determine the criticality of incidents, ensure incidents are addressed within agreed SLAs by the service owners, and taking follow-up actions until incidents are resolved
  • Ensure the correct resources are working on the resolution of major incidents appropriate to the severity and identify when escalation is required and trigger such escalation accordingly
  • Work closely with Service Owners to deliver a clear, concise picture of incidents and the short-term remediation applied
  • ensure problem prevention methods and mitigation strategies are continually applied to improve application availability and make recommendations on long-term solutions
  • Effectively document analysis, business cases, solution strategies, incident and problem tracking, processes, procedures, and knowledge articles associated with implementing fixes and solutions to existing or predicted IT incidents and problems
  • Facilitate analysis and review meetings and provide recommendations to service owners and platform owners to improve practices, service availability and IT services
  • Fulltime
Read More
Arrow Right

Application Support Senior Manager

The Apps Support Senior Manager accomplishes results through the management of p...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years experience in an Apps Support role would be an added advantage, but not essential
  • Experience with people management
  • Highly assertive communications skills, commanding personality. Ability to engage a large audience and lead the discussion with clear, articulate, and highly assertive communication. Must show confidence in all communications
  • Experience with application support would be an added advantage, but not essential
  • Effectively share information with other support team members and with other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Ability to communicate appropriately to relevant stakeholde
  • Bachelor’s/University degree or equivalent experience, Master’s degree preferred
Job Responsibility
Job Responsibility
  • Manages one or more apps support teams
  • Provides technical oversight across systems and applications
  • leverage skills across multiple apps support areas
  • Provide a holistic view of the production environment for both technology and business management, working closely as a trusted partner for senior business heads
  • Provide guidance on feedback reviews with development managers, ensuring outstanding issues are tracked to closure, particularly long-term strategic fixes
  • Performs controlled resolution of incidents and problems including prioritization and escalation to relevant groups when appropriate, root cause analysis of all problems with follow-through to resolution
  • Consults with the primary clients of the application in conjunction with development managers in order to understand the strategic objectives of users
  • Requires awareness of audit and compliance related issues
  • Contributes to formulation of strategies for apps support and other functional areas
  • Champion stability initiatives to enable application high availability for Business-As-Usual which includes better monitoring, failover and resiliency
  • Fulltime
Read More
Arrow Right

Operational Incident Manager

You’ll lead the Operational Incident Team within Channel Operations, playing a c...
Location
Location
United Kingdom , Stoke on Trent
Salary
Salary:
Not provided
vodafone.com Logo
Vodafone
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong understanding of the information systems used in frontline environments and how they support day to day operations
  • Comfortable monitoring performance, spotting issues early, and taking the lead when outages or incidents occur
  • Experience leading people and enjoy supporting, coaching, and guiding others
  • Confident presenting information and answering questions with clarity, whether you’re speaking with colleagues, managers, senior leaders, or customers
  • Highly organised, with strong planning skills that help you stay in control, even when priorities are shifting
  • Build trusted relationships with senior stakeholders, using clear communication and effective negotiation to get the right outcomes
  • Stay calm under pressure, able to make sound decisions and deliver results to tight timescales
  • Naturally analytical, able to break down problems, gather the right data, understand the facts, and drive issues through to resolution
Job Responsibility
Job Responsibility
  • Lead the Operational Incident Team within Channel Operations
  • Ensure incidents are managed effectively, services remain available, and our people have what they need to deliver for customers
  • Act as the channel lead during major incidents
  • Lead day to day operational support, including oversight of the monitoring centre and live operational performance
  • Manage incidents that impact Consumer Operations and the services we provide
  • Ensure Business Continuity plans are in place, well understood, and activated when required
  • Guarantee accurate, timely, and right first time system access and setup for colleagues across the operation
  • Act as the principal representative for all Channels during major incidents, coordinating actions, communications, and decision making across operational teams and senior stakeholders
  • Provide clear direction, timely escalations, and regular updates that safeguard customer experience and operational performance
  • Own key service relationships as the Service Action Owner
What we offer
What we offer
  • Excellent basic salary
  • Bonus
  • Up to 28 days off plus bank holidays
  • Paid time for charity work
  • Discounts
  • Vouchers
  • Pension plan
  • Learning tools
  • Parental leave policies
  • Fulltime
Read More
Arrow Right