CrawlJobs Logo

Incident Commander, Program Manager

cash.app Logo

Cash App

Location Icon

Location:
United States , Bay Area

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

135800.00 - 245400.00 USD / Year

Job Description:

Block's Critical Incident Management Team (CIMT) plays an important role in protecting our operations, customers, and regulatory standing in the face of significant business incidents. As our organization continues to grow, we are expanding our incident command capabilities to ensure fast, coordinated, and effective responses to high-impact events. As an Incident Commander within the Risk organization, you will be at the center of Block's response to complex, high-stakes incidents, helping maintain our operational resilience and uphold our commitment to responsible innovation.

Job Responsibility:

  • Be the lead incident commander for high-severity incidents across Block's ecosystem, including fraud events, customer-impacting issues, regulatory matters, and escalated incidents
  • Direct the real-time response and coordination among cross-functional teams such as Legal, Compliance, Engineering, Product, and Customer Success
  • Be the central point of accountability for incident escalation, containment, remediation, and resolution
  • Document and maintain detailed incident timelines, key decisions, and supporting evidence throughout the lifecycle of the incident
  • Oversee the preparation of final incident reports
  • Manage internal communication channels and ensure accurate, updates are shared with stakeholders at all levels
  • Facilitate post-incident reviews to identify lessons learned and improve incident preparedness and response

Requirements:

  • 7 + years of Proven experience in incident management, crisis response, or a related role within a high-stakes operational environment
  • 5+ years of Project management, especially under time-sensitive and high-pressure conditions
  • Understanding of regulatory and compliance considerations in the financial services or technology space
What we offer:
  • Remote work
  • medical insurance
  • flexible time off
  • retirement savings plans
  • modern family planning

Additional Information:

Job Posted:
February 21, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Incident Commander, Program Manager

Crisis & Incident Management Lead – Operational Resilience - Vice President

The VP, Crisis & Incident Management Lead is responsible for the strategic leade...
Location
Location
United States Of America , NEW YORK
Salary
Salary:
150000.00 - 180000.00 USD / Year
credit-agricole.com Logo
Crédit Agricole
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Risk Management, Information Technology, Business Continuity, or a related field
  • Minimum 10+ years of experience in crisis/incident management, operational resilience, or business continuity
  • Experience leading cross-border incident response and regulatory engagement
  • Proficiency in English (both written and verbal)
  • Incident Management: Ability to analyze, prioritize, and manage incidents effectively. Cross-functional command and coordination
  • Strategic Thinking: Ability to align crisis and incident management initiatives with business objectives and regulatory requirements
  • Communication&Documentation: Ensure thorough documentation and clear communications over crisis and incident management activities
  • Leadership&Team Management: Proven track record of building and leading high performing teams. Strong project management skills. Ability to thrive in fast-paced, high-stakes environment
  • Regulatory Compliance: Expertise in navigating banking regulations and audit readiness. Deep understanding of financial compliance requirements and regulatory frameworks, including FFIEC, DORA, PRA and OCC
  • Crisis Leadership: Demonstrated ability to lead complex incident response efforts across business, technology, cyber, and third-party domains
Job Responsibility
Job Responsibility
  • Develop and lead a crisis and incident management strategy aligned to the bank’s operational resilience framework and key business services
  • Translate regulatory expectations (e.g., FFIEC, DORA, OCC, PRA) into actionable, risk-informed response strategies
  • Establish and manage governance forums and escalation protocols for crisis and incident oversight
  • Support the definition and testing of impact tolerances and maximum tolerable downtimes (MTD/MTLD) in partnership with Operational Resiliency Testing Lead, Business, and Technology stakeholders
  • Act as the lead coordinator during regional crises, ensuring structured, timely, and effective command, control, and communications
  • Maintain and continuously improve incident response plans, escalation playbooks, crisis decision trees, and communication protocols
  • Ensure that major incidents—including those involving third parties and cyber events—are managed in line with regulatory requirements
  • Integrate internal communications tools and channels into a unified communications strategy
  • Maintain and operate an auditable major incident log, with clear decision documentation, timelines, and actions taken
  • Drive optimization of incident response processes using data analytics, metrics and automation opportunities
  • Fulltime
Read More
Arrow Right

Manager II, Security Incident Command

Uber’s Incident Command team, part of the Threat Defense and Response (TDR) orga...
Location
Location
United States , New York; Seattle; San Francisco; Sunnyvale
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in one or more of the following: Security incident response
  • Production incident management (e.g., SRE, Ring0, reliability engineering)
  • Security or infrastructure operations
  • Experience leading or coordinating high-severity incidents in a complex, distributed environment
  • Experience serving as an incident commander, incident lead, or equivalent leadership role during critical incidents
  • Strong systems thinking: ability to navigate incidents across infrastructure, applications, and services
  • Excellent communication and stakeholder management skills, especially under pressure
  • Experience mentoring or managing engineers or operational responders
Job Responsibility
Job Responsibility
  • Lead a global team of incident commanders managing Uber’s highest severity security incidents
  • Drive structured, effective coordination across engineering, security, and business teams during high-impact events
  • Partner with Security, Legal, and Privacy on sensitive incidents requiring careful judgment and handling
  • Evolve incident management practices by integrating security IR and SRE/Ring0 disciplines
  • Own postmortem, premortem, and incident simulation programs to improve resilience and organizational readiness
  • Translate external incidents and emerging threats into actionable risk reduction across Uber
  • Build and integrate automation and AI-driven capabilities into incident response, postmortems, premortems, and incident simulations
  • Translate incident processes into scalable systems, defining safe automation boundaries and human-in-the-loop decision frameworks
  • Mentor and grow incident commanders in leadership, decision-making, engineering, and operational excellence
  • Foster an inclusive, high-performing culture grounded in accountability, learning, and continuous improvement
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right
New

Program Leader, Outage Management Lead – Community Operations

At Uber, reliability is critical - and when outages or severe product issues occ...
Location
Location
United States , Phoenix; San Francisco
Salary
Salary:
130000.00 - 144000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience in program management, incident management, customer operations or large-scale cross-functional environments
  • 3+ years of experience leading teams
  • Bachelors Degree Obtained
  • Experience working in high-severity or crisis / incident / outage environments requiring structured decision-making under pressure
  • Excellent written and verbal communication skills, including experience delivering executive-level updates
  • Ability to drive consensus and actionable results across highly cross-functional teams
  • Experience in large-scale customer support or global operations environments
  • Experience in designing and implementing support processes, or AI-enabled operational solutions
  • Data-driven mentality and strong business judgment
  • Track record of balancing analytical strategic thinking with quick decision-making, change management, and timely execution
Job Responsibility
Job Responsibility
  • Lead the development of the vision and support strategy for outage management and severe incident management within the Community Operations org
  • Establish governance, decision rights, and success metrics that drive consistent execution across line of business
  • Develop further and run the Outage + Critical Outage frameworks
  • Design and continuously improve the processes, playbooks, and escalation/routing standards for both standard outages and critical outages
  • Create risk- and scenario-based preparedness working with global functional leads
  • Define the CommOps “source of truth” for impact sizing and operational reporting during outages and remediations phases
  • Lead and Develop the Ops Commander Function
  • Strengthen Early Detection and Incident Readiness
  • Drive Technology Strategy
  • Lead CommOps coordination during live Critical Outages
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right

Service Delivery Manager – Infrastructure Operations

The Service Delivery Manager (SDM) is responsible for end-to-end service deliver...
Location
Location
United States , Mahwah
Salary
Salary:
180000.00 - 190000.00 USD / Year
techmahindra.com Logo
Tech Mahindra
Expiration Date
April 13, 2026
Flip Icon
Requirements
Requirements
  • 10+ years in IT Infrastructure Operations or Service Delivery Management
  • Experience in Command center services and Datacenter operations
  • Strong background in Network, Midrange servers, and Mainframe operations
  • Proven experience managing 24x7 global operations
  • Technical Knowledge: High level understanding of Network infrastructure (circuits, routers, switches, APs)
  • Technical Knowledge: High level understanding of Server and midrange platforms
  • Technical Knowledge: High level understanding of Mainframe operations (LPARs, IPL, storage, hardware maintenance)
  • Technical Knowledge: High level understanding of ITSM tools (ServiceNow or equivalent)
  • Technical Knowledge: High level understanding of Incident, Change, and Problem Management frameworks
  • Leadership & Soft Skills: Strong stakeholder and executive communication
Job Responsibility
Job Responsibility
  • Service Delivery & Operations Management: Own and manage L1/L1.5 operations delivery across Midrange Servers, Network and Mainframe platforms
  • Service Delivery & Operations Management: Ensure adherence to SLAs and KPIs across all supported technology towers
  • Service Delivery & Operations Management: Drive RAG-based service health reporting and execution of continuous improvement plans
  • Service Delivery & Operations Management: Lead daily, weekly, and monthly service reviews with stakeholders
  • Network Operations Control (NOC): Oversee L1/L1.5 support of global network infrastructure across data centers
  • Network Operations Control (NOC): Ensure event monitoring and incident management for: Data circuits, Routers, switches, access points (APs), Internal and GNS-procured network hardware
  • Network Operations Control (NOC): Coordinate incident remediation with applicable internal teams and external providers
  • Network Operations Control (NOC): Manage troubleshooting and dispatch of Technology Support Group (TSG) via Service Orders
  • Network Operations Control (NOC): Ensure timely escalation and restoration for business-critical network events
  • Midrange Operations (MRO): Manage L1/1.5 support of midrange infrastructure, including: Open systems servers, Critical workstations, Globally deployed services
What we offer
What we offer
  • medical
  • vision
  • dental
  • life
  • disability insurance
  • paid time off (including holidays, parental leave, and sick leave, as required by law)
  • Fulltime
Read More
Arrow Right

Emergency Management and Business Continuity Specialist

Support organizational planning to address disasters, interruptions of business ...
Location
Location
United States , Kansas City
Salary
Salary:
Not provided
kansashealthsystem.com Logo
The University of Kansas Health System
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors Degree from an accredited College or University in Nursing, Disaster Recovery, Incident Response, Emergency Preparedness, Business/Organizational Leadership, Occupational Safety Management, or related field.
  • National Incident Management System (NIMS) IS 100, IS 200, IS 700, and IS 800 training within 120 days of hire. Completes IS 300 and 400 within the timeline set by supervisor.
  • ICS 300: Intermediate Incident Command System for Expanding Incidents.
  • ICS 400: Advanced Incident Command System for Command and General Staff - Complex Incidents.
Job Responsibility
Job Responsibility
  • Support organizational planning to address disasters, interruptions of business functions and enterprise resilience.
  • Assist with the development of plans for continuity of essential functions and resumption of complete business operations.
  • Monitor business and operation changes to ensure plans remain current and valid.
  • Perform business process analysis/business impact analysis (BPA/BIA), risk assessments of essential functions and/or information systems.
  • Collaborate with key infrastructure teams to identify gaps, set recovery time objectives, and convey business needs/expectations.
  • Ensure accurate documentation of system resilience and redundancy is maintained.
  • Collaborates with the Regional Hospital Emergency Preparedness designee in supporting The University of Kansas Health System (TUKHS) Emergency Management Program (EMP0 mission, values, and goals.
  • Support the tests and exercises related to the execution of Business Continuity (BC) plans.
  • Assist in the execution of the table-top, functional and Disaster Recovery exercises.
  • Support the vision and system strategic development, implementation, and sustainment of the Business Continuity Program.
  • Fulltime
Read More
Arrow Right

Data Center Incident Program Manager

The Data Center Incident Program Manager is responsible for designing, operating...
Location
Location
United States
Salary
Salary:
125600.00 - 228000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years in mission-critical infrastructure, data center operations, or reliability engineering
  • Direct experience leading major incidents (P1/P0 equivalent)
  • Strong familiarity with facilities systems, hardware operations, or network infrastructure
  • Demonstrated experience running war rooms and executive updates
  • Experience conducting root cause analysis and corrective action tracking
  • Ability to remain calm and decisive under high-pressure conditions
Job Responsibility
Job Responsibility
  • Define and maintain incident severity levels (SEV definitions), classification criteria, and escalation thresholds
  • Establish end-to-end incident response standards: protocols, lifecycle stages (declare → stabilize → mitigate → recover → close), and operating cadence
  • Build and maintain governance artifacts: runbooks, war room formats, reporting templates, and decision/communication standards
  • Create and operationalize notification trees, stakeholder comms templates (initial, periodic updates, recovery/closure), and executive escalation criteria
  • Define clear RACI across Facilities, Hardware Ops, Network, Security, and vendor/partner teams, including handoffs and accountability paths
  • Set and manage SLAs/OLAs for acknowledgment, escalation, containment, mitigation, and reporting
  • Implement and run incident management tooling (ticketing, paging, logging) and ensure integrations with monitoring and workflow systems
  • Establish dashboards and program health metrics to track incident performance and readiness
  • Lead readiness activities: tabletop exercises, cross-functional simulations, IC/Deputy training, and a rotating on-call IC bench with certification standards
  • Serve as Incident Commander as needed: declare severity, stand up the war room, assign functional leads, and drive structured execution under pressure
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right

Service Engineer II

Are you excited about working on one of Microsoft’s most strategic and high‑visi...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in incident management, service engineering, program management, or related technical roles
  • Strong track record commanding high-pressures, complex, cross-team incidents across cloud or large-scale distributed systems
  • 3+ years of hands-on experience working with cloud technologies (Azure preferred)
  • Strong understanding of Azure architecture, core services, and internal operational workflows
  • Exceptional communication skills, with the ability to simplify complex technical issues for senior executives and customers
  • Experience collaborating in matrixed engineering environments with diverse stakeholders (PG, EngOPS, Field, GPMs, PMs, SREs)
  • Strong analytical skills
  • ability to drive insight from data and influence direction through evidence
  • Proven experience driving pilots, building prototypes, or contributing to innovation in livesite or automation scenarios
  • Demonstrated experience in AI/ML-based solutions—automation, anomaly detection, NLP, or reliability tooling
Job Responsibility
Job Responsibility
  • Lead high‑severity Azure incidents with strong command presence and clear decision‑making under pressure
  • Drive the end‑to‑end incident lifecycle, including detection, triage, mitigation, communication, and post‑incident learning
  • Partner across Azure product groups, EngOPS, and field teams to accelerate diagnosis, reduce time‑to‑mitigation, and drive sustainable fixes
  • Represent the voice of the customer by surfacing systemic issues, platform gaps, and reliability risks to engineering teams
  • Drive operational maturity through repeatable processes, strong governance, high‑quality execution, and measurable reliability metrics
  • Identify live‑site patterns and hotspots across services and lead cross‑team action plans to address them
  • Convert customer and incident pain points into automation, AI‑assisted workflows, and process improvements
  • Lead or co‑own pilots, proofs‑of‑concept, and tech accelerators that enhance incident response velocity and quality
  • Contribute to internal playbooks, frameworks, and tooling that leverage AI/ML for improved live‑site management
  • Fulltime
Read More
Arrow Right

Senior Service Engineer

Are you excited about working on one of Microsoft’s most strategic and high‑visi...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in incident management, service engineering, program management, or related technical roles
  • Strong track record commanding high-pressures, complex, cross-team incidents across cloud or large-scale distributed systems
  • 5+ years of hands-on experience working with cloud technologies (Azure preferred)
  • Strong understanding of Azure architecture, core services, and internal operational workflows
  • Exceptional communication skills, with the ability to simplify complex technical issues for senior executives and customers
  • Experience collaborating in matrixed engineering environments with diverse stakeholders (PG, EngOPS, Field, GPMs, PMs, SREs)
  • Strong analytical skills
  • ability to drive insight from data and influence direction through evidence
  • Proven experience driving pilots, building prototypes, or contributing to innovation in live‑site or automation scenarios
  • Demonstrated experience in AI/ML-based solutions—automation, anomaly detection, NLP, or reliability tooling. Exposure to Power BI, Kusto (KQL), or other analytical tooling
Job Responsibility
Job Responsibility
  • Lead high‑severity Azure incidents with strong command presence and clear decision‑making under pressure
  • Drive the end‑to‑end incident lifecycle, including detection, triage, mitigation, communication, and post‑incident learning
  • Partner across Azure product groups, EngOPS, and field teams to accelerate diagnosis, reduce time‑to‑mitigation, and drive sustainable fixes
  • Represent the voice of the customer by surfacing systemic issues, platform gaps, and reliability risks to engineering teams
  • Drive operational maturity through repeatable processes, strong governance, high‑quality execution, and measurable reliability metrics
  • Identify live‑site patterns and hotspots across services and lead cross‑team action plans to address them
  • Convert customer and incident pain points into automation, AI‑assisted workflows, and process improvements
  • Lead or co‑own pilots, proofs‑of‑concept, and tech accelerators that enhance incident response velocity and quality
  • Contribute to internal playbooks, frameworks, and tooling that leverage AI/ML for improved live‑site management
  • Fulltime
Read More
Arrow Right