This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Lead Infrastructure and Application Disaster Recovery testing and Data Center Power-down events. Drive adoption of the mandated controls which are in place with application teams. Provide guidance to application owners on how they can adapt a recovery procedure to adhere to the uplifted controls in place. Disaster Recovery tests scope events to include the interdependencies of shared services, up-steam and downstream application dependencies, Order of recovery, etc. Cyber Attack Recovery Testing Driving teams to become resilient and have the ability to recover during a cyber-attack, Test the cyber-attack recovery procedures. Power-down events establish critical milestones, establish order of recovery, verify dependency of various infrastructure components. Coordinate and manage regulatory resiliency recovery tests, such as SIFMA's industry-wide exercises, SPOOR-related tests, and those guided by the Monetary Authority of Singapore (MAS), to ensure compliance with industry standards and regulatory requirements. This involves liaising with various internal & external teams, scheduling test activities, monitoring progress, and documenting outcomes to support robust audit and risk management processes. Identify gaps in process and procedures and enhance those processes. Identify opportunities for automation. Oversee and manage the execution plans. Initiate inventory, infrastructure & Application ready for business checks. Manage incidents and escalations related to the activities we perform.
Job Responsibility:
Lead Infrastructure and Application Disaster Recovery testing and Data Center Power-down events
Drive adoption of the mandated controls which are in place with application teams.
Provide guidance to application owners on how they can adapt a recovery procedure to adhere to the uplifted controls in place.
Disaster Recovery tests scope events to include the interdependencies of shared services, up-steam and downstream application dependencies, Order of recovery, etc.
Cyber Attack Recovery Testing Driving teams to become resilient and have the ability to recover during a cyber-attack, Test the cyber-attack recovery procedures.
Power-down events establish critical milestones, establish order of recovery, verify dependency of various infrastructure components
Coordinate and manage regulatory resiliency recovery tests, such as SIFMA's industry-wide exercises, SPOOR-related tests, and those guided by the Monetary Authority of Singapore (MAS), to ensure compliance with industry standards and regulatory requirements. This involves liaising with various internal & external teams, scheduling test activities, monitoring progress, and documenting outcomes to support robust audit and risk management processes
Identify gaps in process and procedures and enhance those processes.
Identify opportunities for automation
Oversee and manage the execution plans
Initiate inventory, infrastructure & Application ready for business checks
Manage incidents and escalations related to the activities we perform.
Requirements:
Bachelor’s degree
Minimum 4-5 years of experience in technology stack including infrastructure and application
Experience in Managing Resiliency testing for On-Prem Database, NAS, Object Storage, Block Storage etc.,
Understanding of disaster recovery procedures
Understanding of RTO, RPO and how these metrics are calculated
Knows differences between resiliency testing and cyber-attack recovery/Repave test.
Background in cyber-attack recovery
Background in disaster recovery.
Strong analytical, communication, interpersonal, problem solving, organizational and time management skills
Basic understanding of excel and the ability to manipulate data using excel Knowledge of basic excel formulas used in data manipulation
Self-motivated with an ability to work on one's own with a strong sense of ownership and accountability
Highly organized, strong attention to detail and excellent follow-up skills
Strong process and project management skills with the ability to improve process efficiency and effectiveness
Strong written and verbal communication skills with an ability to summarize complicated technical information to people with less technical knowledge
Excellent influencing skills at all levels and the ability to develop and maintain good relationships with senior leadership, colleagues, and clients
Nice to have:
5-7 years of experience in disaster recovery and cyber-attack recovery programs.
Hands on experience in Managing Resiliency testing for On-Prem Database, NAS, Object Storage, Block Storage etc.,
Hands-on expertise with Cloud platforms (AWS, Azure, GCP) and Kubernetes to support, manage and DR activities
Key player in building a disaster recovery program and extensive knowledge of RTO, RTA, RPO, RPA, MTD and other DR metrics.
Has guided teams in building recovery test plans and has understanding of what should be in disaster recovery plans.
Candidates possess solid understanding of core Data Center Infrastructure ( Network Appliances, Storage technology, Unix/Linux/Windows, IP Telephony etc), order of recovery in case of any incident.
Strong understanding of various excel formulas used for data manipulation in excel.
Project Management skills with ability to coordinate multiple Disaster Recovery tests and/or power down events simultaneously
An understanding of anyone, or more, of the following Technology Risk domains to include information security, business continuity, technology resilience, controls monitoring, risk assurance, and risk governance
Prior experience as either System Administrator or Application support role
Ability to perform analysis or troubleshooting when an issue arises and provide possible alternatives to help establish solutions and confirm remediation of the issue