This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Wells Fargo is seeking a Senior Technology Resiliency Engineer. We are building a team of senior technology resiliency engineers to define and shape the future of how technology operates at scale within a global financial institution. This role is responsible for designing, influencing, and embedding enterprise-wide resiliency strategies across platforms, applications, infrastructure, and operating models. The Senior Technology Resiliency Engineer will partner closely with engineering, architecture, risk, cyber security, operations, and business leadership to ensure critical services are resilient by design, aligned to regulatory expectations, and capable of operating through severe but plausible disruption scenarios. This role requires deep technical expertise, strong systems thinking, and the ability to influence at VP/MD level—bridging engineering reality with strategic intent.
Job Responsibility
Define and evolve the enterprise technology resiliency strategy, aligned to business critical services, regulatory expectations, and long-term technology roadmaps
Establish resiliency design principles and patterns for cloud, hybrid, and on-prem platforms (e.g., multi-region, multi-AZ, active/active, degradation strategies)
Influence architecture decisions to ensure resiliency, recoverability, and operability are built in from inception—not retrofitted
Partner with enterprise and domain architects to embed resiliency requirements into standards, reference architectures, and engineering practices
Support identification and mapping of Important Business Services and underpinning technology dependencies
Translate business impact tolerances into actionable technology recovery objectives (RTO, RPO, MTO)
Assess end-to-end service resilience, identifying single points of failure across applications, data, infrastructure, vendors, and people
Drive remediation strategies for material resiliency gaps, balancing risk reduction with real-world delivery constraints
Act as a trusted advisor to engineering teams on fault tolerance, high availability, disaster recovery, and graceful degradation
Review major platform and system designs from a resiliency and operability perspective
Promote modern resiliency practices such as: Chaos engineering and failure injection
Automation-first recovery
Observability and service-level indicators
Immutable infrastructure and infrastructure-as-code
Guide the adoption of resiliency tooling and metrics at scale
Shape the firm’s approach to resilience testing, including scenario-based testing, disaster recovery exercises, and severe-but-plausible events
Design and participate in enterprise simulation exercises involving technology and business stakeholders
Drive a culture of continuous learning through post-incident analysis focused on systemic improvement rather than blame
Ensure lessons learned feed directly into architecture, standards, and engineering practices
Partner with Risk, Compliance, and Audit to ensure resiliency practices meet internal policy and external regulatory expectations
Contribute to regulatory responses, exams, and remediation programs related to operational and technology resilience
Help define meaningful, decision-useful resiliency metrics and management reporting
Act as a senior point of expertise during regulatory discussions related to technology resilience
Serve as a subject-matter leader within the resiliency engineering community
Mentor junior engineers and help build resiliency engineering as a distinct capability
Influence senior technology leadership by articulating complex technical risk in clear business terms
Contribute to the long-term evolution of the bank’s technology operating model
Requirements
7+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
7+ years of experience in large-scale technology engineering, architecture, SRE, platform engineering, or infrastructure roles
7 plus years Proven experience designing and operating highly available, distributed systems in complex enterprise environments
7+ years' experience with: Cloud and hybrid architectures (AWS, Azure, GCP or equivalent)
Disaster recovery and availability patterns
Data replication and consistency models
Observability, monitoring, and incident management
7+ years' experience influencing architecture and design decisions at scale
Nice to have
Experience in banking, capital markets, or other highly regulated industries
Background in Site Reliability Engineering (SRE) or large-scale production operations
Experience with chaos engineering or large-scale resilience testing
Familiarity with service management, ITIL, or modern operating model transformations
Advanced degree in Computer Science, Engineering, or equivalent practical experience
What we offer
Health benefits
401(k) Plan
Paid time off
Disability benefits
Life insurance, critical illness insurance, and accident insurance