This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Wells Fargo is seeking a Senior Systems Operations Engineer
Job Responsibility:
Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
Contribute in increasing system efficiencies and lowering the human intervention time on related tasks
Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
Work with vendors and other technical personnel for problem resolution
Lead team to meet technical deliverables while leveraging solid understanding of technical process controls or standards
Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability
Provide 24x7 production support for enterprise backup and recovery environments using NetBackup, Rubrik ,Cohesity, Avamar, and Data Domain
Perform root cause analysis (RCA) for recurring issues and implement permanent fixes
Troubleshoot complex backup issues (job failures, media/storage/network, replication/duplication, certificates/security) and manage incident/problem/change tickets via ITIL, actively participating in major incident calls to deliver timely resolution and clear communication.
Work closely with backup vendors (Veritas, Cohesity, Dell EMC) for case management, patching, bug fixes, and product escalations.
Collaborate with internal Engineering, Storage, Network, and Security teams to resolve cross-functional issues.
Track vendor escalations to closure and validate implemented solutions.
Manage and support: Backup duplications and replications (source, target, and optimized duplication), Certificates and security configurations (SSL, CA, internal/external certificates, renewals, and troubleshooting), Deliver BMR and encrypted-disk recovery (UEFI/BIOS, GPT/MBR, EFI/BCD, Linux bootloader, drivers/NIC/storage) with secure key handling.
Support backup upgrades, hotfixes, migrations, and new client onboarding.
Use scripting languages such as Shell, Python, Ansible to reduce manual effort and improve operational efficiency.
Drive automation (Python/Ansible/APIs), integrate with ServiceNow/Splunk/Grafana/Teams, enable Docker/K8s/OCP + Azure/GCP, apply AI ops, document, and support on-call/RCA.
Requirements:
4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Experience in Plan, Build, Operate & Maintain enterprise backup solutions, such as: Veritas NetBackup, Rubrik, Cohesity, Dell EMC Avamar
Deep understanding of restore operations and recovery patterns: File-level, VM-level, database-level, application-consistent backups, granular restores, cross-platform recovery
Experience protecting systems with encrypted disks and restoring with appropriate key handling procedures
Hands-on experience defining and meeting RPO/RTO, validating recoverability, and running restore drills / DR tests
Strong scripting (Python , Ansible) and automation skills, API integrations to streamline backup operations and reduce manual effort.
Bare Metal Recovery - Strong understanding of snapshot concepts and mechanics: crash-consistent vs application-consistent, copy-on-write/redirect-on-write behavior, and snapshot performance/space impact.
Ability to design snapshot strategies aligned to RPO/RTO, including frequency, retention, expiration, and restore procedures.
Hands-on experience designing and operating Bare Metal Backup / Bare Metal Recovery (BMR) for physical and/or virtual servers, covering full system image, boot components, and system state.
Proven ability to perform end-to-end bare metal restores from backup media to new hardware or clean VM targets, including post-restore validation and service bring-up.
Strong understanding of boot and partitioning fundamentals impacting recoverability: UEFI vs BIOS, GPT vs MBR, EFI System Partition, Boot Configuration Data (Windows), initramfs/grub (Linux).
Good understanding of DevOps tools like Ansible , Docker , Kubernetes, Github & Jenkins
Good understanding of Cloud technologies like Redhat OCP , Azure , GCP and Cloud Foundry
Good understanding of Storage – Block, File & Object technologies
Experience in integrating automation with enterprise tooling: ServiceNow (ticketing/workflows), Splunk/Grafana (logging/alerting), email/Teams notifications, and dashboards
Ability to leverage AI-assisted operations to improve reliability and reduce MTTR across backup, snapshot, and restore workflows.