Job Description
Are you looking for more? Find it here. At Wells Fargo, we believe that a meaningful career is much more than just a job. It's about finding all the elements that help you thrive, in one place. #LivingTheWellLife means you're supported in life, not just work. It means having a competitive salary, a robust benefits package, and programs to support your work-life balance and well-being. It means being rewarded for investing in your community, celebrated for being your authentic self, and empowered to grow.

About this role:

Wells Fargo is seeking a Lead Systems Operations Engineer in technology as part of Commercial and Corporate & Investment Banking Technology (CCIBT). The team is seeking a Payments Modernization Platform Engineer / SRE to join Payments & Liquidity Technology (GPLT), supporting the transformation of the high-value payments landscape as we modernize legacy wire platforms onto a new, event-driven, cloud-native architecture.

This role sits at the intersection of engineering, service management, and reliability. You will be embedded early in the Payments Modernization lifecycle to ensure platforms are operable, resilient, observable, and supportable by design, not retrofitted after go-live. You will work closely with Application Engineering, Architecture, and Service Operations teams to shape how new payment capabilities are built, tested, released, and stabilized at scale.

This role embeds Site Reliability Engineering and production engineering practices into Payments Modernization initiatives from design through early life support, ensuring platforms are scalable and operationally ready. The engineer defines and validates non‑functional requirements, leads capacity and performance testing for high‑volume and peak payment events, and ensures robust replay, retry, and exception handling for event‑driven flows.
They own permit‑to‑operate readiness, cutover and early life support models, and ensure runbooks, on‑call readiness, and escalation paths are production‑grade before go‑live. The role drives end‑to‑end observability, SLOs and error budgets, reduces alert noise, and analyzes early defect patterns to stabilize services. Through chaos engineering, blameless RCAs, and continuous service improvement, this position strengthens recovery, reduces risk, and improves the reliability of critical payment platforms.