This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
About this role: Wells Fargo is seeking a Systems Operations Engineer
Job Responsibility:
Participate in complex technical issues and initiatives related to large scale applications, systems, databases, or other technical products and services
Identify opportunity for process improvements within technical support strategies and plans
Review and analyze technical queries to extract data, create standard databases, or perform limited programming to fine tune systems supporting low to medium risk technical deliverables
Present recommendations for resolving complex technical queries
Exercise some independent judgment to analyze performance trends and recommend process improvements while developing understanding of technical process controls or standards
Work as an internal consultant regarding use of tools and processes
Provide information related to supported system area to functional colleagues, internal partners and stakeholders, including internal customers
Provide hands-on production support for enterprise monitoring, batch management, scheduling, and observability platforms in line with ITIL service operations
Ensure platforms meet defined availability, reliability, and performance objectives, excluding approved maintenance windows
Drive incident management activities including triage, escalation, coordination, remediation, and post-incident reviews
Contribute to problem management by identifying recurring issues and driving root-cause fixes
Execute change management activities such as configuration changes, and platform enhancements following standard controls
Proactively monitor platform health and identify risks to stability, capacity, or performance
Apply an SRE mindset by identifying opportunities for automation, self-service, and toil reduction
Strengthen observability practices through effective use of metrics, logs, dashboards, alerts, and synthetic monitoring
Create, maintain, and enhance runbooks and playbooks to improve mean time to detect (MTTD) and mean time to restore (MTTR)
Participate in on-call rotations and provide timely response and ownership during production incidents
Knowledge and experience of Agentic AI and Data
Requirements:
2+ years of Systems Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Nice to have:
Hands-on experience supporting enterprise application platforms in the areas of monitoring, scheduling, or observability (e.g., Autosys, Grafana, Splunk, Thousand Eyes, or equivalent)
Strong understanding of ITIL processes, including incident, problem, and change management
Proven ability to triage complex production issues, perform root cause analysis, and drive issues to closure
Solid understanding of SRE and reliability engineering concepts, including observability, error budgets, and automation
Ability to operate independently with a strong ownership and accountability mindset
Effective communication skills to work across global teams and time zones