This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Wells Fargo is seeking a Senior Systems Operations Engineer.
Job Responsibility:
Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
Contribute in increasing system efficiencies and lowering the human intervention time on related tasks
Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
Work with vendors and other technical personnel for problem resolution
Lead team to meet technical deliverables while leveraging solid understanding of technical process controls or standards
Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability
Perform Administration and Infrastructure management of CI/CD tools like UCD, JFrog Artifactory, Github SaaS, Github Actions, Harness, Jenkins, etc.
Manage and maintain a large sized CI/CD Infrastructure footprint - on-premise and on public or vendor cloud – for various CI/CD tools – on VMs and on Kubernetes
Respond to and resolve Incidents on above CI/CD tools in a timely manner
Plan and execute Change Tasks, Tool Upgrades, Alerts and Incidents resolution, Business Continuity Planning(BCP) exercise, etc.
Work per on-call schedule (morning/evening shifts over weekdays/weekends)
Create and maintain operational runbooks for standard incident categories and problem management
Experience with monitoring, operational dashboards, alert configurations, log analysis using tools like Splunk, Dynatrace, Prometheus, Grafana, etc.
Interact and build a strong relationship with multiple teams like Engineering teams and CIO application teams
Requirements:
4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
4+ years in CI/CD Tools Administration and large scale CI/CD tools Infrastructure management (on-premise and on cloud)
Good knowledge of python and experience of using it to automate Infrastructure operations
Experienced in creating alerts, creating dashboards and using Observability and monitoring tools like AppDynamics, Splunk, Grafana, Prometheus, ThousandEyes
Experienced in managing and scaling Docker and Kubernetes based applications and infrastructure
Establish metrics to review and manage capacity, performance, error rate, etc. regularly to pro-actively manage scale, performance and stability
Fundamental knowledge of GenAI like basics of Large Language models, prompt engineering, RAG, etc.