CrawlJobs Logo

Senior Site Reliability Engineer - GCP & Container Platforms

https://www.wellsfargo.com/ Logo

Wells Fargo

Location Icon

Location:
United States , CHARLOTTE, North Carolina / CHANDLER, Arizona

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are seeking a Senior Site Reliability Engineer (SRE) to help develop our cybersecurity platform operations across Windows, Linux, and cloud-native environments. This role is central to our transformation from app-specific support to platform-wide reliability engineering. You will bring deep expertise in Google Cloud Platform (GCP), container orchestration, and automation, enabling scalable, secure, and resilient infrastructure that supports diverse applications across our enterprise.

Job Responsibility:

  • Ensure high availability, performance, and security of production systems across Windows, Linux, and GCP environments
  • Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling scalable microservices architectures
  • Lead infrastructure provisioning and configuration using Terraform, Ansible, and GCP-native tools
  • Develop automation scripts and pipelines to eliminate manual toil and accelerate incident response
  • Implement observability frameworks using SLIs/SLOs, Prometheus, Grafana, and GCP Operations Suite
  • Drive proactive monitoring, alerting, and telemetry across hybrid environments
  • Lead incident response, root cause analysis, and postmortems
  • Build self-healing systems and automated remediation workflows using GCP-native services and scripting
  • Collaborate with InfoSec to enforce hardening standards, manage vulnerabilities, and support compliance initiatives
  • Integrate security into CI/CD pipelines and container platforms using IAM, encryption, and policy enforcement
  • Partner with developers, application owners, and infrastructure teams to deliver reliable, cloud-native platforms
  • Document configurations, runbooks, and operational procedures to enable cross-team reuse and transparency

Requirements:

  • 4+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 2+ years of experience with GCP services, including GKE, IAM, Cloud Functions, and Cloud Monitoring

Nice to have:

  • 4+ years of experience in Windows Server administration and production support
  • Strong scripting skills in PowerShell, Python, or Shell
  • Proficiency in container technologies: Docker and Kubernetes
  • Familiarity with Linux system administration and hybrid cloud environments
  • Experience with infrastructure-as-code tools: Terraform, Ansible
  • Strong understanding of Active Directory, DNS, DHCP, and Windows security principles
  • Security certifications (e.g., CISSP, Security+, GCP Professional Cloud Security Engineer)
  • Experience with CI/CD tools (e.g., GitLab CI and Jenkins)
  • Familiarity with ITIL practices and change management
  • Exposure to ServiceNow, load balancers, certificate management, and endpoint protection tools

Additional Information:

Job Posted:
February 08, 2026

Expiration:
February 13, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Site Reliability Engineer - GCP & Container Platforms

Senior Vice President, Cloud Security Site Reliability Engineer

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 8+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills. Able to communicate efficiently at multiple levels of seniority
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 5+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets
  • Experience of k8s and container technologies such as Docker, Openshift and EKS.
  • Experience with public cloud technologies such as AWS, GCP or Azure
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organization
  • Actively owning production level incidents till resolution.
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer Cloud Platform

Zilliz is a fast-growing startup developing the industry’s leading vector databa...
Location
Location
Salary
Salary:
175000.00 - 225000.00 USD / Year
zilliz.com Logo
Zilliz
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems
  • Proficiency in scripting languages such as Python, Go, or Java
  • Strong knowledge of container orchestration technologies like Kubernetes and Docker
  • Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools
  • Experience with infrastructure as code tools such as Terraform or Ansible
  • Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo
  • Proven ability to troubleshoot complex distributed systems and resolve issues promptly
  • Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines
  • Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously
Job Responsibility
Job Responsibility
  • Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms
  • Ensure the reliability, availability, and performance of Zilliz’s distributed database systems
  • Develop and implement strategies for monitoring, incident management, and disaster recovery
  • Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention
  • Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness
  • Collaborate with software engineers to enhance system reliability, scalability, and performance
  • Maintain and improve the CI/CD pipeline to ensure smooth and rapid deployment of changes
  • Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency
  • Fulltime
Read More
Arrow Right

Vice President - Cloud Security Site Reliability Engineer

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills. Able to communicate efficiently at multiple levels of seniority
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 4+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets
  • Experience of k8s and container technologies such as Docker, Openshift and EKS
  • Experience with public cloud technologies such as AWS, GCP or Azure
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation
  • Actively owning production level incidents till resolution.
  • Fulltime
Read More
Arrow Right

Senior Full Stack Engineer

The Senior Full Stack Engineer will support the modernization of IRS mission-cri...
Location
Location
United States , McLean
Salary
Salary:
Not provided
bln24.com Logo
BLN24
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or related field
  • Minimum 6 years of experience in full-stack software development and architecture
  • Demonstrated expertise in designing and implementing RESTful and GraphQL APIs, and building service-oriented architectures
  • Proficiency with front-end frameworks such as React, Angular, or Vue, and backend technologies such as Node.js, Python, Java, or Spark
  • Solid working knowledge of core web technologies, including HTML, CSS, JavaScript, and modern UI component libraries
  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and container orchestration tools including Kubernetes and OpenShift
  • Familiarity with platforms such as Databricks for data engineering, pipeline integration, or ML model support
  • Experience designing scalable, secure web applications and microservices architectures with considerations for caching, authentication, and maintainability
  • Working knowledge of SQL and NoSQL databases, CI/CD pipelines, infrastructure-as-code, and cloud monitoring tools
  • Experience collaborating in Agile delivery environments, and contributing to code reviews, documentation, and team-based development workflows
Job Responsibility
Job Responsibility
  • Design and develop scalable APIs using REST, GraphQL, and gRPC in compliance with IRS enterprise architecture and security standards (OAuth, JWT)
  • Lead full-stack development of modern, modular web applications that interface with IRS systems and external users
  • Decompose and migrate legacy system functionality (e.g., COBOL-based command codes) into modern service-oriented components
  • Integrate AI-driven services, including ML model endpoints, auto-generated documentation, code conversion workflows, and intelligent test automation
  • Implement CI/CD pipelines and automated testing tools (e.g., Postman, Newman) to ensure secure, validated, and maintainable code
  • Collaborate with DevOps and Site Reliability Engineers to embed observability tools (e.g., Prometheus, Datadog, New Relic) and monitoring dashboards
  • Translate business and functional requirements into API contracts and reusable service patterns, working within Agile Scrum teams
  • Maintain backward compatibility with legacy systems while building toward scalable, cloud-optimized services
  • Ensure IRS and Treasury IT governance compliance, including Section 508 accessibility and cybersecurity policies
What we offer
What we offer
  • Generous medical, dental, and vision plans
  • Opportunity to work in different sectors
  • Flexibility to balance quality work and personal lives
  • Remote working opportunities
  • Fulltime
Read More
Arrow Right
New

CT Technologist Internship

Baptist Health is offering a 4-month Computed Tomography (CT) Internship at Bapt...
Location
Location
United States , Jacksonville
Salary
Salary:
Not provided
baptistjax.com Logo
Baptist Health (Florida)
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Graduate of an approved School of Radiology
  • BLS certification
  • Meet ARRT ethics and education standards
  • CRT licensure
  • Commitment to sit for the ARRT CT exam after program completion
Job Responsibility
Job Responsibility
  • Gain hands-on experience in CT imaging under direct supervision
  • Assist with patient transports
  • Assist with patient screenings
  • Assist with record management
  • Progression to performing diagnostic CT exams as training advances
  • Complete clinical procedures required for ARRT CT registry
What we offer
What we offer
  • Competitive pay
  • Benefits
  • Growth opportunities
  • Supportive, inclusive workplace
  • Fulltime
Read More
Arrow Right
New

Night Care Assistant

Come and be part of the great Stonehaven team in a home from home environment. W...
Location
Location
United Kingdom , Exeter
Salary
Salary:
12.82 - 13.07 GBP / Hour
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Right to work in the UK
  • Nice, friendly, dedicated personality
  • High self-set standards
  • Positive behavior at all times
  • Putting residents' best interests at heart
What we offer
What we offer
  • Comprehensive learning package
  • Competitive pay with enhancements
  • Fulltime
Read More
Arrow Right
New

Kitchen Porter

Are you ready to be an essential part of a bustling kitchen, working behind the ...
Location
Location
United Kingdom , Castlefield
Salary
Salary:
12.21 GBP / Hour
brunningandprice.co.uk Logo
Brunning & Price
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Previous experience as a kitchen assistant, kitchen porter, pot washer or kitchen cleaner in a busy pub, restaurant or hotel is preferred but not essential
  • Keen to work in a friendly kitchen where you’ll really be part of the team
What we offer
What we offer
  • Paid overtime
  • Great cash tips
  • Free meals on shift
  • 30% discount for you, your friends and family across B&P and our group including wagamama
  • NEST pension
  • Great discounts via Perks on Tap saving you money on everyday purchases and more
  • £1,000 referral bonus for introducing new Managers or Chefs to the company
  • Wagestream - use flexible pay to choose when to get paid
  • Weekly pay
  • Free 24-hour confidential legal and information helpline for you and your family
  • Fulltime
Read More
Arrow Right
New

Lead Infrastructure as Code (IaC) Developer

Wells Fargo is seeking a seasoned Lead Infrastructure as Code (IaC) Developer to...
Location
Location
United States , IRVING, CHARLOTTE
Salary
Salary:
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
February 09, 2026
Flip Icon
Requirements
Requirements
  • 5+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of full‑stack software development experience using Java
  • 3+ years of experience deploying and operating Redis Enterprise (cloud, hybrid, or on‑prem)
  • 3+ years of experience with data modeling and SQL
  • 3+ years of experience with IaC tools such as Terraform and Ansible
  • 3+ years of experience implementing GitOps or similar tools
  • 2+ years of experience with Kubernetes/OCP, containerization, and hybrid cloud platforms (AWS, Azure, GCP)
  • 2+ years of experience designing and consuming RESTful APIs and integrating automation into platform services
Job Responsibility
Job Responsibility
  • Lead large-scale initiatives to automate provisioning, configuration, and lifecycle operations for Redis Enterprise clusters and databases
  • Architect and develop reusable IaC components (Terraform, Ansible) for Redis Enterprise cluster creation, node scaling, database provisioning, failover configuration, and policy enforcement
  • Develop robust APIs using Java SpringBoot to expose Redis Enterprise provisioning, configuration, capacity management, and governance workflows
  • Design and implement GitOps-driven workflows to automate cluster and database changes—such as database sizing, sharding policies, persistence modes (AOF/RDB), and multi‑zone topologies
  • Build and maintain self-service platform capabilities enabling developers to provision Redis Enterprise databases, request resources, apply configurations, and consume metrics via intuitive APIs or service catalogs
  • Define and enforce Redis Enterprise platform standards, including memory management policies, eviction strategies, HA/DR patterns, Active‑Active topology guidelines, and enterprise security standards
  • Collaborate across engineering, security, and product teams to align Redis Enterprise automation with organizational priorities and best practices
  • Participate in architecture and code reviews while mentoring engineers on Redis Enterprise operations, IaC patterns, and automation best practices
  • Continuously improve platform reliability, performance, scalability, and operational efficiency through automation and infrastructure modernization
What we offer
What we offer
  • Relocation assistance in not available for this position
  • Fulltime
!
Read More
Arrow Right