This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At Schwab, you’re empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us challenge the status quo and transform the finance industry together. We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location(s). As a Sr Specialist – Site Reliability Engineer (SRE) within Client Data Technology, you will play a critical role in ensuring the availability, performance, and resiliency of highly visible cloud-based platforms and applications. In this role, you will influence how systems are designed, built, and operated, driving measurable improvements in reliability and scalability while advancing modern SRE practices across the organization. You will partner closely with engineering and platform teams to define and implement sustainable operating models, enabling consistent, repeatable, and high-performing systems at scale. Your impact will include identifying and executing opportunities to enhance service health and telemetry, shaping and delivering forward-looking resiliency and availability roadmaps, and leading the adoption of cloud-native technologies aligned with established SRE standards. Through strong collaboration and technical leadership, you will promote a proactive, shift-left approach that embeds reliability, fault tolerance, and performance into the development lifecycle from the start. This role requires a balance of strategic thinking and hands-on problem-solving to optimize systems, reduce operational toil, and improve key metrics such as MTTD and MTTR, ultimately ensuring a seamless and reliable experience for clients.
Job Responsibility
Ensure availability, performance, and resiliency of highly visible cloud-based platforms and applications
Influence how systems are designed, built, and operated, driving measurable improvements in reliability and scalability
Partner closely with engineering and platform teams to define and implement sustainable operating models
Identify and execute opportunities to enhance service health and telemetry
Shape and deliver forward-looking resiliency and availability roadmaps
Lead adoption of cloud-native technologies aligned with established SRE standards
Promote a proactive shift-left approach embedding reliability, fault tolerance, and performance into the development lifecycle
Optimize systems, reduce operational toil, and improve key metrics such as MTTD and MTTR
Requirements
Bachelor's degree in Computer Engineering, Computer Science, or related field
6+ years of software development and site reliability engineering experience supporting production applications in cloud environments such as Pivotal Cloud Foundry (PCF) or Google Cloud Platform (GCP)
4+ years of DevOps engineering leadership experience focused on automation, tooling, and improving production operations
2+ years of technical leadership experience guiding engineering teams and driving operational efficiencies
2+ years of experience implementing and maturing operational best practices, including SLOs, SLIs, error budgets, monitoring, capacity planning, and incident management processes
Proficiency in programming and automation using tools such as Python, CloudFormation, or Terraform to build infrastructure-as-code solutions
Strong knowledge of database technologies (SQL, Aerospike, Postgres)
Experience working with messaging and streaming platforms such as RabbitMQ and Kafka
Nice to have
4+ years of advanced technical leadership experience supporting highly skilled engineering teams
Demonstrated ability to influence development teams to design and build cloud-native systems that are scalable, maintainable, and resilient from initial deployment onward
What we offer
401(k) with company match
Employee stock purchase plan
Paid time for vacation
Volunteering time
28-day sabbatical after every 5 years of service for eligible positions