This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At Schwab, you’re empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us challenge the status quo and transform the finance industry together. We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site 4-days per week during night shifts (2pm-10pm CST), and weekends as needed, in the specified location(s). As a Production Support Engineer, you will help safeguard the stability and resilience of Schwab’s Order Management System in a high-availability environment. Your work directly supports clients by ensuring critical trading and operational systems remain reliable, secure, and performant. In this role, you’ll take ownership of complex production situations—from assessing impact and leading incident response to collaborating across application, infrastructure, database, and vendor partners to restore service. You’ll also focus on continuous improvement by identifying patterns, reducing recurring issues, and strengthening monitoring, runbooks, and operational practices that improve availability over time. You’ll be part of a collaborative, inclusive team that succeeds together as One Schwab. We value trust, accountability, and diverse perspectives, and we encourage curiosity, knowledge sharing, and continuous learning so you can grow your skills while making a meaningful, client-centered impact.
Job Responsibility
Safeguard the stability and resilience of Schwab’s Order Management System in a high-availability environment
Own complex production situations—from assessing impact and leading incident response to collaborating across application, infrastructure, database, and vendor partners to restore service
Focus on continuous improvement by identifying patterns, reducing recurring issues, and strengthening monitoring, runbooks, and operational practices that improve availability over time
Requirements
Bachelor’s degree in Computer Science or a related field, or equivalent practical experience
3+ years of experience in production support, site reliability engineering (SRE), or software operations
Working knowledge of Java (Java 17+ preferred) and SQL for troubleshooting
Experience supporting applications using observability and monitoring tools such as AppDynamics, Splunk, Grafana, InfluxDB, and Control‑M
Oracle Database experience with SQL
3+ years of experience administering Linux systems (RHEL 7/8/9 preferred)
Ability to use shell scripting or Python to automate repetitive operational tasks
Strong communication skills, particularly during incident response and post‑incident reviews
Availability to work nights and weekends as part of a rotating on‑call schedule
Nice to have
Experience with configuration management and automation tools such as Ansible, Salt, or Puppet
Demonstrated ability to identify early indicators of system instability and proactively reduce operational risk
Experience collaborating across teams to improve reliability, scalability, and operational maturity
What we offer
401(k) with company match and Employee stock purchase plan
Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions