This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Help safeguard the stability and resilience of Schwab's Order Management System in a high-availability environment
Take ownership of complex production situations—from assessing impact and leading incident response to collaborating across application, infrastructure, database, and vendor partners to restore service
Focus on continuous improvement by identifying patterns, reducing recurring issues, and strengthening monitoring, runbooks, and operational practices that improve availability over time
Requirements:
Bachelor's degree in Computer Science or a related field, or equivalent practical experience
5+ years of experience supporting production systems in site reliability engineering, production support, or software operations roles
Ability to lead high-severity incidents by assessing impact, coordinating response efforts, and restoring service efficiently
Strong troubleshooting skills applied to distributed systems and SQL-backed applications
Ability to analyze operational data to identify root causes and deliver lasting reliability improvements
Clear and effective communication skills, with the ability to explain technical issues to both technical and non-technical partners
Ability to participate in a rotating on-call schedule to support 24x7 operations
Nice to have:
Experience supporting Java-based applications in complex, high-availability environments
Ability to strengthen monitoring and alerting by improving signal quality, coverage, and actionable insights
Working knowledge of Oracle database operations, including SQL analysis and performance troubleshooting
Experience supporting Linux-based systems and applying performance tuning fundamentals
Ability to automate repetitive operational tasks using scripting or programming languages
Experience creating or improving runbooks, escalation models, and change-management practices
Demonstrated ability to mentor, coach, or guide peers during incident response or operational reviews
What we offer:
401(k) with company match and Employee stock purchase plan
Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions