We are seeking a proactive and technically skilled Production Support Engineer to join our global support team. The ideal candidate will have hands-on experience with OpenShift (OSE), APIs, and Splunk, and will be responsible for ensuring high availability and stability of critical business applications. This role requires a strong sense of ownership, rapid incident response capabilities, and the ability to troubleshoot across complex, distributed environments.
Job Responsibilities:
Provide L2/L3 production support for business-critical applications hosted on OpenShift
Troubleshoot and resolve application and infrastructure issues, ensuring minimal impact to end users
Monitor system health using Splunk dashboards and alerts, and proactively identify and address anomalies
Collaborate with development and DevOps teams to triage and resolve API-related performance or connectivity issues
Write and maintain knowledge base articles, runbooks, and post-incident reports (RCA)
Participate in on-call rotation and major incident management
Automate monitoring, alerting, and operational tasks to improve reliability and efficiency
Appropriately assess risk when making business decisions, with particular consideration for the firm's reputation and for safeguarding Citigroup, its clients, and its assets. This includes driving compliance with applicable laws, rules, and regulations; adhering to Policy; applying sound ethical judgment regarding personal behaviour, conduct, and business practices; escalating, managing, and reporting control issues with transparency; and effectively supervising the activity of others and holding accountable those who fail to maintain these standards.
Requirements:
Strong experience in Red Hat OpenShift (OSE) environments
Proficient in supporting and debugging RESTful APIs, including authentication, performance, and error handling
Expertise in Splunk: creating dashboards, alerts, and performing deep log analysis
Strong scripting skills (e.g., Bash, Python, or PowerShell) for automation and diagnostics
Working knowledge of ITIL practices and incident/problem management processes
Familiarity with CI/CD pipelines (Jenkins, GitLab CI)
Experience with cloud platforms (AWS, Azure, or GCP)