This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Engineer the future of global finance. At Citi, our Tech team doesn't just support finance – we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech. We are seeking a technically hands-on Application Support Manager to lead and actively participate in the production support function for Citi's Instant Payments application. This role requires a leader capable of directly managing, troubleshooting, and developing a technical support team, ensuring the operational stability and performance of a modern, cloud-native application deployed across both Citi's Enterprise Cloud and Public Cloud environments.
Job Responsibility:
Directly manage, mentor, and develop a technical support team while actively engaging in day-to-day operational tasks, incident response, and problem resolution for the Instant Payments application
Take direct ownership of ensuring the operational stability and performance of the Instant Payments application across diverse cloud environments (Citi's Enterprise Cloud and Public Cloud), including active monitoring and system checks
Lead the implementation, configuration, and continuous optimization of observability (monitoring, logging, tracing tools), resiliency (designing and implementing auto-healing and retry mechanisms), and recoverability (executing disaster recovery strategies) solutions for the cloud-native Instant Payments application
Directly contribute to improving service levels by implementing operational efficiencies, performing incident management, problem management, and enhancing knowledge sharing practices for the Instant Payments application
Actively participate in defining and implementing application onboarding guidelines and standards. Provide direct technical guidance to development teams on stability and supportability improvements for the Instant Payments application
Lead and execute troubleshooting efforts for complex technical issues, perform in-depth root cause analysis, and implement permanent fixes for the Instant Payments application
Identify and implement opportunities for cost reduction and operational efficiencies through proactive analysis, performance tuning, and the development of automation scripts and tools. Ensure adherence to support process and tool standards
Effectively communicate technical details, application status, operational risks, and support initiatives to product teams, development teams, and relevant stakeholders
Directly ensure operational risk is managed effectively and compliance with applicable policies, rules, and regulations is maintained for the Instant Payments application support function
Requirements:
Progressive, hands-on experience in application support, Site Reliability Engineering (SRE), or technical operations, specifically for mission-critical, high-volume financial applications
Demonstrable direct experience with cloud-native architectures, including active configuration and management of microservices, containers (e.g., Kubernetes), and serverless technologies
Extensive practical experience with major Public Cloud platforms (e.g., AWS, Azure, GCP) and enterprise private cloud environments
Proven track record in implementing and operating comprehensive observability stacks (e.g., Prometheus, Grafana, ELK stack, Jaeger, distributed tracing)
Deep understanding and direct application of resiliency engineering principles (e.g., circuit breakers, bulkheads, retry mechanisms) and robust disaster recovery strategies
Strong technical background in instant payments or real-time financial transaction processing systems is highly desirable
Expertise in automation, scripting (e.g., Python, Go, Shell), and infrastructure-as-code principles (e.g., Terraform, CloudFormation)
Excellent communication, interpersonal, and team leadership skills, with the ability to manage and motivate a technical team while remaining deeply technical
Proven ability to troubleshoot and resolve complex technical issues independently, prioritize effectively, and make sound decisions under pressure
Bachelor's/University degree in Computer Science, Engineering, or a related technical field is required