This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At Citi, we’re passionate about building and maintaining highly reliable APIs that solve critical customer problems. We support mission-critical systems, empowering our customers with a rich feature set, high availability, and stellar performance levels to pursue their financial transactions. As we continue to expand our API scope and capabilities, we are seeking an experienced and dedicated API Production Support Engineer with complete hands-on responsibilities to ensure the operational excellence and continuous improvement of our API ecosystems. This role requires an individual who brings fresh ideas, demonstrates a unique and informed viewpoint on API reliability, and enjoys collaborating with cross-functional teams to develop real-world solutions and ensure positive user experiences at every interaction. Our ultimate goal is to build proactive and predictive operational strategies, including leveraging intelligent automation, to avoid customer impacts.
Job Responsibility:
Champion stability initiatives to enable high availability and resilience for our API applications, including enhancing monitoring, failover mechanisms, and overall system health
Demonstrate calm and analytical capabilities when faced with major incidents on critical API systems, ensuring effective incident, problem, and change management at a global enterprise level
Perform proactive monitoring and management of production API environments, taking a holistic view of system health and performance
Drive the definition, analysis, and reporting of SLIs and SLOs for all supported APIs and clients, ensuring clear performance benchmarks
Contribute to the development and implementation of tools and systems designed to enhance API operational management and the client experience
Measure and optimize API system performance, always pushing capabilities forward, anticipating customer needs, and innovating for continuous improvement
Provide hands-on expert operational support for critical, large-scale distributed API ecosystems
Actively gather and analyze performance metrics from API platforms and underlying infrastructure to assist in performance tuning, fault finding, and capacity planning
Partner closely with API development teams to improve services through rigorous operational feedback loops, testing, and release procedures
Drive the creation of sustainable API operational systems and services through automation and continuous uplifts, including developing, testing, and debugging automated tasks
Conduct thorough post-incident reviews for API-related issues, identifying opportunities for automation and proactive monitoring to prevent recurrence
Actively participate in and take complete hands-on responsibility for high-priority API production support activities, ensuring swift resolution and clear communication
Requirements:
Extensive experience supporting Java and J2EE based applications
Deep technical knowledge and hands-on experience supporting and troubleshooting environments including AWS, ECS, Oracle DB, and Mongo DB
A strong understanding and practical application of SRE concepts, particularly in defining and measuring SLIs, SLOs and Error Budgets
Demonstrated experience in building and utilizing comprehensive monitoring solutions such as AppDynamics, Splunk, Kibana to proactively alert on production API-related issues and ensure system health
In-depth knowledge and hands-on experience with API Gateway technologies, specifically APIGEE, and CDN solutions like Akamai
Proven ability to proactively identify and address problems, areas for improvement, and performance bottlenecks within complex API ecosystems using software-based solutions
Strong coding experience beyond simple scripts, preferably in Java or Python, for automation and internal tool development
Bachelor’s University degree in Computer Science, Engineering, or a related field
or equivalent experience
Nice to have:
Prior experience or awareness of agentic or AI-based solutioning within the API Support domain, particularly for proactive issue detection and resolution
Exposure to ITRS monitoring tools and experience in configuring ITRS gateways
Effective in supporting Payments applications, preferably in corporate banking environments
Strong knowledge of Incident, Release, Change, and Problem management