This opportunity involves a Senior L2 technical lead role with ownership of production system stability, performance, and availability. The role includes leading incident analysis, driving root cause investigations, coordinating with L1, L3, vendors, and cross-functional IT/network teams, and ensuring timely resolution with mitigation strategies. Responsibilities include deep technical analysis, design and code reviews, configuration validation, production deployments, and CR governance aligned with ITIL processes. The role supports TIBCO middleware and web platforms, drives automation initiatives, and ensures SLA compliance through operational excellence and continuous improvement.
Job Responsibilities:
Monitor production systems and ensure their performance and stability
Act on issues reported by L1, L2, and monitoring teams, and coordinate with L3 and vendor teams to restore service after any fault
Resolve trouble tickets covering day-to-day customer issues
Coordinate with IT/network teams to resolve any operational or functional issues
Answer ad hoc requests from various teams
Analyze logs to resolve issues, escalate early when needed, and suggest alternative approaches
Deploy CRs on production environments and ensure smooth system performance
Follow ITIL processes and demonstrate expertise in incident, problem, and change management
Ensure SLA compliance for application and trouble tickets
Participate in solving support issues/change requests in the support phase
Follow configuration and release management processes
Carry out deployments in line with MOPs and SOPs
Ensure system uptime and optimal performance by monitoring and carrying out BAU activities such as housekeeping, health checks, and alert monitoring
Manage the EMS server for the TIBCO environment
Manage Hawk instances, creating new alerts and optimizing existing ones to make the system more robust and intuitive
Automate mundane, repetitive jobs through shell scripting
Requirements:
Deep expertise in TIBCO BWCE, BW 6.x, and Mashery, with strong hands-on capability in process development, code-level debugging, performance tuning, scaling, and production-grade integration support in complex environments
Proven experience in API and integration architecture, including API lifecycle management, security policies, throttling, containerized deployments (BWCE), and applying integration patterns for reliability, scalability, and governance
Strong messaging and streaming foundation, with excellent command over TIBCO EMS (HA/DR, failover, optimization, admin utilities) and good working knowledge of Apache Kafka, Node.js, and asynchronous, high-throughput system design
Solid Oracle DB and SQL skills for operational support, including log analysis (CLE tables), custom query creation, understanding DB dependencies, impact analysis, and application recovery during database incidents
Sound Java and protocol-level understanding, enabling effective troubleshooting of JVM-based frameworks and seamless handling of JMS, REST, SOAP, and HTTP integrations in production
ITIL-aligned operational excellence, with structured handling of incidents, problems, and changes, strong RCA mindset, improved MTTR, and consistent SLA adherence in L2 support engagements
Operational observability and automation exposure, including CLE logging, Splunk/ELK dashboards, monitoring, CI/CD-based deployments using Jenkins, and continuous improvement through DevOps practices
Excellent communication and stakeholder coordination, enabling clear incident updates, cross-team collaboration, risk articulation, and confident engagement with engineering, operations, and management teams