This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking a highly skilled Observability and Monitoring Engineer to design, onboard, and manage enterprise observability solutions using Dynatrace, Splunk and other tools. This role is critical to ensuring application performance, operational stability, and proactive monitoring across the ERF line of business. The ideal candidate will have strong technical expertise in monitoring platforms, automation, and governance, coupled with excellent collaboration and communication skills.
Job Responsibility:
Deploy and configure Dynatrace across diverse environments (Windows, Linux, Mainframe)
Onboard applications into Splunk using forwarders, source types, and indexing best practices
Define and implement tagging strategies, dashboards, and alerting policies for Dynatrace and Splunk
Enable full-stack monitoring, including APM, infrastructure, logs, and synthetic monitoring
Implement distributed tracing, anomaly detection, and performance baselining
Develop scripts and workflows for automated onboarding and configuration using APIs
Integrate monitoring solutions with ticketing tools for incident management
Establish retention policies and data governance for logs and metrics
Document onboarding processes, SOPs, and troubleshooting guides
Partner with application teams, infrastructure, and CIO stakeholders to align monitoring strategies
Conduct KT sessions and create documentation for team enablement
Drive reduction in incident MTTR and improve application stability
Enable proactive monitoring and predictive analytics
Promote operational excellence and automation across the organization
Requirements:
Senior Application Programmer
3–5 years of experience in supporting IT Operations
Strong knowledge of monitoring tools (Dynatrace, Splunk)
Experience with scripting languages (Python, Perl, Unix shell)
Creative problem solver who thrives in a fast-paced environment
Must be a team player and demonstrate ability to communicate effectively with both technical and non-technical individuals
Excellent verbal and written communication skills
Clear oral communication and strong English proficiency
Self-starter, motivated, innovative, capable of handling a team and providing technical solutions
Ability to deal with complex information, processes, and relationships to derive simple solutions
Ability to liaise with business and development teams for onboarding and troubleshooting
Hands on experience with Database (Oracle Teradata)
Hands on experience in Unix, Perl, Python
Experience in Job scheduling tool - Autosys
Knowledge on monitoring tools like ITRS Geneous, Splunk, Dynatrace, Remedy
Experience with creating dashboards, reports, alerts, saved searches, and automations as per user/stakeholder requirements
Experience with configuring inputs and new connections with Splunk DB Connect
Knowledge on ITIL concepts like Incident and Problem Management
Knowledge on Incident Management (ITSM Remedy, MyITSM)
Must have previous production support experience
Willing to be flexible sometimes with providing stand-by out-of-hour support on rotational basis for production system (as needed)
Good understanding of financial/banking industry
Nice to have:
Knowledge on Incident Management (ITSM Remedy, MyITSM)
Willing to be flexible sometimes with providing stand-by out of hours support on rotational basis for production system (as needed)
Good understanding of financial/banking industry
Working Knowledge in Telemetry
Creative and strong problem-solving skills
Excellent written and verbal communications skills
Ability to operate in high-pressure situations
Results oriented, and must be able to effectively interact with Senior Management and Business Partners