This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Collaborate with engineering, operations, and other stakeholders to understand enterprise architecture, monitoring requirements, and performance goals
Identify and define key performance indicators (KPIs) and metrics, diagnose issues, and proactively identify areas for optimization
Develop and implement observability frameworks, tools, and processes to enable comprehensive monitoring, logging, and tracing of systems and applications
Ensure the availability, scalability, and reliability of infrastructure and deployment environments
Implement and manage monitoring and observability tools (such as AppDynamics, DataDog, Splunk, ELK, or Sentry) to gain insights into system performance and health
Provide timely and accurate reports on application performance, highlighting key insights and trends
Collaborate with digital squads to implement performance improvements, including code optimizations and infrastructure adjustments
Offer guidance and training to end-users and internal teams on best practices for APM and optimizing application performance
Provide recommendations on monitoring systems, logging frameworks, and distributed tracing platforms
Manage and deliver key KPI metrics across enterprise architecture and perform trend analysis
Deliver a proactive monitoring framework across infrastructure and digital experience monitoring domains
Provide expertise in problem detection, isolation, and root cause analysis during incident management, using relevant data and artifacts from observability tools and corresponding systems
Requirements
Around 8+ years of experience with IT infrastructure and applications
3-5 years of hands-on experience in observability and continuous integration
2+ years of programming background in Java or relevant technologies
Knowledge of cloud infrastructure (Azure) and cluster management tools such as Kubernetes
In-depth knowledge of application performance metrics, monitoring, and troubleshooting
Strong communication skills with the ability to align the organization on complex technical decisions
Bachelor's or Master's degree in Information Technology, Computer Science, or a related quantitative discipline