
Observability Engineer – Splunk Focus

Portugal, Lisbon · Job Posted July 25, 2025

Job Description

Join our growing Monitoring team! As a Splunk Specialist, you will collaborate closely with colleagues across all regions and interact with various internal teams to support and enhance our monitoring capabilities.

Job Responsibility

  • Provide support for monitoring tools: Splunk (Enterprise & ITSI), OpenTelemetry, Cribl, SolarWinds, Dynatrace
  • Automate daily tasks using Ansible
  • Assist development and production teams in migrating to the new Splunk Enterprise and ITSI platforms
  • Build dashboards and define relevant metrics
  • Propose and implement improvements across tools, processes, and KPIs

Requirements

  • Proven expertise in Splunk Enterprise
  • Strong experience with Splunk ITSI
  • Knowledge of Cribl
  • Ability to design and implement Splunk dashboards
  • Familiarity with automation tools (e.g., Ansible)
  • Experience working in multi-regional teams is a plus

Nice to have

  • French


Similar Jobs for Observability Engineer – Splunk Focus

8 matching positions

Principal Engineer - Edge Delivery & Observability

The FT is looking for a Principal Engineer (Individual Contributor) to lead our ...
Location: United Kingdom, London
Salary: Not provided
Financial Times
Expiration Date: Until further notice
Requirements
  • Experience technically leading teams and projects
  • Effective communicator, able to break down tasks and to give and receive constructive feedback
  • Customer focused, with a strong focus on building and running reliable, stable and secure systems
  • Enthusiastic about operability & monitoring of systems and Cloud infrastructure
  • Experience with AWS, Splunk, Grafana, Prometheus, Cloudflare, Route53, Python and Go (or equivalent tools) is beneficial
Job Responsibility
  • Provide technical direction and support to teams across the group
  • Lead one or two feature teams to deliver quality tooling and products that reduce developer toil
  • Work closely with the people manager within the teams
  • Engage with other disciplines (e.g., delivery, product management) and teams across the FT to make sure we are all working together effectively
  • Model and help set and reinforce our inclusive, respectful, multidisciplinary and open culture
  • Help continuously improve our technology, process and culture, take ownership of problems and see solutions through to completion
  • Manage and maintain strong relationships with vendors
  • Gain a deep understanding of the FT as a business and use that knowledge to communicate clearly with your peers, reports, and senior management
  • Actively collaborate across teams both within and outside of I&O
  • Contribute to company-wide processes, frameworks, and guidelines
What we offer
  • A competitive bonus incentive scheme
  • Extensive learning and development opportunities including 10% time, tech talks, internal conferences and opportunities to attend external conferences and training
  • 25 days annual leave, increasing to 30 days after 2 years’ service
  • Generous parental leave
  • Very competitive pension plan, with the company doubling your contribution

DevOps & Infrastructure Support Engineer

Your opportunity: At Schwab, you’re empowered to make an impact on your career. ...
Location: United States, Austin
Salary: 57.21 - 67.79 USD / Hour
Charles Schwab
Expiration Date: June 20, 2026
Requirements
  • 5+ years in production support, reliability engineering, or platform operations within an enterprise environment
  • Hands-on experience supporting business-critical systems with high uptime requirements
  • Experience with Java and/or .NET application stacks
  • WebSphere, IIS, and enterprise middleware
  • Strong Linux and Windows operational experience
  • Solid administration skills (RHEL, CentOS, or Ubuntu) with a strong grasp of file systems and permissions
  • Scripting: Strong hands-on experience writing Bash scripts
  • Development experience with PowerShell, Python, Bash, Java
  • Familiarity with SQL, NoSQL databases, Messaging platforms (RabbitMQ, IBM MQ)
  • Log Analysis: Proven ability to troubleshoot complex issues using system and application logs
Job Responsibility
  • Systems Administration: Maintain enterprise Linux environments, focusing specifically on file systems, permissions, and system configurations
  • Troubleshooting: Thoroughly analyze system and application logs to diagnose and resolve complex issues across multiple environments
  • Own production stability, availability, and performance for a portfolio of Java, .NET, batch jobs, and web-based applications running on Linux, Windows, on-prem, and PCF
  • Automation & CI/CD: Write and maintain Bash scripts to automate routine operational tasks, and support continuous integration and deployment (CI/CD) pipelines
  • Configuration Management: Utilize configuration management tools to streamline, automate, and standardize environments
  • Application Support: Support Java applications, WebSphere Application Server, and manage workloads in Cloud Foundry environments
  • Observability: Use Splunk and Grafana for log aggregation, creating dashboards, proactive monitoring, and alerting
  • Networking Integration: Configure and troubleshoot core networking components necessary for application delivery, including DNS, firewall rules, and load balancer routing
  • Automation & Toil Reduction: Design and build automation (scripts, tooling, frameworks) to eliminate repetitive operational tasks
  • Improve self-service diagnostics, alert hygiene, and recovery automation
What we offer
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
  • Bonus or incentive opportunities
Employment Type: Full-time

Application Support Technology Lead Analyst - Vice President

The SRE Observability Specialist is a hands-on expert, delivering the future of ...
Location: India, Pune
Salary: Not provided
Citi
Expiration Date: Until further notice
Requirements
  • 7+ years of experience in SRE, Observability Engineering, or platform infrastructure roles focused on operational telemetry
  • Hands-on experience in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms
  • Deep understanding of SLIs, SLOs, Error Budgets, and telemetry best practices in high-availability environments
  • Proven ability to troubleshoot integration issues and support observability across hybrid platforms (on-prem, cloud, containers)
  • Experience building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments
  • Familiarity with modern observability tooling ecosystems, including AI/ML capabilities, trace correlation, baselining, and alert tuning
  • Strong interpersonal and collaboration skills
  • Experience in enablement or platform teams with a track record of scaling best practices across diverse business units
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility
  • Define the roadmap of Engineering enablers for the Project Orion team, aligned with enterprise reliability and SRE Services organization goals
  • Translate Organization strategy into an actionable delivery plan in partnership with Services Products, Operations & Engineering function, delivering incremental, high-value milestones
  • Understand Critical Business Services functional scope and translate into End-to-End monitoring solutions
  • Deliver against the observability roadmap for Services Technology by building scalable, reusable telemetry solutions
  • Periodically review and analyze application monitoring toil, and collaborate with stakeholders to remediate it in line with organizational goals
  • Create and maintain dashboards and visualizations for critical client journeys, including real-time flows across Payments
  • Guide line-of-business teams in implementing SLIs/SLOs, golden signals, and effective alerting to support operational excellence
  • Support integration and adoption of observability tooling across on-prem, public cloud (AWS/GCP), and containerized environments (ECS, Kubernetes)
  • Customize shared dashboards and observability components in partnership with CTI and other central Engineering functions, ensuring usability and flexibility
  • Provide technical support and implementation guidance to SREs and developers facing integration or tooling challenges
Employment Type: Full-time

Monitoring Engineer / Incident Manager

A team within Engineering under the Platform Excellence pillar exhibits an unwav...
Location: Netherlands, Amsterdam
Salary: Not provided
Adyen
Expiration Date: Until further notice
Requirements
  • At least 5 years of experience with incident management, problem management, incident client communication, and platform monitoring operations
  • Experience with problem management practices - identifying trends across incidents, conducting root cause investigations and driving preventative action
  • Solid communication skills and the ability to develop strong working relationships throughout the organization, able to translate technical situations clearly and concisely to a diverse audience via data-visualizing dashboards and written documents
  • Willing to participate in the on-call rotation and work in a fast-paced, dynamic environment
  • Experience with monitoring and logging tools like Prometheus, Grafana, ELK Stack, etc.
  • Experience with observability platforms like Datadog, Dynatrace, Splunk
  • Excellent analytical and problem-solving skills, with the ability to analyze complex systems and spot the root cause of issues
  • Thrive in an environment where collaboration is crucial and where a global approach is key for successful implementation of processes and projects
  • Passion for defining and standardizing processes to drive strategic improvement, and the ability to explain complex technical concepts clearly to non-technical audiences
  • Natural ability for handling complex situations and multiple responsibilities simultaneously
Job Responsibility
  • Participate in 24/7 on-call monitoring: observe platform and merchant performance and detect issues proactively to mitigate risks in partnership with Engineering teams
  • Coordinate the mitigation, recovery, and resolution of high-impact incidents, ensuring a rapid and effective response across teams
  • Represent the customer perspective during incidents, maintaining a strong customer-centric approach
  • Communicate with merchants in real time during an incident and present the most accurate, up-to-date information to keep them informed
  • Escalate critical incidents when needed and provide structured communication to senior management
  • Go beyond reactive incident response by analyzing incident trends to identify recurring issues and systemic weaknesses and partner with engineering and product teams to advocate for long-term fixes
  • Work together with Operations, Product, and Engineering teams to integrate, grow, and continuously improve monitoring strategy and increase reliability
  • Investigate alerts and provide feedback to engineering teams to build effective logging and alerts across the platform architecture
  • Mitigate merchant impact risk by acting on alerts in partnership with Engineering teams, and contribute to the monitoring playbook by documenting learnings
  • Improve operations by leading and project-managing initiatives and by developing automation tooling for effective monitoring
Employment Type: Full-time

Application Production Support Engineer Generative AI

We are seeking a motivated team member to support our AI and DevOps Platform Sup...
Location: India, Chennai
Salary: Not provided
Citi
Expiration Date: Until further notice
Requirements
  • 6-8 years of relevant experience in technical support, platform operations, or engineering
  • Exposure to architecture concepts with the ability to contribute to technical discussions and understand design decisions
  • Experience working with business partners, engineering teams, or technology stakeholders
  • Demonstrated experience supporting IT services, platform operations, or infrastructure components
  • Strong verbal and written communication skills, with the ability to document technical issues clearly
  • Experience supporting operational workstreams or participating in platform improvement initiatives
  • Participation in resilience‑related or stability‑focused activities preferred
  • Ability to collaborate effectively with cross‑functional teams
  • Strong organizational skills and ability to manage daily workload and task priorities
  • Working knowledge of Generative AI concepts preferred
Job Responsibility
  • Understand how application support functions within the broader technology organization and contributes to business objectives
  • Assist with vendor coordination and day‑to‑day interactions with offshore managed services
  • Support efforts to improve service levels, including participating in incident management, problem management, and knowledge‑sharing initiatives
  • Partner with development and engineering teams to support application stability and operational readiness
  • Assist in collecting capacity, performance, and latency data to support platform planning efforts
  • Support application onboarding activities using established guidelines and standards
  • Contribute to fostering a collaborative and supportive team environment that encourages skill development
  • Participate in cost‑efficiency initiatives such as Root Cause Analysis reviews, knowledge management, and performance tuning
  • Assist in preparing materials for business review meetings and help align technology activities with business needs
  • Follow established support processes and tool standards and provide input on improvement opportunities
Employment Type: Full-time

Production Support Developer - Trading Technology

At Schwab, you’re empowered to make an impact on your career. Here, innovative t...
Location: United States, Omaha, NE; Austin, TX
Salary: 107000.00 - 135000.00 USD / Year
Charles Schwab
Expiration Date: June 28, 2026
Requirements
  • Bachelor’s degree in Computer Science or a related field, or equivalent practical experience
  • 3+ years of experience in production support, site reliability engineering (SRE), or software operations
  • Working knowledge of Java (Java 17+ preferred) and SQL for troubleshooting
  • Experience supporting applications using observability and monitoring tools such as AppDynamics, Splunk, Grafana, InfluxDB, and Control‑M
  • Oracle Database experience with SQL
  • 3+ years of experience administering Linux systems (RHEL 7/8/9 preferred)
  • Ability to use shell scripting or Python to automate repetitive operational tasks
  • Strong communication skills, particularly during incident response and post‑incident reviews
  • Availability to work nights and weekends as part of a rotating on‑call schedule
Job Responsibility
  • Safeguard the stability and resilience of Schwab’s Order Management System in a high-availability environment
  • Own complex production situations—from assessing impact and leading incident response to collaborating across application, infrastructure, database, and vendor partners to restore service
  • Focus on continuous improvement by identifying patterns, reducing recurring issues, and strengthening monitoring, runbooks, and operational practices that improve availability over time
What we offer
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
Employment Type: Full-time

Microservice Senior Integration Engineer

We are seeking an experienced Senior Integration Engineer to join our Integratio...
Location: India, Bangalore
Salary: Not provided
Vodafone
Expiration Date: Until further notice
Requirements
  • 3 years of hands-on development experience
  • Strong in Java and object-oriented programming principles, with deep expertise in Spring Boot
  • Highly proficient in Apache Camel, including routers and processors
  • Comfortable working with microservices, SOA, REST APIs and SOAP web services
  • Experienced with API management platforms such as Kong or Apigee
  • Knowledgeable in cloud concepts, particularly AWS
  • Familiar with DevOps and CI/CD concepts, including Maven, Git branching strategies, Docker, Kubernetes and Jenkins
  • Experienced in monitoring and observability tools such as Splunk, Prometheus, Grafana or AppDynamics
  • A collaborative problem-solver who challenges constructively and focuses on delivering business value
Job Responsibility
  • Design, develop and maintain microservices using Spring Boot and Apache Camel within a cloud-based architecture
  • Build and manage RESTful APIs and shared integration modules for reuse across multiple services
  • Contribute actively within Scrum or Kanban squads operating in a SAFe framework
  • Ensure quality, performance, security and reliability of owned services through best practices and standards
  • Debug and resolve complex integration and production issues using strong analytical skills
  • Implement unit testing, caching strategies, and performance tuning techniques
  • Support event-driven and messaging-based integrations using tools such as Kafka and RabbitMQ
  • Collaborate with peers to foster a culture of continuous improvement, learning, coaching and technical excellence
What we offer
  • Opportunity to work on large-scale, enterprise integration platforms within a global telco environment
  • Exposure to modern integration patterns, cloud-native architectures and event-driven systems
  • A collaborative, agile working culture that values learning, quality and innovation
  • Access to training, certifications and continuous professional development aligned to future technologies
Employment Type: Full-time

Technical Project Manager

We are currently seeking a Technical Project Manager to join our team in Pune, M...
Location: India, Pune
Salary: Not provided
NTT DATA
Expiration Date: Until further notice
Requirements
  • 6-8 years of progressive experience in technical project management, software engineering, or related technical roles
  • Minimum 3-4 years managing complex technical projects in hybrid environments
  • Proven track record delivering software products or platforms from conception through production deployment
  • Experience working directly with software engineering teams managing full SDLC
  • Demonstrated success navigating complex technical landscapes with multiple dependencies
  • Background working in product-led organizations with cross-functional teams
  • Financial services or regulated industry experience highly preferred
  • Solid understanding of software development lifecycle (SDLC) and methodologies (Agile, Waterfall, DevOps)
  • Working knowledge of modern software architecture patterns (microservices, APIs, event-driven, serverless)
  • Familiarity with cloud platforms (AWS, Azure, GCP) and cloud-native development
Job Responsibility
  • Define comprehensive project scope, objectives, success criteria, and deliverables aligned with business strategy and technical requirements
  • Develop detailed project plans including work breakdown structures, schedules, resource allocation, budget estimates, and dependency mapping
  • Establish project governance structure with clear decision rights, escalation paths, and approval gates
  • Create realistic timelines incorporating technical complexity, resource constraints, and risk factors
  • Define and track key project milestones, deliverables, and quality gates
  • Coordinate and monitor project progress across multiple workstreams and technical teams
  • Track project performance against baseline using earned value management and agile metrics
  • Conduct regular status reviews with project teams and stakeholders
  • Identify schedule slippage, budget variance, or scope creep early and implement corrective actions
  • Manage project changes through formal change control processes
Employment Type: Full-time