Observability Engineer – Splunk Focus Job at Inetum (Lisbon)

Principal Engineer - Edge Delivery & Observability

The FT is looking for a Principal Engineer (Individual Contributor) to lead our ...

Location

United Kingdom , London

Salary:

Not provided

Financial Times

Expiration Date

Until further notice

Requirements

Experience technically leading teams and projects
Effective communicator, able to break down tasks as well as giving/receiving constructive feedback
Customer focused, with a strong focus on building and running reliable, stable and secure systems
Enthusiastic about operability & monitoring of systems and Cloud infrastructure
Experience with AWS, Splunk, Grafana, Prometheus, Cloudflare, Route53, Python and Go (or equivalent tools) is beneficial

Job Responsibility

Provide technical direction and support to teams across the group
Lead one or two feature teams to deliver quality tooling and products that reduce developer toil
Work closely with the people manager within the teams
Engage with other disciplines (e.g.delivery, product management) and teams across FT to make sure we are all working together effectively
Model and help set and reinforce our inclusive, respectful, multidisciplinary and open culture
Help continuously improve our technology, process and culture, take ownership of problems and see solutions through to completion
Manage and maintain strong relationships with vendors
Gain a deep understanding of the FT as a business and use that knowledge to communicate clearly with your peers, reports, and senior management
Actively collaborate across teams both within and outside of I&O
Contribute to company-wide processes, frameworks, and guidelines

What we offer

A competitive bonus incentive scheme
Extensive learning and development opportunities including 10% time, tech talks, internal conferences and opportunities to attend external conferences and training
25 days annual leave, increasing to 30 days after 2 years’ service
Generous parental leave
Very competitive pension plan, with the company doubling your contribution

New

DevOps & Infrastructure Support Engineer

Your opportunity: At Schwab, you’re empowered to make an impact on your career. ...

Location

United States , Austin

Salary:

57.21 - 67.79 USD / Hour

Charles Schwab

Expiration Date

June 20, 2026

Requirements

5+ years in production support, reliability engineering, or platform operations within an enterprise environment
Hands-on experience supporting business-critical systems with high uptime requirements
Experience with Java and/or .NET application stacks
WebSphere, IIS, and enterprise middleware
Strong Linux and Windows operational experience
Solid administration skills (RHEL, CentOS, or Ubuntu) with a strong grasp of file systems and permissions
Scripting: Strong hands-on experience writing Bash scripts
Development experience with PowerShell, Python, Bash, Java
Familiarity with SQL, NoSQL databases, Messaging platforms (RabbitMQ, IBM MQ)
Log Analysis: Proven ability to troubleshoot complex issues using system and application logs

Job Responsibility

Systems Administration: Maintain enterprise Linux environments, focusing specifically on file systems, permissions, and system configurations
Troubleshooting: Thoroughly analyze system and application logs to diagnose and resolve complex issues across multiple environments
Own production stability, availability, and performance for a portfolio of Java, .NET, batch jobs, and web-based applications running on Linux, Windows, on-prem, and PCF
Automation & CI/CD: Write and maintain Bash scripts to automate routine operational tasks, and support continuous integration and deployment (CI/CD) pipelines
Configuration Management: Utilize configuration management tools to streamline, automate, and standardize environments
Application Support: Support Java applications, WebSphere Application Server, and manage workloads in Cloud Foundry environments
Observability: Use Splunk and Grafana for log aggregation, creating dashboards, proactive monitoring, and alerting
Networking Integration: Configure and troubleshoot core networking components necessary for application delivery, including DNS, firewall rules, and load balancer routing
Automation & Toil Reduction: Design and build automation (scripts, tooling, frameworks) to eliminate repetitive operational tasks
Improve self-service diagnostics, alert hygiene, and recovery automation

What we offer

401(k) with company match and Employee stock purchase plan
Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
Paid parental leave and family building benefits
Tuition reimbursement
Health, dental, and vision insurance
bonus or incentive opportunities

Fulltime

Application Support Technology Lead Analyst - Vice President

The SRE Observability Specialist is a hands-on expert, delivering the future of ...

Location

India , Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

7+ years of experience in SRE, Observability Engineering, or platform infrastructure roles focused on operational telemetry
Hands-on experience in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms
Deep understanding of SLIs, SLOs, Error Budgets, and telemetry best practices in high-availability environments
Proven ability to troubleshoot integration issues and support observability across hybrid platforms (on-prem, cloud, containers)
Experience building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments
Familiarity with modern observability tooling ecosystems, including AI/ML capabilities, trace correlation, baselining, and alert tuning
Strong interpersonal and collaboration skills
Experience in enablement or platform teams with a track record of scaling best practices across diverse business units
Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience

Job Responsibility

Define the roadmap for Engineering enablers for Project Orion team aligned with enterprise reliability and SRE Services organization goals
Translate Organization strategy into an actionable delivery plan in partnership with Services Products, Operations & Engineering function, delivering incremental, high-value milestones
Understand Critical Business Services functional scope and translate into End-to-End monitoring solutions
Deliver against the observability roadmap for Services Technology by building scalable, reusable telemetry solutions
Periodic review and analyze application monitoring TOIL and collaborate with stakeholders and remediate them as per organization goal
Create and maintain dashboards and visualizations for critical client journeys, including real-time flows across Payments
Guide line-of-business teams in implementing SLIs/SLOs, golden signals, and effective alerting to support operational excellence
Support integration and adoption of observability tooling across on-prem, public cloud (AWS/GCP), and containerized environments (ECS, Kubernetes)
Customize shared dashboards and observability components in partnership with CTI and other central Engineering functions, ensuring usability and flexibility
Provide technical support and implementation guidance to SREs and developers facing integration or tooling challenges

Fulltime

Monitoring Engineer / Incident Manager

A team within Engineering under the Platform Excellence pillar exhibits an unwav...

Location

Netherlands , Amsterdam

Salary:

Not provided

Adyen

Expiration Date

Until further notice

Requirements

At least 5 years of experience with incident management, problem management, incident client communication, and platform monitoring operations
Experience with problem management practices - identifying trends across incidents, conducting root cause investigations and driving preventative action
Solid communication skills and the ability to develop strong working relationships throughout the organization, able to translate technical situations clearly and concisely to a diverse audience via data-visualizing dashboards and written documents
Willing to participate in the on-call rotation and work in a fast-paced, dynamic environment
Experience with monitoring and logging tools like Prometheus, Grafana, ELK Stack, etc.
Experience with observability platforms like Datadog, Dynatrace, Splunk
Excellent analytical and problem-solving skills, with the ability to analyze complex systems and spot the root cause of issues
Thrive in an environment where collaboration is crucial and where a global approach is key for successful implementation of processes and projects
Passion for defining and standardizing processes to drive strategic improvement and able to translate complex technical concepts with ease for all non technical audiences
Natural ability for handling complex situations and multiple responsibilities simultaneously

Job Responsibility

Participate in 24/7 on-call monitoring and observe platform and merchant performance and detect any issues proactively to mitigate risks in partnership with Engineering teams
Coordinate the mitigation, recovery, and resolution of high-impact incidents, ensuring a rapid and effective response across teams
Represent the customer perspective during incidents, maintaining a strong customer-centric approach
Communicate with merchants real time during an incident and present the most accurate and updated information to keep them informed
Escalate critical incidents when needed and provide structured communication to senior management
Go beyond reactive incident response by analyzing incident trends to identify recurring issues and systemic weaknesses and partner with engineering and product teams to advocate for long-term fixes
Work together with Operations, Product, and Engineering teams to integrate, grow, and continuously improve monitoring strategy and increase reliability
Investigate alerts and provide feedback to engineering teams to build effective logging and alerts across the platform architecture
Mitigate merchant impact risk by actioning on alerts in partnership with Engineering teams and contribute to the monitoring playbook by documenting learnings
Improve operations by leading/project managing initiatives and tools development of automation for effective monitoring

Fulltime

Application Production Support Engineer Generative AI

We are seeking a motivated team member to support our AI and DevOps Platform Sup...

Location

India , Chennai

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

6-8 years of relevant experience in technical support, platform operations, or engineering
Exposure to architecture concepts with the ability to contribute to technical discussions and understand design decisions
Experience working with business partners, engineering teams, or technology stakeholders
Demonstrated experience supporting IT services, platform operations, or infrastructure components
Strong verbal and written communication skills, with the ability to document technical issues clearly
Experience supporting operational workstreams or participating in platform improvement initiatives
Participation in resilience‑related or stability‑focused activities preferred
Ability to collaborate effectively with cross‑functional teams
Strong organizational skills and ability to manage daily workload and task priorities
Working knowledge of Generative AI concepts preferred

Job Responsibility

Understand how application support functions within the broader technology organization and contributes to business objectives
Assist with vendor coordination and day‑to‑day interactions with offshore managed services
Support efforts to improve service levels, including participating in incident management, problem management, and knowledge‑sharing initiatives
Partner with development and engineering teams to support application stability and operational readiness
Assist in collecting capacity, performance, and latency data to support platform planning efforts
Support application onboarding activities using established guidelines and standards
Contribute to fostering a collaborative and supportive team environment that encourages skill development
Participate in cost‑efficiency initiatives such as Root Cause Analysis reviews, knowledge management, and performance tuning
Assist in preparing materials for business review meetings and help align technology activities with business needs
Follow established support processes and tool standards and provide input on improvement opportunities

Fulltime

Production Support Developer - Trading Technology

At Schwab, you’re empowered to make an impact on your career. Here, innovative t...

Location

United States , Omaha, NE ; Austin, TX

Salary:

107000.00 - 135000.00 USD / Year

Charles Schwab

Expiration Date

June 28, 2026

Requirements

Bachelor’s degree in Computer Science or a related field, or equivalent practical experience
3+ years of experience in production support, site reliability engineering (SRE), or software operations
Working knowledge of Java (Java 17+ preferred) and SQL for troubleshooting
Experience supporting applications using observability and monitoring tools such as AppDynamics, Splunk, Grafana, InfluxDB, and Control‑M
Oracle Database experience with SQL
3+ years of experience administering Linux systems (RHEL 7/8/9 preferred)
Ability to use shell scripting or Python to automate repetitive operational tasks
Strong communication skills, particularly during incident response and post‑incident reviews
Availability to work nights and weekends as part of a rotating on‑call schedule

Job Responsibility

Safeguard the stability and resilience of Schwab’s Order Management System in a high-availability environment
Own complex production situations—from assessing impact and leading incident response to collaborating across application, infrastructure, database, and vendor partners to restore service
Focus on continuous improvement by identifying patterns, reducing recurring issues, and strengthening monitoring, runbooks, and operational practices that improve availability over time

What we offer

401(k) with company match and Employee stock purchase plan
Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
Paid parental leave and family building benefits
Tuition reimbursement
Health, dental, and vision insurance

Fulltime

!

Microservice Senior Integration Engineer

We are seeking an experienced Senior Integration Engineer to join our Integratio...

Location

India , Bangalore

Salary:

Not provided

Vodafone

Expiration Date

Until further notice

Requirements

3 years of hands-on development experience
Strong in Java and object-oriented programming principles, with deep expertise in Spring Boot
Highly proficient in Apache Camel, including routers and processors
Comfortable working with microservices, SOA, REST APIs and SOAP web services
Experienced with API management platforms such as Kong or Apigee
Knowledgeable in cloud concepts, particularly AWS
Familiar with DevOps and CI/CD concepts, including Maven, Git branching strategies, Docker, Kubernetes and Jenkins
Experienced in monitoring and observability tools such as Splunk, Prometheus, Grafana or AppDynamics
A collaborative problem-solver who challenges constructively and focuses on delivering business value

Job Responsibility

Design, develop and maintain microservices using Spring Boot and Apache Camel within a cloud-based architecture
Build and manage RESTful APIs and shared integration modules for reuse across multiple services
Contribute actively within Scrum or Kanban squads operating in a SAFe framework
Ensure quality, performance, security and reliability of owned services through best practices and standards
Debug and resolve complex integration and production issues using strong analytical skills
Implement unit testing, caching strategies, and performance tuning techniques
Support event-driven and messaging-based integrations using tools such as Kafka and RabbitMQ
Collaborate with peers to foster a culture of continuous improvement, learning, coaching and technical excellence

What we offer

Opportunity to work on large-scale, enterprise integration platforms within a global telco environment
Exposure to modern integration patterns, cloud-native architectures and event-driven systems
A collaborative, agile working culture that values learning, quality and innovation
Access to training, certifications and continuous professional development aligned to future technologies

Fulltime

Technical Project Manager

We are currently seeking a Technical Project Manager to join our team in Pune, M...

Location

India , Pune

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

6-8 years of progressive experience in technical project management, software engineering, or related technical roles
Minimum 3-4 years managing complex technical projects in Hybrid environments
Proven track record delivering software products or platforms from conception through production deployment
Experience working directly with software engineering teams managing full SDLC
Demonstrated success navigating complex technical landscapes with multiple dependencies
Background working in product-led organizations with cross-functional teams
Financial services or regulated industry experience highly preferred
Solid understanding of software development lifecycle (SDLC) and methodologies (Agile, Waterfall, DevOps)
Working knowledge of modern software architecture patterns (microservices, APIs, event-driven, serverless)
Familiarity with cloud platforms (AWS, Azure, GCP) and cloud-native development

Job Responsibility

Define comprehensive project scope, objectives, success criteria, and deliverables aligned with business strategy and technical requirements
Develop detailed project plans including work breakdown structures, schedules, resource allocation, budget estimates, and dependency mapping
Establish project governance structure with clear decision rights, escalation paths, and approval gates
Create realistic timelines incorporating technical complexity, resource constraints, and risk factors
Define and track key project milestones, deliverables, and quality gates
Coordinate and monitor project progress across multiple workstreams and technical teams
Track project performance against baseline using earned value management and agile metrics
Conduct regular status reviews with project teams and stakeholders
Identify schedule slippage, budget variance, or scope creep early and implement corrective actions
Manage project changes through formal change control processes

Fulltime

Select Country

Observability Engineer – Splunk Focus

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?