Senior Observability Engineer Job at Foodstuffs South Island Limited

Senior Observability Engineer

Our client, a large professional services firm, is looking to hire a Senior Obse...

Location

United States

Salary:

Not provided

ClearBridge Technology Group

Expiration Date

Until further notice

Requirements

Hands-on Grafana dashboard and data source experience
Experience with Grafana Loki and LogQL
Experience deploying and configuring OpenTelemetry Collector
Grafana Alloy experience preferred
Logstash experience preferred, especially syslog parsing, Grok, filtering, and normalization
Experience collecting syslog from network devices
Experience with Cisco or similar network device log formats
Experience collecting network telemetry such as CPU, memory, uptime, interface status, bandwidth, errors, discards, device health, and alarms
Familiarity with SNMP, syslog, network device CLI configuration, and collector-based monitoring
Ability to troubleshoot ingestion, parsing, dropped logs, pipeline health, and telemetry flow issues

Job Responsibility

Support a short-term Grafana-based observability migration proof of concept
Collect, parse, normalize, and validate log data and network telemetry from infrastructure devices such as routers, switches, firewalls, wireless controllers, and other network appliances
Support collector deployment, syslog ingestion, telemetry collection, dashboard validation, troubleshooting, documentation, and handoff for a multi-site proof of concept

Fulltime

Senior Observability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...

Location

Germany, Berlin

Salary:

Not provided

Coralogix

Expiration Date

Until further notice

Requirements

5+ years of experience in Site Reliability, DevOps, or Platform Engineering with a focus on observability
Proven expertise with at least one major observability platform (e.g., Prometheus, VictoriaMetrics, OpenSearch)
Hands-on experience with Kubernetes, including deep knowledge of controllers, operators, and Helm
Experience writing Kubernetes controllers (controller-runtime, KubeBuilder)
Strong programming skills in Go or Python (Rust is a plus)
Experience designing, scaling, and operating observability systems at enterprise scale
Familiarity with at least one major cloud provider (AWS, Azure, or GCP)
Strong understanding of distributed systems, telemetry pipelines, and instrumentation standards (e.g., OpenTelemetry)
Excellent communication skills with the ability to explain complex topics to diverse stakeholders

Job Responsibility

Design, implement, and maintain observability features such as Alerting, SLOs, Reporting, and Synthetic Tests
Manage and scale OpenTelemetry Collectors and other observability agents across Kubernetes environments
Write and maintain Kubernetes Controllers using frameworks like controller-runtime and KubeBuilder
Operate and optimize the internal Coralogix account, ensuring proper usage, cost efficiency, and best practices adoption
Define and enforce observability guidelines and standards across the organization
Partner with engineering teams to embed observability by default into products and services
Control observability-related costs while maximizing performance, visibility, and value
Contribute to upstream projects such as OpenTelemetry, helping shape industry standards
Explore and implement cutting-edge observability technologies, including eBPF-based approaches

Fulltime

Senior Observability Engineer

Are you passionate about building robust systems and empowering developers with ...

Location

Australia, Sydney

Salary:

Not provided

Blume Global

Expiration Date

Until further notice

Requirements

Strong hands-on experience with Elastic Stack (Elasticsearch, Logstash, Kibana, Beats/Fleet)
Practical experience with OpenTelemetry, including configuration, instrumentation, and exporters
Experience designing and operating observability pipelines in production environments
Familiarity with tracing, metrics, and logging best practices
Experience with Linux-based systems, containers (Docker), and Kubernetes
Experience building and maintaining internal tools or services that support developers
Knowledge of scripting (e.g., Bash, Python) and infrastructure-as-code practices
A solid understanding of CI/CD pipelines, Git, and DevOps workflows
Strong communication skills and a proactive attitude toward cross-team and cross-company collaboration

Senior Observability Engineer for Data Middleware

Exciting senior role in Observability with growth and flexibility. Primary respo...

Location

Poland, Warsaw

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5+ years' experience in solution architecture, analysis, design, development, integration, and support of observability solutions
Experience with observability platforms: MeshIQ products (Nastel Navigator, AutoPilot, X-Ray) would be ideal, but any relevant experience with platforms such as Prometheus, Grafana, AppDynamics, or Elasticsearch is welcome
Experience with Automation scripting: Ansible, Linux shell, python, etc.
Experience with DevOps Tools: GitHub, Bitbucket, Harness, Jenkins, Artifactory, etc.
Experience with CI/CD automation: deploying and troubleshooting applications written in common programming languages such as Java and Python
3+ years' experience developing, integrating, or supporting Data Middleware products, e.g. messaging, streaming, ETL, BPM, etc.
Experience performance tuning, optimization, monitoring, and troubleshooting
Experience with AWS, GCP, or OpenShift is an added advantage but not mandatory

Job Responsibility

Serve as a technology subject matter expert for internal and external stakeholders and provide direction for all firm mandated controls and compliance initiatives, all projects within the group and in creating a technology domain roadmap
Ensure that all integration of functions meets business goals
Define necessary system enhancements to deploy new products and process enhancements
Develop build deployment packaging and automation
Responsible for architecture and implementation of HA (High Availability) and DR/COB (Continuity of Business)
Manage security screening and vulnerability tracking for Citi's software certification process
Performance monitoring and L3 troubleshooting support
Integration with enterprise services such as identity management, logging, secrets management, ticketing, etc.

What we offer

Private Medical Care Program
Life Insurance Program
Pension Plan contribution (PPE Program)
Employee Assistance Program
Paid Parental Leave Program (maternity and paternity leave)
Sport Card
Holidays Allowance
Sport and team recreation activities
Special offers and discounts for employees
Access to an array of learning and development resources

Fulltime

Senior Software Engineer, Observability

We are looking for an experienced Senior Engineer to join our newly formed Obser...

Location

Germany, Berlin

Salary:

Not provided

Aiven Deutschland GmbH

Expiration Date

Until further notice

Requirements

Extensive experience with observability concepts at large scale
A good grasp of monitoring and observability tools like Prometheus, Grafana, and OpenTelemetry
Understanding of SLAs, SLOs, and SLIs
Strong knowledge of database fundamentals, including OLAP vs. OLTP, persistence, replication, and clustering
Experience with ClickHouse specifically regarding logs, metrics, and OpenTelemetry is highly desirable
Experience in building and designing distributed systems in a cloud environment
Ability to work with SQL to interact with our platform's master database
Deep understanding of release management and testing best practices to own the delivery pipeline
A genuine interest in solving complex technical challenges with customer-focused solutions

Job Responsibility

Ensure our existing observability offering is up and running all the time
Ideate and develop innovative new features that attract our target customer segment, drive product engagement, and ultimately fuel growth
Support our existing external customer base by resolving escalated support issues and collaborating with them to understand and solve their needs
Guide the team in the hands-on implementation of key platform features, ensuring maintainability and performance
Empower your team to act as 'product custodians' by consistently addressing foundational and production issues
Practise effective communication and collaboration both within the team and across the wider organization and act as a role model in transparency for your peers

What we offer

Participate in Aiven’s equity plan
Balance work and life with our hybrid work policy
Choose the equipment you need to set yourself up for success
Use your Professional Development Plan budget for learning opportunities
Receive holistic wellbeing support through our global Employee Assistance Program
Inquire about our Global Time Off Commitment (Parental and Sick Leave, as well as Personal Time)
Enjoy country-specific benefits for our global cast

Fulltime

Senior Observability Infrastructure Engineer

We are looking for an experienced Observability Infrastructure Engineer to join ...

Location

Netherlands, Amsterdam

Salary:

Not provided

Adyen

Expiration Date

Until further notice

Requirements

10+ years of experience in the observability domain or in a relevant platform/infrastructure domain.
Observability Stack Expertise: You have hands-on experience operating core telemetry data stores at scale, e.g. Elasticsearch/OpenSearch/VictoriaLogs/ClickHouse for logging, Prometheus/VictoriaMetrics for metrics, and Grafana Tempo for distributed tracing.
Linux Experience: You understand the operating system at a kernel level and can debug complex networking, file system, and performance issues on both bare metal and virtualized hardware.
Production Kubernetes Experience: Proven hands-on experience operating and troubleshooting production workloads on Kubernetes (on-prem and/or cloud), including strong day-to-day use of kubectl and Kubernetes primitives (e.g. Namespaces, Pods, Deployments/StatefulSets, Services, Ingress, ConfigMaps/Secrets).
Software Engineering Mindset: You are proficient in Go or Python and do not just write scripts; you build tools and automation platforms that treat infrastructure as code.

Job Responsibility

Build the next generation of our platform: Design and implement the future architecture of our logging and metrics systems.
Own infrastructure operations: You will take full ownership of our hybrid infrastructure, managing the lifecycle of over 1,500 servers across both bare-metal and Kubernetes environments.
Automate to reduce toil: You will write code in Go or Python to eliminate manual operational tasks.
Optimize for scale and performance: You will dive deep into performance bottlenecks within our distributed tracing and logging pipelines.
Reliability and Engineering: You will participate in on-call rotations, but your primary focus will be engineering solutions that stop alerts from firing in the first place.

Fulltime

Senior Infrastructure Engineer / Observability Specialist

Location: Remote - Anywhere in Australia (Will be required to travel to Canberra...

Location

Australia, Sydney

Salary:

Not provided

FinXL

Expiration Date

Until further notice

Requirements

Must be an Australian citizen and able to obtain a Baseline security clearance
Cloud Expertise: Proficiency in AWS, Azure, or Google Cloud platforms
Observability Concepts: Deep understanding of metrics, logs, and traces, including the design of alerting systems
Automation: Experience in scripting with Python, Bash, or PowerShell
Containerisation: Knowledge of Kubernetes and Docker
Soft Skills: Strong negotiation and communication skills to assist with project planning and problem resolution

Job Responsibility

Configure and support observability tools including Dynatrace, Amazon CloudWatch, AWS CloudTrail, AWS Config, and Azure Monitor
Take ownership of observability monitoring policies, standards, and documentation
Perform fault diagnosis and root cause analysis with timely remedial action
Drive change and uplift IT teams through education and "evangelising" monitoring concepts
Provide support for AWS S3, cloud backups, and AWS RDS databases as needed
Lead incident response through to conclusion and manage assigned service queues

Senior Software Engineer - Observability

As a Senior Software Engineer, you will be directly responsible for Palantir’s o...

Location

United States, New York

Salary:

135000.00 - 200000.00 USD / Year

Palantir Technologies

Expiration Date

Until further notice

Requirements

5+ years of professional software development experience
2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
1+ years of experience as a mentor, tech lead, or leader of an engineering team
Strong coding skills in Go, Java, or equivalent
Experience designing, building, and operating high-scale observability or infrastructure systems
Bachelor's degree in Computer Science or equivalent
Active US Security clearance, or eligibility and willingness to obtain a US Security clearance

Job Responsibility

Partner with our extended leadership team to set and define a technical strategy for your team aligned with the wider team strategy
Build and champion a long-term tech roadmap to reduce operational burden, ensure scalability, reduce risk, and guide your team towards step-changes whenever possible
Be technically involved and engage in substantive discussion when reviewing technical roadmaps and project implementation with the team
Work closely with teammates and stakeholders to enable sustainable and timely delivery of technical solutions to address business needs
Facilitate partnerships between engineering teams and operators to build innovative products that help Palantir scale
Act as a multiplier for other engineers on the team. Define where the technical bar should be, and help engineers achieve it. Lead engineers and accelerate their growth by providing thoughtful feedback and technical mentorship, and by managing performance effectively
Foster a non-hierarchical exchange of ideas, valuing the idea rather than the individual who communicates it

What we offer

Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
Commuter benefits
Relocation assistance
Take-what-you-need paid time off, not accrual-based
2 weeks paid time off built into the end of each year (subject to team and business needs)
10 paid holidays throughout the calendar year
Supportive leave of absence program including time off for military service and medical events
Paid leave for new parents and subsidized back-up care for all parents
Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation

Fulltime
