CrawlJobs Logo

Senior Observability Engineer

New Zealand · Job Posted June 09, 2026
Apply Position
Job Link Share

Job Description

As a Senior Observability Engineer, you’ll lead the design and ongoing improvement of our enterprise observability platform. This is a hands-on technical leadership role where you’ll act as the authority for our Grafana Cloud ecosystem. You’ll work across Platform Engineering, CloudOps, DevSecOps, Infrastructure and Network teams to embed observability into platform design and enable reliable, scalable operations. You’ll also mentor others and drive consistency through standardisation, automation and best practice adoption across a complex hybrid environment. This is a fixed-term role for 12-months supporting a key project on delivering critical observability uplift.

Job Responsibility

  • Define and embed standards for monitoring, alerting, and dashboard design across teams
  • Drive the transition from legacy monitoring tools to modern, cloud-based platforms
  • Develop and manage data ingestion pipelines across cloud, infrastructure, and network environments
  • Build and maintain dashboards that provide meaningful operational and performance insights
  • Design alerting frameworks with clear routing, prioritisation and ITSM integration
  • Implement automation and code-driven configuration to improve consistency and reduce manual effort

Requirements

  • Deep hands-on experience with enterprise observability platforms (e.g. Grafana Cloud)
  • Strong Prometheus experience, including querying and alerting approaches
  • Experience implementing code-driven platform configuration (e.g. GitOps, Terraform, CI/CD pipelines)
  • Solid understanding of cloud environments across AWS and/or GCP
  • Exposure to Kubernetes and modern platform or infrastructure environments
  • Experience using scripting or infrastructure tooling to support scalable operations

What we offer

  • Flexible working
  • Additional paid parental leave
  • Free period products
  • Financial health checks
  • Southern Cross health insurance for you and your family after a qualifying period
  • Discounts at local gyms
  • Access to online wellbeing tools
  • Prayer and privacy room onsite
  • Continuous learning and professional growth
  • Access to our extensive online training library
  • Furry Friend Fridays
  • Monthly social events
  • Free carparking
  • In-house café

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Observability Engineer

8 matching positions

Senior Observability Engineer

Our client, a large professional services firm, is looking to hire a Senior Obse...
Location
Location
United States
Salary
Salary:
Not provided
clearbridgetech.com Logo
ClearBridge Technology Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on Grafana dashboard and data source experience
  • Experience with Grafana Loki and LogQL
  • Experience deploying and configuring OpenTelemetry Collector
  • Grafana Alloy experience preferred
  • Logstash experience preferred, especially syslog parsing, Grok, filtering, and normalization
  • Experience collecting syslog from network devices
  • Experience with Cisco or similar network device log formats
  • Experience collecting network telemetry such as CPU, memory, uptime, interface status, bandwidth, errors, discards, device health, and alarms
  • Familiarity with SNMP, syslog, network device CLI configuration, and collector-based monitoring
  • Ability to troubleshoot ingestion, parsing, dropped logs, pipeline health, and telemetry flow issues
Job Responsibility
Job Responsibility
  • Support a short-term Grafana-based observability migration proof of concept
  • Collect, parse, normalize, and validate log data and network telemetry from infrastructure devices such as routers, switches, firewalls, wireless controllers, and other network appliances
  • Support collector deployment, syslog ingestion, telemetry collection, dashboard validation, troubleshooting, documentation, and handoff for a multi-site proof of concept
  • Fulltime
Read More
Arrow Right

Senior Observability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
coralogix.com Logo
Coralogix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Site Reliability, DevOps, or Platform Engineering with a focus on observability
  • Proven expertise with at least one major observability platform (e.g., Prometheus, Victoria Metrics, OpenSearch)
  • Hands-on experience with Kubernetes, including deep knowledge of controllers, operators, and Helm
  • Experience writing Kubernetes controllers (controller-runtime, KubeBuilder)
  • Strong programming skills in Go or Python (Rust is a plus)
  • Experience designing, scaling, and operating observability systems at enterprise scale
  • Familiarity with at least one major cloud provider (AWS, Azure, or GCP)
  • Strong understanding of distributed systems, telemetry pipelines, and instrumentation standards (e.g., OpenTelemetry)
  • Excellent communication skills with the ability to explain complex topics to diverse stakeholders
Job Responsibility
Job Responsibility
  • Design, implement, and maintain observability features such as Alerting, SLOs, Reporting, and Synthetic Tests
  • Manage and scale OpenTelemetry Collectors and other observability agents across Kubernetes environments
  • Write and maintain Kubernetes Controllers using frameworks like controller-runtime and KubeBuilder
  • Operate and optimize the internal Coralogix account, ensuring proper usage, cost efficiency, and best practices adoption
  • Define and enforce observability guidelines and standards across the organization
  • Partner with engineering teams to embed observability by default into products and services
  • Control observability-related costs while maximizing performance, visibility, and value
  • Contribute to upstream projects such as OpenTelemetry, helping shape industry standards
  • Explore and implement cutting-edge observability technologies, including eBPF-based approaches
  • Fulltime
Read More
Arrow Right

Senior Observability Engineer

Are you passionate about building robust systems and empowering developers with ...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
blumeglobal.com Logo
Blume Global
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on experience with Elastic Stack (Elasticsearch, Logstash, Kibana, Beats/Fleet)
  • Practical experience with OpenTelemetry, including configuration, instrumentation, and exporters
  • Experience designing and operating observability pipelines in production environments
  • Familiarity with tracing, metrics, and logging best practices
  • Experience with Linux-based systems, containers (Docker), and Kubernetes
  • Experience building and maintaining internal tools or services that support developers
  • Knowledge of scripting (e.g., Bash, Python) and infrastructure-as-code practices
  • A solid understanding of CI/CD pipelines, Git, and DevOps workflows
  • Strong communication skills and a proactive attitude toward cross-team and cross-company collaboration
Read More
Arrow Right

Senior Observability Engineer for Data Middleware

Exciting senior role in Observability with growth and flexibility. Primary respo...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years' experience Solution Architecture, Analysis, Design, Development, Integration and supporting Observability solutions
  • Experience with Observability platforms: MeshIQ products (Nastel Navigator, AutoPilot, X-Ray) would be ideal, but all relevant experience with platforms like Prometheus, Grafana, AppDynamics, ElasticSearch etc.
  • Experience with Automation scripting: Ansible, Linux shell, python, etc.
  • Experience with DevOps Tools: GitHub, Bitbucket, Harness, Jenkins, Artifactory, etc.
  • Experience with CICD Automation: experience deploying and troubleshooting common programming languages such as Java, Python, etc.
  • 3+ years’ past experience developing, integrating or supporting Data Middleware products, e.g. messaging, streaming, ETL, BPM, etc.
  • Experience performance tuning, optimization, monitoring, and troubleshooting
  • Experience in AWS/ GCP / Openshift is an added advantage but not mandatory
Job Responsibility
Job Responsibility
  • Serve as a technology subject matter expert for internal and external stakeholders and provide direction for all firm mandated controls and compliance initiatives, all projects within the group and in creating a technology domain roadmap
  • Ensure that all integration of functions meet business goals
  • Define necessary system enhancements to deploy new products and process enhancements
  • Develop build deployment packaging and automation
  • Responsible for architecture and implementation of HA (High Availability) and DR/COB (Continuity of Business)
  • Manage security screening and vulnerability tracking for Citi's software certification process
  • Performance monitoring and L3 troubleshooting support
  • Integration with enterprise services such as identity management, logging, secrets management, ticketing, etc.
What we offer
What we offer
  • Private Medical Care Program
  • Life Insurance Program
  • Pension Plan contribution (PPE Program)
  • Employee Assistance Program
  • Paid Parental Leave Program (maternity and paternity leave)
  • Sport Card
  • Holidays Allowance
  • Sport and team recreation activities
  • Special offers and discounts for employees
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Observability

We are looking for an experienced Senior Engineer to join our newly formed Obser...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
aiven.io Logo
Aiven Deutschland GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive experience with observability concepts on a big scale
  • A good grasp of monitoring and observability tools like Prometheus, Grafana, and OpenTelemetry
  • Understanding of SLAs, SLOs, and SLIs
  • Strong knowledge of database fundamentals, including OLAP vs. OLTP, persistence, replication, and clustering
  • Experience with ClickHouse specifically regarding logs, metrics, and OpenTelemetry is highly desirable
  • Experience in building and designing distributed systems in a cloud environment
  • Ability to work with SQL to interact with our platform's master database
  • Deep understanding of release management and testing best practices to own the delivery pipeline
  • A genuine interest in solving complex technical challenges with customer-focused solutions
Job Responsibility
Job Responsibility
  • Ensure our existing observability offering is up and running all the time
  • Ideate and develop innovative new features that attract our target customer segment, drive product engagement, and ultimately fuel growth
  • Support our existing external customer base by resolving escalated support issues and collaborating with them to understand and solve their needs
  • Guide the team in the hands-on implementation of key platform features, ensuring maintainability and performance
  • Empower your team to act as 'product custodians' by consistently addressing foundational and production issues
  • Practise effective communication and collaboration both within the team and across the wider organization and act as a role model in transparency for your peers
What we offer
What we offer
  • Participate in Aiven’s equity plan
  • Balance work and life with our hybrid work policy
  • Choose the equipment you need to set yourself up for success
  • Use your Professional Development Plan budget for learning opportunities
  • Receive holistic wellbeing support through our global Employee Assistance Program
  • Inquire about our Global Time Off Commitment (Parental and Sick Leave, as well as Personal Time)
  • Enjoy country-specific benefits for our global cast
  • Fulltime
Read More
Arrow Right

Senior Observability Infrastructure Engineer

We are looking for an experienced Observability Infrastructure Engineer to join ...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
adyen.com Logo
Adyen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in the observability domain or in a relevant platform/infrastructure domain.
  • Observability Stack Expertise: You have hands-on experience operating core telemetry data stores at scale e.g. Elasticsearch/Opensearch/VictoriaLogs/Clickhouse for logging, Prometheus/ VictoriaMetrics for metrics and Grafana Tempo for distributed tracing.
  • Linux Experience: You understand the operating system at a kernel level and can debug complex networking, file system, and performance issues on both bare metal and virtualized hardware .
  • Production Kubernetes Experience: Proven hands-on experience operating, and troubleshooting production workloads on Kubernetes (on-prem and/or cloud), including strong day-to-day use of kubectl and Kubernetes primitives (e.g. Namespaces, Pods, Deployments/StatefulSets, Services, Ingress, ConfigMaps/Secrets)
  • Software Engineering Mindset: You are proficient in Go or Python and do not just write scripts
  • you build tools and automation platforms that treat infrastructure as code.
Job Responsibility
Job Responsibility
  • Build the next generation of our platform: Design and implement the future architecture of our logging and metrics systems.
  • Own infrastructure operations: You will take full ownership of our hybrid infrastructure, managing the lifecycle of over 1,500 servers across both bare-metal and Kubernetes environments.
  • Automate to reduce toil: You will write code in Go or Python to eliminate manual operational tasks.
  • Optimize for scale and performance: You will dive deep into performance bottlenecks within our distributed tracing and logging pipelines.
  • Reliability and Engineering: You will participate in on-call rotations, but your primary focus will be engineering solutions that stop alerts from firing in the first place.
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer / Observability Specialist

Location: Remote - Anywhere in Australia (Will be required to travel to Canberra...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
finxl.com.au Logo
FinXL
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must be Australian Citizen and be able to obtain Baseline Security Clearance
  • Cloud Expertise: Proficiency in AWS, Azure, or Google Cloud platforms
  • Observability Concepts: Deep understanding of metrics, logs, and traces, including the design of alerting systems
  • Automation: Experience in scripting with Python, Bash, or PowerShell
  • Containerisation: Knowledge of Kubernetes and Docker
  • Soft Skills: Strong negotiation and communication skills to assist with project planning and problem resolution
Job Responsibility
Job Responsibility
  • Configure and support observability tools including Dynatrace, Amazon CloudWatch, Amazon CloudTrail, AWS Config, and Azure Monitor
  • Take ownership of observability monitoring policies, standards, and documentation
  • Perform fault diagnosis and root cause analysis with timely remedial action
  • Drive change and uplift IT teams through education and "evangelising" monitoring concepts
  • Provide support for AWS S3, cloud backups, and AWS RDS databases as needed
  • Lead incident response through to conclusion and manage assigned service queues
Read More
Arrow Right

Senior Software Engineer - Observability

As a Senior Software Engineer, you will be directly responsible for Palantir’s o...
Location
Location
United States , New York
Salary
Salary:
135000.00 - 200000.00 USD / Year
palantir.com Logo
Palantir Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of professional software development experience
  • 2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
  • 1+ years of experience as a mentor, tech lead Or leading an engineering team
  • Strong coding skills in Go, Java, or equivalent
  • Experience designing, building, and operating high-scale observability or infrastructure systems
  • Bachelor's degree in Computer Science or equivalent
  • Active US Security clearance, or eligibility and willingness to obtain a US Security clearance
Job Responsibility
Job Responsibility
  • Partner with our extended leadership team to set and define a technical strategy for your team aligned with the wider team strategy
  • Build and champion a long-term tech roadmap to reduce operational burden, ensure scalability, reduce risk, and guide your team towards step-changes whenever possible
  • Be technically involved and engage in substantive discussion when reviewing technical roadmaps and project implementation with the team
  • Work closely with teammates and stakeholders to enable sustainable and timely delivery of technical solutions to address business needs
  • Facilitate partnerships between engineering teams and operators to build innovative products that help Palantir scale
  • Act as a multiplier for other engineers on the team. Define where the technical bar should be, and help engineers achieve it. Lead engineers and accelerate their growth by providing thoughtful feedback, technical mentorship, and effectively manage performance
  • Foster a non-hierarchical exchange of ideas
  • valuing the idea rather than the individual who communicates it
What we offer
What we offer
  • Employees (and their eligible dependents) can enroll in medical, dental, and vision insurance as well as voluntary life insurance
  • Employees are automatically covered by Palantir’s basic life, AD&D and disability insurance
  • Commuter benefits
  • Relocation assistance
  • Take what you need paid time off, not accrual based
  • 2 weeks paid time off built into the end of each year (subject to team and business needs)
  • 10 paid holidays throughout the calendar year
  • Supportive leave of absence program including time off for military service and medical events
  • Paid leave for new parents and subsidized back-up care for all parents
  • Fertility and family building benefits including but not limited to adoption, surrogacy, and preservation
  • Fulltime
Read More
Arrow Right