CrawlJobs Logo
Briefcase Icon
Category Icon

Filters

×
Filters

No filters available for this job position.

SRE Observability Lead Jobs

1 Job Offers

Filters
New
Site Reliability Engineering (SRE) / Observability Technical Lead
Save Icon
Lead Observability and SRE initiatives in London, leveraging your expertise in APM, OpenTelemetry, and Terraform. You will design reliability frameworks, automate processes, and mentor teams in a cloud-native environment. Enjoy flexible work options and comprehensive wellbeing benefits while driv...
Location Icon
Location
United Kingdom , London
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Looking for SRE Observability Lead jobs? This senior technical leadership role sits at the critical intersection of software engineering, systems operations, and strategic vision. An SRE Observability Lead is fundamentally responsible for architecting and driving the observability strategy for an entire organization or major business unit. They move beyond simply using monitoring tools to defining what “observability” means for their company, ensuring that every system provides the necessary telemetry—metrics, logs, and traces—to allow engineers to understand and improve system health, performance, and reliability. Professionals in these roles typically blend deep hands-on technical expertise with strategic planning and team leadership. Common responsibilities include defining a multi-year observability roadmap, establishing standards and best practices for instrumentation, and building centralized platforms and reusable services that enable product teams to observe their own systems effectively. They lead a small team of specialist Site Reliability Engineers (SREs) focused on observability tooling and practices. A key part of their mandate is to embed observability as a core non-functional requirement (NFR) into the software development lifecycle (SDLC), ensuring new applications are built with transparency and insight from the start. They act as a central hub, collaborating with platform, infrastructure, and application development teams to break down silos and create unified, enterprise-scale observability solutions that provide end-to-end visibility into critical user journeys. The typical skill set required for SRE Observability Lead jobs is extensive. Candidates generally possess over a decade of experience in SRE, infrastructure, or platform engineering, with several years in a leadership capacity. Deep, hands-on expertise with the modern observability stack is non-negotiable, including tools like Prometheus, Grafana, OpenTelemetry, ELK (Elasticsearch, Logstash, Kibana), and commercial equivalents. They must have proven experience designing and implementing telemetry strategies across hybrid environments encompassing on-premises data centers, public cloud (AWS, GCP, Azure), and container orchestration platforms like Kubernetes. A firm grasp of core SRE principles—such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets—is essential. Beyond technical prowess, success in this profession demands exceptional soft skills: the ability to influence senior stakeholders, mentor and grow engineers, communicate complex concepts clearly, and drive cultural change toward a data-driven, reliability-focused engineering practice. For those seeking to shape the future of system reliability and transparency, SRE Observability Lead jobs offer a challenging and impactful career path at the forefront of modern software operations.

Filters

×
Countries
Category
Location
Work Mode
Salary