Site Reliability Engineer Job at BlackRock Investments (Edinburgh)

Site Reliability Engineer

BlackRock Investments

Location:
United Kingdom , Edinburgh

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Not provided

Save Job

Apply Position

Job Description:

We’re looking for an SRE with strong Kafka experience and a deep understanding of SRE best practices. You’ll combine hands‑on technical improvements with the ability to delegate work effectively to EventBus developers. You’ll collaborate closely with the EventBus, Kafka, Telemetry, and Incident Response teams, while also working independently to improve monitoring, reduce noise, strengthen alerting, and track remediation progress. This role sits at the centre of a global platform used by hundreds of developers and joins a fast‑growing, experienced SRE group based in Edinburgh.

Job Responsibility:

Staying informed on all EventBus incidents, including impact, root cause, detection, and ongoing remediation
Responding to incidents calmly and efficiently, communicating clearly with reporters and partner teams, and recommending remediations based on urgency and impact
Proposing improvements informed by prior incidents, potential risks, and industry standards—e.g., new metrics, SLOs, fallback mechanisms
Leading incident retrospectives and sharing insights with the wider team
Creating and distributing postmortems for high‑impact operational events
Collaborating with developers to write, maintain, and promote runbooks and playbooks
Improving alert quality and reducing alert fatigue by tuning signal‑to‑noise ratios
Designing and implementing automated recovery solutions for known issues
Building a roadmap toward 24/7 availability, rapid failover recovery, self‑detection, and automated resolution of common issues
Helping EventBus users diagnose issues with their own producers and consumers

Requirements:

3+ years in an SRE role, including experience with defining and managing SLOs
Strong understanding of SRE principles (Golden Signals, error budgets, synthetic monitoring, signal‑to‑noise optimisation)
Extensive hands‑on experience with Kafka
Experience using monitoring tools (Grafana and Splunk preferred), including building dashboards, alerts, and reports

Nice to have:

Java Developer Experience: Experience with Java or another object‑oriented language
CI/CD & Release Management: Experience managing pipelines using Azure DevOps or other Git‑based tools
Cloud Experience: Practical experience with at least one public cloud provider, preferably Azure or AWS
Agile Development: Familiarity with agile ways of working, sprint ceremonies, and backlog planning
Scripting & Automation: Proficiency in Python or Golang for automating operational tasks
Monitoring & Observability: Strong understanding of logging, monitoring, and observability practices, including writing integration scripts
Collaboration & Communication: Strong cross‑team collaboration skills and excellent written and verbal communication

What we offer:

Retirement investment and tools designed to help you in building a sound financial future
Access to education reimbursement
Comprehensive resources to support your physical health and emotional well-being
Family support programs
Flexible Time Off (FTO)

Additional Information:

Job Posted:
February 20, 2026

Expiration:
February 23, 2026

Employment Type:

Fulltime

Work Type:

Hybrid work

BlackRock Investments - All Job Offers

Job Link Share:

Site Reliability Engineer

BlackRock Investments

Location:
United Kingdom , Edinburgh

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
February 20, 2026

Expiration:
February 23, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Site Reliability Engineering Manager

Cloud Security Site Reliability Engineer

Senior Software Engineer, Site Reliability

Principal Site Reliability Engineer

Staff Site Reliability Engineer

Site Reliability Engineer

BlackRock Investments

Location:United Kingdom , Edinburgh

Category:IT - Software Development

Contract Type:Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:February 20, 2026

Expiration:February 23, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Site Reliability Engineering Manager

Cloud Security Site Reliability Engineer

Senior Software Engineer, Site Reliability

Principal Site Reliability Engineer

Staff Site Reliability Engineer

Location:
United Kingdom , Edinburgh

Category:
IT - Software Development

Contract Type:
Not provided

Job Posted:
February 20, 2026

Expiration:
February 23, 2026