Site Reliability Engineering Specialist

Plusnet

Location:
United Kingdom, Snowhill, Birmingham

Contract Type:
Not provided

Salary:
Not provided

Job Description:

Professional Services was formed as a progressive development towards the convergence of multiple domains across BT. We pride ourselves on providing expert third-line support to an extensive range of services, ensuring the required levels of availability are maintained. The team is widely recognised for getting things done while making transformational improvements along the way. We do this by ensuring we have the right people to achieve our high ambitions.

The purpose of this role is to apply SRE principles to major end-to-end technology introduction projects and service assurance activities on BT’s core network infrastructure. You will take the lead on complex, high-impact fault resolution across multiple platforms and services. You will also build and own network automation processes to deliver flawless change to BT’s networks.

Job Responsibility:

  • Builds network engineering change processes for complex end-to-end technology introduction in the live network, utilising automation & CI/CD pipelines.
  • Leads on major incident resolution acting as a final technical escalation point within BT.
  • Leads blameless post‑incident reviews to uncover systemic root causes and convert learnings into concrete reliability, automation, and process improvements.
  • Champions a reliability‑first change culture, promoting safe deployment patterns, blameless learning, and continuous improvement across engineering teams.
  • Collaborates with design & platform teams to support the implementation of flawless change into the live network.
  • Acts as a subject matter expert within the network engineering domain, applying this expertise to troubleshoot faults on our infrastructure that cross multiple platform domains.
  • Embeds secure-by-design principles when building new change processes and solutions.
  • Champions and builds effective working relationships, both internally and externally, to deliver business outcomes.
  • Champions the adoption of Site Reliability Engineering practices within Professional Services, driving cultural change towards automation, observability, and reduced operational toil.

Requirements:

  • A strong understanding of multi-vendor IP/MPLS networks (Nokia, Cisco, Juniper, etc.)
  • A strong understanding of network routing protocols such as IS-IS, LDP, RSVP, segment routing, OSPF, eBGP, iBGP, MP-BGP
  • A strong understanding of fundamental protocols such as DNS, DHCP & NTP
  • A strong understanding of network change & incident management best practice
  • A good understanding of Linux operating systems
  • An intermediate level of proficiency in at least one programming language, preferably Python
  • You will be confident and professional in communicating with all stakeholders, both locally and with members of the Senior Management Team.
  • You will have the ability to work in a high-pressure environment.

Nice to have:

  • Strong Python programming proficiency
  • Strong understanding of IaC languages such as Ansible & Terraform
  • A strong understanding of containerisation using Docker, Podman or a similar container engine
  • Strong proficiency in building CI/CD pipelines
  • A good knowledge of coding best practices including code structure, peer review & testing.

What we offer:

  • Tailored training and development opportunities to continue to build your career
  • 10% on target bonus
  • 25 days’ annual leave (not including bank holidays), increasing with service
  • Life Assurance
  • Pension scheme: if you pay in a minimum of 5% of your pensionable salary every month, we will pay in 10%
  • Direct Share scheme
  • Option to join the Healthcare Cash Plan or other benefits such as dental insurance, gym memberships etc.
  • 50% off EE mobile pay monthly or SIM only plans
  • Exclusive colleague discounts on our latest and greatest BT broadband packages and BT TV, including TNT Sports and NOW entertainment
  • Shared parental leave: the maximum amount of leave you can share with your partner is 50 weeks

Additional Information:

Job Posted:
March 01, 2026

Employment Type:
Full-time

Similar Jobs for Site Reliability Engineering Specialist

Manager, Reliability

Responsible for sustaining and continuously improving various mechanical compone...
Location:
United States, Big Spring
Salary:
Not provided
Delek US
Expiration Date:
Until further notice
Requirements:
  • 4 year / Bachelor's Degree (Required)
  • Four (4) or more years of experience in a related field (Required)
  • No Licensure or Certification Required
  • Manages and leads the activities of the Reliability engineers and specialists
  • Ensures compliance to Engineering Practices/Mechanical Integrity at the site level
  • Champions initiatives, projects, and programs that support the reliability vision
  • Guides Reliability Engineers to grow their technical and leadership skills
  • Develops working relationships with site leaders to guide teams on reliability centered processes and investigations
  • SPOC between Corporate Reliability and site activities
  • Reliability Department budget owner
Job Responsibility:
  • Responsible for sustaining and continuously improving various mechanical components for equipment and tools
  • Ensures the safe, effective operations of the organization's production and supports continuous improvement
  • Manages reliability engineering projects
  • Performs analytical verification
  • Evaluates, tests and tracks results of reliability interventions
  • Initiates reporting for internal or third-party reported incidents
  • Creates, documents, and follows up on corrective actions
  • Prepares routine reports and memos and coordinates communications across all necessary functional groups of the organization
What we offer:
  • up to a 10% match on 401K on your hire start, with a vesting timeline of only one year
  • medical benefits that start on day one with a 30% premium rebate annually
  • access to the Calm app for FREE
  • additional annual incentives through performance management program
Employment Type:
Full-time

Construction Maintenance Specialist

Join Galp and bring your curiosity and passion every day. With a customer-centri...
Location:
Portugal, Madeira
Salary:
Not provided
Galp
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s/Master’s degree in Mechanical or Civil Engineering
  • Minimum of 3 years of prior experience in similar roles
  • Professional Gas Technician certification
  • Membership in the Order of Engineers
  • Experience in Project Management
  • Proficiency in English
  • Strong computer skills, including MS Office 365, Power BI, and SharePoint
  • Excellent written and verbal communication skills
  • Customer and results-oriented mindset
  • Strong leadership skills and experience in managing service providers and teams
Job Responsibility:
  • Lead and coordinate multidisciplinary teams of service providers and collaborate with other departments in executing engineering projects for Galp LPG clients or potential clients
  • Promote and manage LPG construction projects to support Residential and Enterprise business development
  • Ensure the maintenance and requalification of LPG assets in the Madeira archipelago, including networks, parks, and gas cabins
  • Participate and collaborate in the licensing process for LPG assets in Madeira
  • Manage Galp Madeira’s internal installation teams
  • Assume Technical Responsibility for the Operating Entity
  • Participate in the emergency and urgent maintenance response team of Galp Madeira
  • Ensure compliance with Health, Safety, and Environmental (HSE) standards and procedures on-site
  • Understand and follow technological development trends, acting as an agent for change management, business challenges, and requirements
  • Contribute to continuous improvement in the processes of the Construction and Renovation unit within the Technical Operations area
Employment Type:
Full-time

Senior Applications Specialist

Location:
Canada, Mississauga
Salary:
Not provided
Advanced Technology Search Group
Expiration Date:
Until further notice
Requirements:
  • A Degree in Electrical Engineering, Computer Science, or related technical discipline or equivalent experience
  • At least 3 years of experience in advanced systems engineering support, focusing on complex technical problem resolution
  • Proven generalized understanding of computer networking (LAN, WAN, NAT, DNS, Basic Firewalls, etc.)
  • Hands-on experience with Linux and/or Windows (CMD, Bash, PS, regedit, etc.)
  • Demonstrated ability to diagnose sophisticated technical issues and implement effective solutions
  • Ability to work collaboratively with cross-functional teams
  • Willing to adapt to evolving technologies and industry standards
Job Responsibility:
  • Analyze complex technical issues and system integrations to identify root causes and develop effective solutions
  • Conduct systematic analysis to diagnose customer system issues and implement effective technical solutions
  • Travel to customer sites in Canada and the US for advanced troubleshooting and customer support
  • Collaborate with designers, developers, and stakeholders, as well as the technical support team, to ensure seamless product integration and customer satisfaction
  • Manage the deployment and configuration of integrated systems, ensuring optimal performance and reliability
  • Develop detailed Product Support Documents, and train internal technical support as appropriate
  • Develop comprehensive technical manuals and field installation guides to support customers during product installation, commissioning, and troubleshooting
  • Investigate and review recurring product issues to drive product improvements
  • Equip and support the team with in-depth product knowledge and configuration strategies
  • Provide post-sales customer support, including consultation on product configuration, installation, and usage
Employment Type:
Full-time

Site Reliability Engineering Specialist

This role will specialise in system administration and server management with a ...
Location:
United Kingdom, Birmingham
Salary:
Not provided
Plusnet
Expiration Date:
Until further notice
Requirements:
  • Experience in an ISP Environment: Proven experience in a fast-paced ISP setting, managing and troubleshooting large-scale networks
  • Sysadmin/Server Management: Strong skills in system administration, server management, and compute resources with experience in deploying and managing containerised applications using orchestration tools such as Kubernetes
  • Technical Proficiency: Strong understanding of network architecture, design, and implementation
  • Monitoring and Logging Solutions: Familiarity with monitoring and logging solutions such as Elasticsearch, Apache Kafka, and Prometheus
  • Programming Proficiency: Proficiency in at least one programming language, such as Python, Ansible or Go
  • Growth Mindset: Self-driven attitude towards learning new skills and aiding the development of others
Job Responsibility:
  • Network Delivery: Support the implementation of flawless change into the live network, utilising automation and CI/CD pipelines
  • Network Monitoring: Configure, maintain, and monitor systems and network infrastructure to ensure optimal health, performance, and reliability
  • Automation Tools: Utilise tools such as Ansible to provision and manage infrastructure resources in a scalable and efficient manner
  • Technical Acumen: Apply your understanding of network principles to troubleshoot network faults within our systems and look at how you can optimise performance and enhance security across our infrastructure
  • Incident Management and Resolution: Be prepared to support a 24/7/365 callout, providing third-line technical resolution covering an extensive range of technologies
  • Customer Focus: Be a technical expert who understands the end-to-end journey of our customers
  • Growth and Development: As a technically talented expert you should enhance the brand of the team and support those around you to be accountable and perform at their best
What we offer:
  • Competitive salary
  • 10% on target bonus
  • BT Pension scheme, minimum 5% Employee contribution, BT contribution 10%
  • 25 days annual leave (not including bank holidays), increasing with service
  • Huge range of flexible benefits including cycle to work, healthcare, season ticket loan
  • World-class training and development opportunities
  • Option to join BT Shares Saving schemes
  • Discounted broadband, mobile and TV packages
  • Access to hundreds of retail discounts including the BT shop
  • On call allowances and overtime
Employment Type:
Full-time

Platform Specialist

Hermeus is a high-speed aircraft manufacturer focused on the rapid design, build...
Location:
United States, Atlanta
Salary:
105750.00 - 129250.00 USD / Year
Hermeus
Expiration Date:
Until further notice
Requirements:
  • Associate's degree in information technology/systems, Computer Science, Engineering, or related STEM field
  • 5+ years of experience as a Network Engineer or in a similar role (Site Reliability Engineer, Platform Engineer, Cybersecurity Engineer, Software Engineer)
  • Proficient in Linux
  • Proficient in basic electronics maintenance and repair
  • Proficient in the maintenance/termination of serial port connectors (RS232, etc.) and CAT5/6
  • Proficient in basic electronics packaging (design, assembly, cable routing, cooling, etc.)
  • Proficiency working at all layers of the OSI network model
  • Proficiency designing and maintaining layer 2 networks
  • Comfortable reading and maintaining low voltage wiring diagrams
  • Experience creating, explaining, and maintaining network architectures, network topology diagrams, and other interface diagrams/specifications
Job Responsibility:
  • Build, maintain, troubleshoot, and repair Ground Control Stations
  • Install, configure, and maintain flight critical IT systems, communication tools, and mission-enhancing hardware/software
  • Install, configure, and maintain pilot-in-the-loop cockpits for Remotely Piloted Aircraft (RPAs)
  • Collaborate closely with other engineering teams to ensure seamless integration of mission systems
  • Design, configure, and maintain network hardware and software, including routers, switches, firewalls, Access Points (wireless), and VPNs
  • Monitor network performance and ensure system availability and reliability
  • Perform network troubleshooting to isolate and diagnose common network problems
  • Implement and maintain network security, including access controls, intrusion detection systems, and threat prevention
  • Manage and administer network services such as DNS, DHCP, and IP address management (IPAM)
  • Develop and maintain comprehensive documentation for network configurations, processes, and procedures
What we offer:
  • 100% employer-paid health care
  • 401k & retirement plans
  • Unlimited PTO
  • Weekly paid office lunches
  • Fully stocked breakrooms
  • Stock options
  • Paid Parental Leave
Employment Type:
Full-time

Platform Specialist

Location:
United States, Atlanta
Salary:
Not provided
Hermeus
Expiration Date:
Until further notice
Requirements:
  • Associate's degree in information technology/systems, Computer Science, Engineering, or related STEM field
  • 5+ years of experience as a Network Engineer or in a similar role (Site Reliability Engineer, Platform Engineer, Cybersecurity Engineer, Software Engineer)
  • Proficient in Linux
  • Proficient in basic electronics maintenance and repair
  • Proficient in the maintenance/termination of serial port connectors (RS232, etc.) and CAT5/6
  • Proficient in basic electronics packaging (design, assembly, cable routing, cooling, etc.)
  • Proficient in working at all layers of the OSI network model
  • Proficient in designing and maintaining layer 2 networks
  • Legally authorized to work for any employer in the United States
  • Will not require employment visa sponsorship
Employment Type:
Full-time

Site Reliability Engineering Specialist

BTI Professionals provide expert third-line reliability and operational support ...
Location:
Hungary, Budapest
Salary:
Not provided
Plusnet
Expiration Date:
Until further notice
Requirements:
  • Experience supporting large-scale, high-availability services in an ISP / NaaS / network-centric environment
  • Strong Linux troubleshooting and systems knowledge
  • Hands-on Kubernetes experience operating applications in production
  • Experience delivering changes using GitOps and CI/CD pipelines (including release validation and rollback awareness)
  • Working knowledge of incident/problem management in ServiceNow and delivery tracking in Jira (Scrum / PI planning)
  • Experience with observability tooling: Dynatrace, Prometheus, Elasticsearch, plus event/messaging platforms such as Kafka
  • Solid networking fundamentals to support effective troubleshooting
  • Automation experience with Ansible and at least one of Python / Go / Bash
  • Experience integrating or operating services with LDAP (authentication/authorisation, troubleshooting access issues)
Job Responsibility:
  • Provide SRE ownership for the Global Fabric NaaS service, ensuring availability, performance, and resilience
  • Support safe, automated change into production using CI/CD, GitOps, and automated testing
  • Operate and improve monitoring and observability using Dynatrace, Prometheus, and Elasticsearch
  • Troubleshoot incidents across Kubernetes-hosted applications, Linux systems, networking, and service integrations
  • Act as a third-line escalation point, participating in a 24x7 on-call rota
  • Manage incidents via ServiceNow and track defects and improvements in Jira
  • Contribute to Scrum ceremonies and PI planning, supporting Agile delivery
  • Drive automation using Ansible and scripting to reduce operational toil
  • Mentor and support L2 engineers, improving runbooks, troubleshooting practices, and operational readiness
What we offer:
  • Cafeteria package: HUF 600,000/year
  • Performance-based bonus
  • Comprehensive private health care package for all employees, which can be extended to family members
  • Nursery support for mothers returning from maternity leave
  • Extended paternity leave: 10+10 fully paid days
  • Commuting allowance
  • Home office allowance
  • Employee discount opportunities
  • Highly affordable mobile packages for the family as well
  • Car allowance
Employment Type:
Full-time

M&E Engineer

JLL empowers you to shape a brighter way. Our people at JLL are shaping the futu...
Location:
Malaysia, Subang Jaya
Salary:
Not provided
JLL
Expiration Date:
Until further notice
Requirements:
  • Proficient in English through all mediums of communication
  • Ability to write and review good quality detailed work scripts, methods of procedure, and standard operating procedures
  • Ability to prepare incident reports
  • Bachelor’s Degree in Electrical or Mechanical engineering with relevant work experience is required
  • Relevant professional licenses or certifications, such as in energy management, are an advantage
  • Good knowledge of mechanical or electrical system operation and maintenance for systems such as cooling systems, fire suppression and alarm systems, BMS, uninterruptible power supply (UPS), diesel generator and electrical LV & HT power distribution
  • Knowledgeable in HVAC, mechanical or electrical system design, able to study single line diagram, schematic diagram, and systems layout
  • Perform engineering calculations such as cooling load calculation, electrical load calculation, etc.
  • Must be a team player and able to work independently to resolve issues
  • Proficient in computer applications and software, including commercial computerized maintenance management systems, Microsoft Word, PowerPoint and Excel
Job Responsibility:
  • Support all critical operations and maintenance work in a data center facility for full uptime
  • Develop work operation processes, conduct training, preparation of reports, implement operations work practices and liaison with tenants for work implementations
  • Develop installation and maintenance Change Control work applications
  • Implement, or ensure implementation of, Change Control work method statements, including LOTO (lockout/tagout), as required under the Change Management Workflow Process for assigned facilities and systems
  • Responsible for protecting and improving the value of our assets and ensuring the critical engineering systems reliably perform their intended function
  • Assist the site engineer and technician in performing hands-on operations, and conduct equipment/system functional inspection and troubleshooting prior to calling the vendor/equipment supplier specialist on-site
  • Support the engineer in system troubleshooting, ensuring work on critical systems in a critical environment is accomplished efficiently with no unplanned downtime or inconvenience to the customer
  • Monitoring through BMS and site inspection to ensure continuous trouble-free operation
  • Check malfunctioning equipment and ascertain the corrective action required to restore it to satisfactory operating condition
  • Periodically inspect and examine equipment/systems maintained by the maintenance vendor or in-house technician
Employment Type:
Full-time