CrawlJobs Logo

Filters

Location
Salary

Reliability Manager Jobs

27 Job Offers

Engineering Manager, Kernel Reliability
Save Icon
Location Icon
Location
United States; Canada , Sunnyvale; Toronto
Salary Icon
Salary
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Site Reliability Engineering Manager
Save Icon
Lead and mentor SRE teams at HPE in Bangalore, owning reliability for SASE cloud infrastructure. Drive automation, observability, and best practices using Kubernetes, Terraform, and Python. Requires 7-10 years' SRE/DevOps experience and 2+ years leading cloud teams. Enjoy comprehensive wellbeing ...
Location Icon
Location
India , Bangalore
Salary Icon
Salary
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Manager, Site Reliability Engineering and Incident Management
Save Icon
Lead our Site Reliability Engineering and Incident Management team in Atlanta. You will drive platform resilience, oversee critical incident response, and mentor a skilled team. This role requires deep cloud expertise and a passion for building reliable, scalable systems in a fast-paced SaaS envi...
Location Icon
Location
United States , Atlanta
Salary Icon
Salary
118000.00 - 160000.00 USD / Year
planetdds.com Logo
Planet DDS
Expiration Date
Until further notice
Site Reliability Engineering Manager
Save Icon
Lead a globally distributed SRE team at the Wikimedia Foundation, supporting infrastructure used by hundreds of millions. Utilize your hands-on expertise in cloud, Linux, Kubernetes, and IaC to guide critical projects and ensure reliability. This remote US role offers the chance to mentor enginee...
Location Icon
Location
United States of America
Salary Icon
Salary
132439.00 - 208378.00 USD / Year
wikimediafoundation.org Logo
Wikimedia Foundation
Expiration Date
Until further notice
Manager, Reliability
Save Icon
Lead reliability engineering initiatives in Big Spring, ensuring safe production and continuous improvement. Manage projects, analyze data, and guide a team of engineers. Enjoy day-one medical benefits, a 10% 401K match, and performance incentives.
Location Icon
Location
United States , Big Spring
Salary Icon
Salary
Not provided
delekus.com Logo
Delek US
Expiration Date
Until further notice
Senior Site Reliability Manager
Save Icon
Lead the SRE transformation for NAM and APAC operations from Sunnyvale. This senior management role requires 12+ years in SRE, with expertise in cloud platforms, automation, and mentoring engineering managers. You will ensure 24/7 operational stability, drive incident management, and foster a cul...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
135600.00 - 200000.00 USD / Year
commscope.com Logo
CommScope
Expiration Date
Until further notice
Reliability Manager
Save Icon
Lead reliability and maintenance engineering at our Rotterdam chemical plant. You will define long-term strategies, drive preventive maintenance, and lead a skilled team. We seek an experienced engineer with a strong background in asset integrity and process optimization. Enjoy a competitive sala...
Location Icon
Location
Netherlands , Rotterdam
Salary Icon
Salary
Not provided
lanxess.com Logo
LANXESS
Expiration Date
Until further notice

About the Reliability Manager role

The pursuit of **Reliability Manager jobs** opens the door to a critical, high-impact career focused on ensuring that complex systems, machinery, and digital platforms operate consistently, safely, and efficiently. Professionals in this field are the architects of operational stability, bridging the gap between engineering, maintenance, and IT to minimize downtime and maximize performance. While the specific industry can vary—from heavy manufacturing to cloud-based software—the core mission remains the same: to design, implement, and oversee strategies that prevent failures before they occur.

A Reliability Manager’s typical responsibilities are both strategic and hands-on. They are responsible for developing and maintaining a comprehensive reliability program that includes preventive, predictive, and condition-based maintenance. This often involves managing a Computerized Maintenance Management System (CMMS) to track asset health, work orders, and spare parts inventory. A significant part of the role is data analysis: monitoring key performance indicators like Mean Time Between Failures (MTBF) and Overall Equipment Effectiveness (OEE) to identify trends and root causes of recurring issues. When failures do happen, these managers lead Root Cause Analysis (RCA) investigations to implement permanent corrective actions. For those in digital or software environments, the role expands into Site Reliability Engineering (SRE), where the focus is on automating operations, managing cloud infrastructure, ensuring high availability (99.9%+ uptime), and using AI-driven observability to create self-healing systems. In all contexts, the Reliability Manager is a key liaison, collaborating with operations, engineering, IT, and finance teams to balance reliability investments with business goals.

To succeed in **Reliability Manager jobs**, candidates need a robust blend of technical expertise and leadership skills. On the technical side, a strong foundation in engineering (mechanical, electrical, or industrial) is common, often supported by a bachelor’s degree. Deep knowledge of maintenance strategies, condition monitoring technologies (like vibration analysis or thermography), and CMMS software is essential. For SRE-focused roles, proficiency in cloud platforms (AWS, Azure, GCP), containerization (Kubernetes), CI/CD pipelines, and scripting languages (Python, Bash) is mandatory. Beyond hard skills, these roles demand exceptional problem-solving abilities, data-driven decision-making, and the capacity to influence and coach cross-functional teams. Communication is paramount, as Reliability Managers must translate complex technical findings into actionable insights for C-suite stakeholders and frontline workers alike. Experience managing global or multi-site teams, handling large budgets, and driving cultural change toward proactive reliability is highly valued.

Ultimately, **Reliability Manager jobs** offer a dynamic career for those who thrive on preventing problems and optimizing systems. Whether ensuring a factory floor runs without interruption or that a global software platform remains accessible, these professionals are the guardians of uptime and efficiency. The role is constantly evolving, now incorporating artificial intelligence and automation to predict failures before they happen. For anyone passionate about data, continuous improvement, and making critical systems run better, this is a challenging and rewarding path that directly impacts an organization’s bottom line and reputation.