CrawlJobs Logo

Filters

Location
Salary

Senior Reliability Engineer Jobs

126 Job Offers

Senior Maintenance Reliability Engineer
Save Icon
Lead reliability programs for key manufacturing assets in Washington, IN. This senior role requires a BS in Engineering and 5+ years' industrial reliability experience, including RCM and Root Cause Analysis. Drive uptime through precision maintenance and predictive technologies. Relocation assist...
Location Icon
Location
United States , Washington
Salary Icon
Salary
Not provided
perduefarms.com Logo
Perdue Farms
Expiration Date
Until further notice
Senior Machine Learning Site Reliability Engineer
Save Icon
Join Prima's Infrastructure team in Milan as a Senior ML SRE. You will design and operate scalable systems on AWS/Kubernetes, applying strong Python and MLOps practices. Drive reliability, automation, and performance in a dynamic, regulated insurance environment. Enjoy tailored growth plans and c...
Location Icon
Location
Italy , Milan
Salary Icon
Salary
Not provided
prima.it Logo
Prima
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Fivetran as a Senior Site Reliability Engineer in Denver. You will ensure 100% availability of our data platform using Kubernetes, AWS/GCP, and Terraform. Leverage your 5+ years of SaaS experience to monitor systems and drive reliability. Enjoy top benefits like full medical coverage, RSUs, ...
Location Icon
Location
United States , Denver
Salary Icon
Salary
149927.34 - 179920.00 USD / Year
fivetran.com Logo
Fivetran
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Fivetran as a Senior Site Reliability Engineer in Oakland. You will ensure 100% availability of our data platform using Kubernetes, AWS/GCP, and Terraform. We offer top benefits like full medical coverage, RSUs, and a strong PTO policy. Help build a reliable, scalable infrastructure for a le...
Location Icon
Location
United States , Oakland
Salary Icon
Salary
166585.93 - 199911.11 USD / Year
fivetran.com Logo
Fivetran
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join our team as a Senior Site Reliability Engineer in Brno or Prague. You will manage our GCP and Kubernetes-based analytics platform, ensuring high availability and performance. Key tasks include automation with Python/Go, incident management, and maintaining CI/CD pipelines. Enjoy a virtual-fi...
Location Icon
Location
Czechia , Brno; Prague
Salary Icon
Salary
Not provided
bloomreach.com Logo
Bloomreach
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join our team as a Senior Site Reliability Engineer in Bratislava. You will ensure the reliability of our GCP & Kubernetes-based analytics platform, managing databases like IMF and Kafka. Key tasks include automation with Python/Go, incident management, and maintaining CI/CD pipelines. Enjoy a vi...
Location Icon
Location
Slovakia , Bratislava
Salary Icon
Salary
4200.00 EUR / Month
bloomreach.com Logo
Bloomreach
Expiration Date
Until further notice

About the Senior Reliability Engineer role

Senior Reliability Engineer jobs represent a critical intersection between software engineering and IT operations, focusing on building and maintaining highly scalable, fault-tolerant systems. Professionals in this role are responsible for ensuring that complex distributed systems remain available, performant, and resilient under demanding workloads. As organizations increasingly rely on cloud-native architectures, the demand for these specialists continues to grow across industries.

The core mission of a Senior Reliability Engineer is to bridge the gap between development and operations, applying a software engineering mindset to infrastructure challenges. These professionals design, implement, and manage the systems that keep digital services running smoothly. They work extensively with cloud platforms, containerization technologies, and orchestration tools to create automated, self-healing infrastructure. A significant portion of their work involves developing and maintaining CI/CD pipelines, implementing Infrastructure as Code practices, and building monitoring and observability solutions that provide real-time visibility into system health.

Typical responsibilities include automating operational tasks to eliminate manual processes, participating in incident response and post-mortem analysis, and driving continuous improvement through root cause analysis. Senior Reliability Engineers often serve as the technical authority during outages, coordinating response efforts and implementing preventive measures. They collaborate closely with software development teams to ensure new features are designed with operability and scalability in mind, often influencing architectural decisions early in the development lifecycle. Performance tuning, capacity planning, and cost optimization of cloud resources are also common duties.

To succeed in senior reliability engineering jobs, candidates typically need strong programming skills in languages like Python, Go, or Ruby, combined with deep expertise in Linux system administration. Experience with configuration management tools such as Puppet, Ansible, or Terraform is essential. A thorough understanding of networking, security best practices, and database management rounds out the technical requirements. Beyond technical skills, these roles demand excellent problem-solving abilities, strong communication skills, and the capacity to work effectively in on-call rotations. Many positions also value experience with incident response frameworks and formal post-incident review processes.

The profession attracts individuals who enjoy solving complex infrastructure puzzles, automating away repetitive work, and building systems that can gracefully handle failure. As digital transformation accelerates across all sectors, senior reliability engineer jobs will remain vital for organizations that cannot tolerate downtime or degraded performance. These professionals ultimately enable businesses to deliver reliable, fast, and secure digital experiences to their users at global scale.