Senior Reliability Engineer Jobs

126 Job Offers

Senior Software Engineer - Observability and Reliability
Join our San Francisco team as a Senior Software Engineer focused on Observability and Reliability. You will build critical platforms for metrics, logging, tracing, and alerting using Go and Kubernetes. We seek a collaborative engineer with 5+ years of experience, a product mindset for infrastruc...
Location
United States, San Francisco
Salary
150000.00 - 220000.00 USD / Year
Sigma Computing
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Microsoft's Defender team as a Senior Site Reliability Engineer. You will build and deliver secure cloud solutions to protect critical government systems. This role requires a Top Secret clearance and expertise in ensuring 24/7 reliability for large-scale, distributed environments. Drive sec...
Location
United States, Multiple Locations
Salary
119800.00 - 234700.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join our team as a Senior Site Reliability Engineer in New York City. You will build and scale our critical Kafka-based data export system, Currents, handling billions of messages daily. Leverage your expertise in Kubernetes, observability, and Java/Kotlin to ensure reliability and performance. W...
Location
United States, New York City
Salary
129600.00 - 232200.00 USD / Year
Braze
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join our Jira SRE team as a Senior Site Reliability Engineer. You will scale Cloud services, own critical infrastructure, and drive automation with AWS and modern programming languages. This role requires 5+ years of production experience and offers health coverage and paid volunteer days.
Location
Salary
Not provided
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Baxter International as a Senior Site Reliability Engineer in Deerfield. Ensure 24/7 availability for critical healthcare applications on Azure. Leverage your cloud, automation, and scripting expertise in a regulated environment with comprehensive benefits.
Location
United States, Deerfield
Salary
96000.00 - 132000.00 USD / Year
Baxter
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Baxter as a Senior Site Reliability Engineer in Deerfield, US. Ensure 24/7 availability and security for life-saving healthcare platforms on Microsoft Azure. Leverage your cloud infrastructure and automation expertise within a regulated environment. Enjoy comprehensive benefits including hea...
Location
United States, Deerfield
Salary
96000.00 - 132000.00 USD / Year
Baxter
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Atlassian as a Senior Site Reliability Engineer in San Francisco. Architect and automate large-scale cloud infrastructure using Python, Terraform, and AWS to enhance performance for enterprise customers. This role requires expertise in CI/CD, monitoring, and Linux/Windows systems. We offer h...
Location
United States, San Francisco
Salary
180960.00 - 230900.00 USD / Year
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Checkr as a Senior Site Reliability Engineer in Denver or San Francisco. You will design core observability tools, drive platform adoption, and ensure reliability across AWS/Azure environments. This role requires 6+ years of Python/GoLang experience, Kubernetes, and incident response experti...
Location
United States, Denver; San Francisco
Salary
138000.00 - 191000.00 USD / Year
Checkr
Expiration Date
Until further notice
Senior Vice President, Cloud Security Site Reliability Engineer
Lead the Cloud Security SRE transformation at a global scale. Architect and build reliable platforms for container and secrets products across public and private cloud. This Singapore-based senior role requires deep expertise in SRE, Python/Java, Kubernetes, and major cloud providers.
Location
Singapore, Singapore
Salary
Not provided
Citi
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join our SRE/Platform Engineering team in Chennai as a Senior Site Reliability Engineer. You will build, scale, and maintain our AWS & Kubernetes platform, ensuring high reliability and security. We seek an expert with 8-10+ years in AWS, Kubernetes, Terraform, and CI/CD tools. Enjoy competitive ...
Location
India, Chennai
Salary
Not provided
Arcadia
Expiration Date
Until further notice
Senior Software Engineer, Site Reliability
Join Babylist as a Senior Site Reliability Engineer on our Platform team. You will ensure system stability and scalability using AWS, Terraform, and Kubernetes. This remote role in the US/Canada offers strong benefits, including comprehensive health insurance and a supportive, AI-forward environm...
Location
United States; Canada
Salary
186818.00 - 224183.00 USD; CAD / Year
Babylist
Expiration Date
Until further notice
Senior Reliability Engineer - PCBA, Harness & Connectors
Join our team in San Jose as a Senior Reliability Engineer, specializing in PCBA, harness, and connectors. You will develop and execute reliability test plans, utilizing tools like Weibull++ and industry standards (AECQ, JEDEC). Lead DFMEA efforts and provide data-driven recommendations to ensure...
Location
United States, San Jose
Salary
150000.00 - 225000.00 USD / Year
Figure
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join HiveWatch as a Staff Site Reliability Engineer in El Segundo, CA. Architect and maintain mission-critical edge infrastructure for a SaaS platform, ensuring exceptional performance and reliability. Leverage 7+ years of software engineering and 5+ years of SRE expertise with AWS, Kubernetes, a...
Location
United States, El Segundo
Salary
183000.00 - 235000.00 USD / Year
HiveWatch
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Miniclip in Lisbon as a Senior Site Reliability Engineer. You will design resilient systems on AWS using Terraform and containerization, while building observability tools to prevent outages. Automate processes and collaborate with teams to ensure high performance and reliability. Strong cod...
Location
Portugal, Lisbon
Salary
Not provided
Miniclip
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Prolific as a Senior Site Reliability Engineer to ensure platform resilience and performance. You'll leverage your GCP, Kubernetes, and Terraform expertise to build scalable infrastructure in the UK. Champion SRE principles, enhance observability, and enjoy a remote role with competitive ben...
Location
United Kingdom
Salary
Not provided
Prolific
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join our team as a Senior Site Reliability Engineer, focusing on our self-hosted product platform. You will architect and maintain containerized systems (Kubernetes, Docker) and ensure seamless customer deployments. This remote US role offers competitive salary, equity, and comprehensive benefits...
Location
United States
Salary
200000.00 - 220000.00 USD / Year
Tines
Expiration Date
Until further notice
Senior Site Reliability Engineer Cloud Platform
Join Zilliz, a leader in vector database technology for AI. As a Senior SRE, you'll ensure the reliability and scalability of our cloud-native platform using Kubernetes, AWS/GCP/Azure, and Terraform. Automate operations and collaborate with engineers in a dynamic startup environment. Your experti...
Location
Salary
175000.00 - 225000.00 USD / Year
Zilliz
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Affirm in Poland as a Senior Site Reliability Engineer. You will design and operate highly available distributed systems using AWS, Kubernetes, and Python/Kotlin. Drive reliability frameworks, lead incident management, and support a global engineering team. Enjoy premium benefits, including ...
Location
Poland
Salary
301000.00 - 401000.00 PLN / Year
Affirm
Expiration Date
Until further notice
Senior Site Reliability Engineer
Join Affirm in Spain as a Senior Site Reliability Engineer. Design and launch scalable backend systems using Python, Kotlin, AWS, and Kubernetes. Drive reliability, incident management, and tooling for honest financial products. Enjoy comprehensive benefits, including full health coverage and fle...
Location
Spain
Salary
85000.00 - 115000.00 EUR / Year
Affirm
Expiration Date
Until further notice
Senior Electrical Reliability Engineer
Lead electrical reliability efforts for a mill-wide distribution system in Ashdown, USA. Utilize your engineering degree and 3+ years' experience in root cause analysis, capital planning, and KPI tracking. Enjoy a competitive package in a supportive environment focused on safety and continuous im...
Location
United States, Ashdown
Salary
Not provided
Domtar
Expiration Date
Until further notice

About the Senior Reliability Engineer role

Senior Reliability Engineer jobs sit at the intersection of software engineering and IT operations, focusing on building and maintaining highly scalable, fault-tolerant systems. Professionals in this role ensure that complex distributed systems remain available, performant, and resilient under demanding workloads. As organizations increasingly rely on cloud-native architectures, demand for these specialists continues to grow across industries.

The core mission of a Senior Reliability Engineer is to bridge the gap between development and operations, applying a software engineering mindset to infrastructure challenges. These professionals design, implement, and manage the systems that keep digital services running smoothly. They work extensively with cloud platforms, containerization technologies, and orchestration tools to create automated, self-healing infrastructure. A significant portion of their work involves developing and maintaining CI/CD pipelines, implementing Infrastructure as Code practices, and building monitoring and observability solutions that provide real-time visibility into system health.

Typical responsibilities include automating operational tasks to eliminate manual processes, participating in incident response and post-mortem analysis, and driving continuous improvement through root cause analysis. Senior Reliability Engineers often serve as the technical authority during outages, coordinating response efforts and implementing preventive measures. They collaborate closely with software development teams to ensure new features are designed with operability and scalability in mind, often influencing architectural decisions early in the development lifecycle. Performance tuning, capacity planning, and cost optimization of cloud resources are also common duties.

To succeed in senior reliability engineering jobs, candidates typically need strong programming skills in languages such as Python, Go, or Ruby, combined with deep expertise in Linux system administration. Experience with configuration management and infrastructure-as-code tools such as Puppet, Ansible, or Terraform is essential. A thorough understanding of networking, security best practices, and database management rounds out the technical requirements. Beyond technical skills, these roles demand strong problem-solving and communication skills and a willingness to take part in on-call rotations. Many positions also value experience with incident response frameworks and formal post-incident review processes.

The profession attracts individuals who enjoy solving complex infrastructure puzzles, automating away repetitive work, and building systems that can gracefully handle failure. As digital transformation accelerates across all sectors, senior reliability engineer jobs will remain vital for organizations that cannot tolerate downtime or degraded performance. These professionals ultimately enable businesses to deliver reliable, fast, and secure digital experiences to their users at global scale.