CrawlJobs Logo

Filters

Location
Salary
Clear all filters

Site Reliability Engineer United States Jobs

172 Job Offers

Senior Site Reliability Engineer
Save Icon
Join Cinder as a Senior Site Reliability Engineer in New York. You will architect and evolve our robust, compliant cloud infrastructure using AWS, Terraform, and Kubernetes. Drive operational excellence, ensure high reliability, and make a substantial impact at a fast-growing AI startup. We offer...
Location Icon
Location
United States , New York
Salary Icon
Salary
180000.00 - 240000.00 USD / Year
cinder.co Logo
Cinder
Expiration Date
Until further notice
Staff Software Engineer - Site Reliability
Save Icon
Join Ironclad as a Staff Site Reliability Engineer in San Francisco or NYC. Build a scalable cloud platform using Kubernetes, AWS, and Terraform. Ensure enterprise-grade reliability while mentoring a team. Enjoy top-tier health coverage, generous leave, and flexible PTO.
Location Icon
Location
United States , San Francisco; New York City
Salary Icon
Salary
210000.00 - 235000.00 USD / Year
ironcladapp.com Logo
Ironclad
Expiration Date
Until further notice
Senior Site Reliability Engineer - Fleet Reliability
Save Icon
Join Lambda's mission to make superintelligence compute ubiquitous. As a Senior SRE for Fleet Reliability, you'll define health metrics, build automation, and ensure system availability in San Francisco. Leverage your 7+ years in SRE/DevOps, Python/Go skills, and AI infrastructure expertise. We o...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
230000.00 - 345000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Replit's Site Reliability Engineering team to ensure the reliability and scalability of our global platform. You will design observability solutions, automate infrastructure, and implement SLOs using Kubernetes and cloud-native tech. We offer competitive salary, equity, full benefits, and a ...
Location Icon
Location
United States
Salary Icon
Salary
160000.00 - 250000.00 USD / Year
replit.com Logo
Replit
Expiration Date
Until further notice
Staff Site Reliability Engineer
Save Icon
Join Replit as a Staff Site Reliability Engineer to ensure the reliability and scalability of our global platform. You will architect observability, lead incident response, and drive automation using Kubernetes and cloud-native tech. This US-based role offers competitive salary, equity, health be...
Location Icon
Location
United States
Salary Icon
Salary
220000.00 - 325000.00 USD / Year
replit.com Logo
Replit
Expiration Date
Until further notice
Forward Deployed Engineer - Site Reliability / Infrastructure
Save Icon
Join Lambda as a Forward Deployed Engineer, embedding directly with a strategic customer in Bellevue or San Francisco. You will architect and ship full-stack infrastructure solutions for critical AI/ML workloads using Kubernetes, Go, and Python. This role requires deep SRE expertise to navigate a...
Location Icon
Location
United States , Bellevue, WA, San Francisco Office
Salary Icon
Salary
240000.00 - 425000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Senior Site Reliability Engineer - Networking
Save Icon
Join Lambda to build the world's premier AI cloud infrastructure. As a Senior SRE - Networking, you'll scale our high-performance, multi-tenant cloud using SDN, Spine/Leaf architecture, and automation tools like Python and Ansible. This role requires 5+ years of network reliability experience, Ku...
Location Icon
Location
United States , San Francisco; San Jose; Bellevue
Salary Icon
Salary
227000.00 - 401000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join our team as a Senior Site Reliability Engineer in Austin. You will autonomously manage and optimize large-scale infrastructure (5,000+ hosts) with Kafka, Redis, and Kubernetes. Drive system stability, lead incident response, and enhance observability in a cloud environment. We offer comprehe...
Location Icon
Location
United States , Austin
Salary Icon
Salary
185000.00 - 225000.00 USD / Year
bumble.com Logo
Bumble Inc.
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
140000.00 - 185000.00 USD / Year
heidihealth.com Logo
Heidi
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our tech modernization as a Site Reliability Engineer in Irving, USA. You will design performance and resiliency tests for Azure cloud migration, using Docker, Python, and observability tools. We seek strong microservices support experience with Java/Spring Boot. Enjoy comprehensive benefits...
Location Icon
Location
United States , Irving
Salary Icon
Salary
61.00 - 70.00 USD / Hour
apexsystems.com Logo
Apex Systems
Expiration Date
Until further notice
Site Reliability Engineer Staff
Save Icon
Join our team as a Site Reliability Engineer Staff in San Juan. You will design and enhance cloud infrastructure using AWS/GCP, Terraform, and Kubernetes. Develop robust CI/CD pipelines, strengthen security, and ensure system reliability with tools like Prometheus and Grafana. This role requires ...
Location Icon
Location
United States , San Juan
Salary Icon
Salary
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Site Reliability Engineer II
Save Icon
Location Icon
Location
United States , North Bethesda, MD or Boston, Massachusetts
Salary Icon
Salary
135000.00 - 165000.00 USD / Year
cherry.vc Logo
Cherry Ventures
Expiration Date
Until further notice
Site Reliability Engineer II
Save Icon
Join our team as a Site Reliability Engineer II in Peachtree Corners. You will enhance the reliability of our global Kubernetes platform on AWS/Azure, using Python or Go. We seek a US citizen with 3+ years in cloud-native platform engineering and CI/CD. Enjoy competitive pay, 401k match, comprehe...
Location Icon
Location
United States , Peachtree Corners
Salary Icon
Salary
115500.00 - 184800.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Location Icon
Location
United States , Austin
Salary Icon
Salary
129600.00 - 232200.00 USD / Year
braze.com Logo
Braze
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
129600.00 - 232200.00 USD / Year
braze.com Logo
Braze
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Braze as a Senior Site Reliability Engineer in Chicago. You will ensure massive-scale platform reliability for over 3.3 billion monthly users. Apply your 5+ years of SRE experience with Go/Ruby, Kubernetes, and Terraform to automate and solve complex challenges. Enjoy competitive equity, fle...
Location Icon
Location
United States , Chicago
Salary Icon
Salary
128842.00 - 232200.00 USD / Year
braze.com Logo
Braze
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Location Icon
Location
United States , Austin
Salary Icon
Salary
128842.00 - 232200.00 USD / Year
braze.com Logo
Braze
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
144000.00 - 258000.00 USD / Year
braze.com Logo
Braze
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Location Icon
Location
United States , Redmond
Salary Icon
Salary
84200.00 - 165200.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Site Reliability Engineer 2
Save Icon
Join the M365 Copilot App Platform team as a Site Reliability Engineer in Redmond. You will ensure the reliability and performance of critical platform APIs and infrastructure for a key Microsoft AI product. This role requires expertise in cloud services, distributed systems, and software enginee...
Location Icon
Location
United States , Redmond
Salary Icon
Salary
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice

About the Site Reliability Engineer role

Explore the dynamic and critical field of Site Reliability Engineering (SRE) and discover a wealth of Site Reliability Engineer jobs designed for those who bridge the gap between development and operations. A Site Reliability Engineer is a specialized software engineer focused on creating scalable and highly reliable software systems. The core philosophy of the role is to treat operational challenges as software problems, applying engineering principles to automate solutions, improve system resilience, and streamline processes. Professionals in this field are responsible for the entire lifecycle of a service, from design and deployment to monitoring and incident response, ensuring that systems are not only functional but also efficient and robust under real-world conditions.

Typical responsibilities for a Site Reliability Engineer are multifaceted. A primary duty is defining and upholding Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to quantitatively measure a system's reliability and performance. This involves extensive work in observability, implementing comprehensive logging, metrics, and tracing to gain deep insights into system behavior. SREs actively work to eliminate manual, repetitive work (often called "toil") through automation, using infrastructure as code (IaC) tools to manage provisioning and configuration. They architect and build internal platforms and tooling that enable consistent deployments and efficient operations. Crucially, SREs are on the front lines of incident management, owning production issues from detection through to resolution and conducting blameless post-mortems to prevent future occurrences.

The skill set required for Site Reliability Engineer jobs is a powerful blend of software engineering and systems expertise. Proficiency in programming languages like Python, Go, or Java is essential for automation and tool development. A deep understanding of operating systems, networking, and cloud platforms (such as AWS, GCP, or Azure) forms the foundation. Experience with containerization and orchestration technologies like Docker and Kubernetes is now standard, as is familiarity with CI/CD pipelines and IaC tools like Terraform or Ansible. Beyond technical prowess, successful SREs possess strong problem-solving skills, a proactive mindset focused on prevention, and excellent collaboration abilities to work closely with development and product teams. They are driven by a passion for building systems that are scalable, secure, and resilient. If you are an engineer who thrives at the intersection of code, infrastructure, and operational excellence, pursuing Site Reliability Engineer jobs offers a challenging and rewarding career path at the heart of modern technology organizations.