CrawlJobs Logo
Briefcase Icon
Category Icon

Site Reliability Engineer Jobs

313 Job Offers

Filters
Site Reliability Engineer
Save Icon
Join iCapital's Site Reliability Engineering team in Lisbon. Apply your 7+ years of SRE expertise to design scalable systems, implement observability with SLOs/SLIs, and automate on AWS/Kubernetes. Enjoy a competitive package with bonus, equity, and full health coverage.
Location Icon
Location
Portugal , Lisbon
Salary Icon
Salary
Not provided
icapital.com Logo
iCapital Network
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Cinder as a Senior Site Reliability Engineer in New York. You will architect and evolve our robust, compliant cloud infrastructure using AWS, Terraform, and Kubernetes. Drive operational excellence, ensure high reliability, and make a substantial impact at a fast-growing AI startup. We offer...
Location Icon
Location
United States , New York
Salary Icon
Salary
180000.00 - 240000.00 USD / Year
cinder.co Logo
Cinder
Expiration Date
Until further notice
Staff Software Engineer - Site Reliability
Save Icon
Join Ironclad as a Staff Site Reliability Engineer in San Francisco or NYC. Build a scalable cloud platform using Kubernetes, AWS, and Terraform. Ensure enterprise-grade reliability while mentoring a team. Enjoy top-tier health coverage, generous leave, and flexible PTO.
Location Icon
Location
United States , San Francisco; New York City
Salary Icon
Salary
210000.00 - 235000.00 USD / Year
ironcladapp.com Logo
Ironclad
Expiration Date
Until further notice
Senior Site Reliability Engineer - Fleet Reliability
Save Icon
Join Lambda's mission to make superintelligence compute ubiquitous. As a Senior SRE for Fleet Reliability, you'll define health metrics, build automation, and ensure system availability in San Francisco. Leverage your 7+ years in SRE/DevOps, Python/Go skills, and AI infrastructure expertise. We o...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
230000.00 - 345000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Replit's Site Reliability Engineering team to ensure the reliability and scalability of our global platform. You will design observability solutions, automate infrastructure, and implement SLOs using Kubernetes and cloud-native tech. We offer competitive salary, equity, full benefits, and a ...
Location Icon
Location
United States
Salary Icon
Salary
160000.00 - 250000.00 USD / Year
replit.com Logo
Replit
Expiration Date
Until further notice
Staff Site Reliability Engineer
Save Icon
Join Replit as a Staff Site Reliability Engineer to ensure the reliability and scalability of our global platform. You will architect observability, lead incident response, and drive automation using Kubernetes and cloud-native tech. This US-based role offers competitive salary, equity, health be...
Location Icon
Location
United States
Salary Icon
Salary
220000.00 - 325000.00 USD / Year
replit.com Logo
Replit
Expiration Date
Until further notice
Site Reliability Engineer - Linux & KDB
Save Icon
Join Barclays Technology in Pune as a Site Reliability Engineer specializing in Linux & KDB. You will build and maintain critical data pipelines, warehouses, and lakes, ensuring security and accuracy. Collaborate with data scientists on ML models and lead complex tasks in a cutting-edge FinTech e...
Location Icon
Location
India , Pune
Salary Icon
Salary
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Forward Deployed Engineer - Site Reliability / Infrastructure
Save Icon
Join Lambda as a Forward Deployed Engineer, embedding directly with a strategic customer in Bellevue or San Francisco. You will architect and ship full-stack infrastructure solutions for critical AI/ML workloads using Kubernetes, Go, and Python. This role requires deep SRE expertise to navigate a...
Location Icon
Location
United States , Bellevue, WA, San Francisco Office
Salary Icon
Salary
240000.00 - 425000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Senior Site Reliability Engineer - Networking
Save Icon
Join Lambda to build the world's premier AI cloud infrastructure. As a Senior SRE - Networking, you'll scale our high-performance, multi-tenant cloud using SDN, Spine/Leaf architecture, and automation tools like Python and Ansible. This role requires 5+ years of network reliability experience, Ku...
Location Icon
Location
United States , San Francisco; San Jose; Bellevue
Salary Icon
Salary
227000.00 - 401000.00 USD / Year
lambda.ai Logo
Lambda
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join MaintainX as a Site Reliability Engineer in Montreal or Toronto. You'll enhance platform reliability and observability, partnering with engineering teams. Apply your 3-5+ years in SRE/DevOps and cloud-native expertise to build resilient systems. Enjoy competitive pay, equity, full benefits, ...
Location Icon
Location
Canada , Montreal & Toronto
Salary Icon
Salary
Not provided
getmaintainx.com Logo
MaintainX
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join our team as a Senior Site Reliability Engineer in Austin. You will autonomously manage and optimize large-scale infrastructure (5,000+ hosts) with Kafka, Redis, and Kubernetes. Drive system stability, lead incident response, and enhance observability in a cloud environment. We offer comprehe...
Location Icon
Location
United States , Austin
Salary Icon
Salary
185000.00 - 225000.00 USD / Year
bumble.com Logo
Bumble Inc.
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Together AI in San Francisco as a Site Reliability Engineer. You'll ensure smooth operations for user-facing services using Ansible, Terraform, and Kubernetes. Apply your expertise in monitoring, cloud services, and automation to build scalable, reliable infrastructure. Enjoy competitive com...
Location Icon
Location
United States of America , San Francisco
Salary Icon
Salary
150000.00 - 200000.00 USD / Year
together.ai Logo
Together AI
Expiration Date
Until further notice
Lead Site Reliability Engineer
Save Icon
Location Icon
Location
Salary Icon
Salary
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Lead Site Reliability Engineer
Save Icon
Location Icon
Location
Salary Icon
Salary
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Intermediate Site Reliability Engineer SRE – AI Reliability & Automation
Save Icon
Join PointClickCare as an Intermediate SRE focused on AI Reliability & Automation in Mississauga. You will build ML-based anomaly detection, self-healing systems, and AI-driven automation using Python, Azure, and Kubernetes. We seek 5+ years in software engineering with AI/ML production experienc...
Location Icon
Location
Canada , Mississauga
Salary Icon
Salary
115000.00 - 128000.00 CAD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Location Icon
Location
Australia , Sydney, Melbourne
Salary Icon
Salary
Not provided
heidihealth.com Logo
Heidi
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
140000.00 - 185000.00 USD / Year
heidihealth.com Logo
Heidi
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Location Icon
Location
United Kingdom , London
Salary Icon
Salary
Not provided
heidihealth.com Logo
Heidi
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join Zuora's Operations team in Costa Rica as a Site Reliability Engineer. You'll ensure the reliability and scalability of our SaaS platform using AI, automation, and modern tools like Kubernetes and Python. We seek an expert in incident management, predictive monitoring, and building self-heali...
Location Icon
Location
Costa Rica
Salary Icon
Salary
Not provided
zuora.com Logo
Zuora
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our team in Medellin as a Site Reliability Engineer. You will partner with development teams to ensure high availability and optimize AWS cloud infrastructure. Your expertise in Kubernetes, Terraform, and CI/CD pipelines will be key to automating workflows and improving system reliability. T...
Location Icon
Location
Colombia , Medellin
Salary Icon
Salary
20.00 USD / Hour
signifytechnology.com Logo
Signify Technology
Expiration Date
Until further notice
Explore the dynamic and critical field of Site Reliability Engineering (SRE) and discover a wealth of Site Reliability Engineer jobs designed for those who bridge the gap between development and operations. A Site Reliability Engineer is a specialized software engineer focused on creating scalable and highly reliable software systems. The core philosophy of the role is to treat operational challenges as software problems, applying engineering principles to automate solutions, improve system resilience, and streamline processes. Professionals in this field are responsible for the entire lifecycle of a service, from design and deployment to monitoring and incident response, ensuring that systems are not only functional but also efficient and robust under real-world conditions. Typical responsibilities for a Site Reliability Engineer are multifaceted. A primary duty is defining and upholding Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to quantitatively measure a system's reliability and performance. This involves extensive work in observability, implementing comprehensive logging, metrics, and tracing to gain deep insights into system behavior. SREs actively work to eliminate manual, repetitive work (often called "toil") through automation, using infrastructure as code (IaC) tools to manage provisioning and configuration. They architect and build internal platforms and tooling that enable consistent deployments and efficient operations. Crucially, SREs are on the front lines of incident management, owning production issues from detection through to resolution and conducting blameless post-mortems to prevent future occurrences. The skill set required for Site Reliability Engineer jobs is a powerful blend of software engineering and systems expertise. Proficiency in programming languages like Python, Go, or Java is essential for automation and tool development. A deep understanding of operating systems, networking, and cloud platforms (such as AWS, GCP, or Azure) forms the foundation. Experience with containerization and orchestration technologies like Docker and Kubernetes is now standard, as is familiarity with CI/CD pipelines and IaC tools like Terraform or Ansible. Beyond technical prowess, successful SREs possess strong problem-solving skills, a proactive mindset focused on prevention, and excellent collaboration abilities to work closely with development and product teams. They are driven by a passion for building systems that are scalable, secure, and resilient. If you are an engineer who thrives at the intersection of code, infrastructure, and operational excellence, pursuing Site Reliability Engineer jobs offers a challenging and rewarding career path at the heart of modern technology organizations.

Filters

×
Countries
Category
Location
Work Mode
Salary