CrawlJobs Logo
Briefcase Icon
Category Icon

Site Reliability Engineer Jobs (Hybrid work)

64 Job Offers

Filters
Site Reliability Engineer
Save Icon
Join our Manchester team as a Site Reliability Engineer. You'll architect resilient, self-healing systems on Kubernetes and major cloud platforms (AWS/Azure/GCP). Proactively integrate AIOps to automate operations and predict failures. Enjoy equity, 30+ days holiday, health insurance, and a forwa...
Location Icon
Location
United Kingdom , Manchester
Salary Icon
Salary
49600.00 - 74400.00 GBP / Year
matillion.com Logo
Matillion
Expiration Date
Until further notice
Site Reliability Engineer - FedRAMP
Save Icon
Join Confluent as a Site Reliability Engineer in Toronto, supporting our FedRAMP-compliant platform. You'll ensure high availability for federal clients using Kubernetes, Terraform, and cloud monitoring tools. This remote-first role offers equity, flexible time off, and a collaborative culture fo...
Location Icon
Location
Canada , Toronto
Salary Icon
Salary
113200.00 - 130200.00 CAD / Year
confluent.io Logo
Confluent
Expiration Date
Until further notice
Site Reliability Engineer - FedRAMP
Save Icon
Join Confluent as a Site Reliability Engineer in Austin, supporting our FedRAMP-compliant platform. You'll ensure high availability for federal agencies, deploying changes and managing incidents for cloud-native systems. This role requires 0-2 years of SRE experience, proficiency in Kubernetes, T...
Location Icon
Location
United States , Austin
Salary Icon
Salary
137400.00 - 158000.00 USD / Year
confluent.io Logo
Confluent
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our team in Guadalajara as a Site Reliability Engineer. You will ensure the efficiency of Azure cloud systems, manage incident responses, and perform deployments. The role requires Azure certification, experience with Kubernetes, Python, and a strong problem-solving ability. Work with cuttin...
Location Icon
Location
Mexico , Guadalajara
Salary Icon
Salary
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Explore the dynamic and critical field of Site Reliability Engineering (SRE) and discover a wealth of Site Reliability Engineer jobs designed for those who bridge the gap between development and operations. A Site Reliability Engineer is a specialized software engineer focused on creating scalable and highly reliable software systems. The core philosophy of the role is to treat operational challenges as software problems, applying engineering principles to automate solutions, improve system resilience, and streamline processes. Professionals in this field are responsible for the entire lifecycle of a service, from design and deployment to monitoring and incident response, ensuring that systems are not only functional but also efficient and robust under real-world conditions. Typical responsibilities for a Site Reliability Engineer are multifaceted. A primary duty is defining and upholding Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to quantitatively measure a system's reliability and performance. This involves extensive work in observability, implementing comprehensive logging, metrics, and tracing to gain deep insights into system behavior. SREs actively work to eliminate manual, repetitive work (often called "toil") through automation, using infrastructure as code (IaC) tools to manage provisioning and configuration. They architect and build internal platforms and tooling that enable consistent deployments and efficient operations. Crucially, SREs are on the front lines of incident management, owning production issues from detection through to resolution and conducting blameless post-mortems to prevent future occurrences. The skill set required for Site Reliability Engineer jobs is a powerful blend of software engineering and systems expertise. Proficiency in programming languages like Python, Go, or Java is essential for automation and tool development. A deep understanding of operating systems, networking, and cloud platforms (such as AWS, GCP, or Azure) forms the foundation. Experience with containerization and orchestration technologies like Docker and Kubernetes is now standard, as is familiarity with CI/CD pipelines and IaC tools like Terraform or Ansible. Beyond technical prowess, successful SREs possess strong problem-solving skills, a proactive mindset focused on prevention, and excellent collaboration abilities to work closely with development and product teams. They are driven by a passion for building systems that are scalable, secure, and resilient. If you are an engineer who thrives at the intersection of code, infrastructure, and operational excellence, pursuing Site Reliability Engineer jobs offers a challenging and rewarding career path at the heart of modern technology organizations.

Filters

×
Countries
Category
Location
Work Mode
Salary