Site Reliability Engineer Jobs (On-site work)

90 Job Offers

Software Engineer, Site Reliability
Join Fireworks AI as a Site Reliability Engineer in San Mateo. You will ensure the reliability and performance of our world-scale AI cloud using your expertise in SRE, Kubernetes, and public cloud platforms. Partner with top engineers to automate and scale cutting-edge, distributed systems. This ...
Location: United States, San Mateo
Salary: Not provided
Company: Fireworks AI (fireworks.ai)
Expiration Date: Until further notice

Site Reliability Engineer
Join our team as a Site Reliability Engineer in New York City or San Francisco. You will optimize platform uptime, manage CI/CD pipelines, and enhance observability using AWS, Kubernetes, and Terraform. We offer unlimited PTO, comprehensive insurance, catered meals, and a competitive equity packa...
Location: United States, New York City; San Francisco
Salary: 160,000 - 300,000 USD / Year
Company: Hebbia (hebbia.ai)
Expiration Date: Until further notice

Senior Site Reliability Engineer
Join HiveWatch as a Staff Site Reliability Engineer in El Segundo, CA. Architect and maintain mission-critical edge infrastructure for a SaaS platform, ensuring exceptional performance and reliability. Leverage 7+ years of software engineering and 5+ years of SRE expertise with AWS, Kubernetes, a...
Location: United States, El Segundo
Salary: 183,000 - 235,000 USD / Year
Company: HiveWatch (hivewatch.com)
Expiration Date: Until further notice

Site Reliability Engineer
Join our team in Barcelona as a Site Reliability Engineer. Design and maintain scalable AWS infrastructure using Terraform, Docker, and Kubernetes. Implement robust monitoring with DataDog and CloudWatch. Enjoy competitive pay, equity, private healthcare, and a flexible work model.
Location: Spain, Barcelona
Salary: Not provided
Company: Yokoy (yokoy.io)
Expiration Date: Until further notice

Site Reliability Engineer 2
Join FreeWheel's Global Operations team as a Site Reliability Engineer 2 in Denver or Chicago. You'll ensure system reliability and performance by managing infrastructure, automating operations, and resolving technical issues. The role requires 1-3 years of SRE/DevOps experience, proficiency in P...
Location: United States, Chicago; Englewood; Philadelphia
Salary: 84,478.50 - 126,717.75 USD / Year
Company: Comcast Advertising (comcastadvertising.com)
Expiration Date: Until further notice

Site Reliability Engineer
Join Optiver as a Site Reliability Engineer in Amsterdam. Maintain and enhance the reliability of our proprietary, low-latency trading infrastructure. Utilize your Python automation and Unix/Linux skills in a world-class, on-premise data center environment. Enjoy top-tier benefits and a collabora...
Location: Netherlands, Amsterdam
Salary: Not provided
Company: Optiver (optiver.com)
Expiration Date: Until further notice

Site Reliability Engineer III
Join Zuora's Cloud Engineering team as a Site Reliability Engineer III in Chennai. You will ensure high-availability for SaaS platforms using AWS, Terraform, and Jenkins. This role requires 6-8 years of SRE experience with strong Linux and Python skills. We offer competitive compensation, flexibl...
Location: India, Chennai
Salary: Not provided
Company: Zuora (zuora.com)
Expiration Date: Until further notice

Product Infrastructure Engineer - Site Reliability
Join Zyphra as a Product Infrastructure Engineer - Site Reliability in Palo Alto. Design and maintain robust, scalable systems for ML workloads using IaC tools like Terraform. Ensure reliability, security, and observability while collaborating with ML and DevOps teams. Enjoy comprehensive benefit...
Location: United States, Palo Alto
Salary: Not provided
Company: Zyphra (zyphra.com)
Expiration Date: Until further notice

Site Reliability Engineer
Join our team as a Site Reliability Engineer in Kochi or Trivandrum. You will ensure high availability of our 24x7 Azure cloud environment, performing L1/L2 support and proactive monitoring. The role requires strong Azure, Windows Server, and networking expertise. Bring your troubleshooting skill...
Location: India, Kochi; Trivandrum
Salary: Not provided
Company: Experion Technologies (experionglobal.com)
Expiration Date: Until further notice

Site Reliability Engineer
Join Crusoe as a Site Reliability Engineer in Dublin. You'll ensure infrastructure reliability, automate processes, and maintain high SLAs/SLOs. Key requirements include 1-3 years of SRE experience, proficiency in Python/Go, and expertise with Kubernetes and Terraform. Enjoy benefits like private...
Location: Ireland, Dublin
Salary: Not provided
Company: Crusoe (crusoe.ai)
Expiration Date: Until further notice

About the Site Reliability Engineer role

Site Reliability Engineering (SRE) bridges the gap between development and operations, and the Site Reliability Engineer jobs listed here are aimed at engineers who work across both. A Site Reliability Engineer is a specialized software engineer focused on building scalable, highly reliable software systems. The core philosophy of the role is to treat operational challenges as software problems: applying engineering principles to automate solutions, improve system resilience, and streamline processes. SREs are typically responsible for the entire lifecycle of a service, from design and deployment to monitoring and incident response, ensuring that systems are not only functional but also efficient and robust under real-world conditions.

Typical responsibilities for a Site Reliability Engineer are multifaceted. A primary duty is defining Service Level Indicators (SLIs), the metrics that quantify a system's reliability and performance, and upholding Service Level Objectives (SLOs), the targets set for those metrics. This involves extensive work in observability: implementing comprehensive logging, metrics, and tracing to understand how the system behaves. SREs work to eliminate manual, repetitive work (often called "toil") through automation, using infrastructure as code (IaC) tools to manage provisioning and configuration. They architect and build internal platforms and tooling that enable consistent deployments and efficient operations. SREs are also on the front lines of incident management, owning production issues from detection through to resolution and conducting blameless post-mortems to prevent recurrence.

The skill set required for Site Reliability Engineer jobs blends software engineering with systems expertise. Proficiency in a programming language such as Python, Go, or Java is essential for automation and tool development. A deep understanding of operating systems, networking, and cloud platforms (such as AWS, GCP, or Azure) forms the foundation. Experience with containerization and orchestration technologies like Docker and Kubernetes is now standard, as is familiarity with CI/CD pipelines and IaC tools like Terraform or Ansible. Beyond technical skills, successful SREs bring strong problem-solving ability, a proactive mindset focused on prevention, and the collaboration skills to work closely with development and product teams. If you are an engineer who thrives at the intersection of code, infrastructure, and operational excellence, Site Reliability Engineer jobs offer a challenging and rewarding career path at the heart of modern technology organizations.