CrawlJobs Logo
Briefcase Icon
Category Icon

Reliability Engineer I United States, San Francisco Jobs

10 Job Offers

Filters
New
Senior Software Engineer - Observability and Reliability
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
150000.00 - 220000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
New
Senior Software Engineer - Observability and Reliability
Save Icon
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
170000.00 - 215000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our SRE team to scale Atlassian's Cloud products. You'll own caching infrastructure, tooling, and automation across key US tech hubs. We seek engineers with 1+ years in cloud (AWS/GCP/Azure), Linux, and backend coding (Java/Go/Python). Enjoy health coverage, paid volunteer days, and wellness...
Location Icon
Location
United States , San Francisco; Austin; Mountain View; Washington DC; Seattle; New York
Salary Icon
Salary
116700.00 - 187400.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Principal Site Reliability Engineer
Save Icon
Join our SRE team as a Principal Engineer in San Francisco or Mountain View. You will advocate for reliability, scaling cloud services on AWS/GCP/Azure, and driving complex initiatives. We seek an expert with 8+ years in Java and 5+ in cloud & high-availability systems. Enjoy equity, bonuses, and...
Location Icon
Location
United States , San Francisco; Mountain View
Salary Icon
Salary
170800.00 - 274300.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Atlassian as a Senior Site Reliability Engineer in San Francisco. Architect and automate large-scale cloud infrastructure using Python, Terraform, and AWS to enhance performance for enterprise customers. This role requires expertise in CI/CD, monitoring, and Linux/Windows systems. We offer h...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180960.00 - 230900.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Checkr as a Senior Site Reliability Engineer in Denver or San Francisco. You will design core observability tools, drive platform adoption, and ensure reliability across AWS/Azure environments. This role requires 6+ years of Python/GoLang experience, Kubernetes, and incident response experti...
Location Icon
Location
United States , Denver; San Francisco
Salary Icon
Salary
138000.00 - 191000.00 USD / Year
https://checkr.com Logo
Checkr
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our team as a Site Reliability Engineer in New York City or San Francisco. You will optimize platform uptime, manage CI/CD pipelines, and enhance observability using AWS, Kubernetes, and Terraform. We offer unlimited PTO, comprehensive insurance, catered meals, and a competitive equity packa...
Location Icon
Location
United States , New York City; San Francisco
Salary Icon
Salary
160000.00 - 300000.00 USD / Year
hebbia.ai Logo
Hebbia
Expiration Date
Until further notice
Senior Software Engineer - Observability and Reliability
Save Icon
Join our San Francisco team as a Senior Software Engineer focused on Observability and Reliability. You will build critical platforms for metrics, logging, tracing, and alerting using Go and Kubernetes. We seek a collaborative engineer with 5+ years of experience, a product mindset for infrastruc...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
170000.00 - 215000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Director of Engineering & Reliability
Save Icon
Lead engineering excellence and reliability for hyperscale AI data centers at Crusoe. This Director role in San Francisco sets technical standards and governs MEP systems across a growing 50-400 MW portfolio. You will implement reliability programs, optimize performance, and lead a team to ensure...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
216000.00 - 260000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Site Reliability Engineer, Cloud Infrastructure
Save Icon
Join Quizlet as a Site Reliability Engineer in San Francisco. Enhance system reliability and scalability using your software engineering skills in Python, Go, or PHP. You'll automate infrastructure, work with CI/CD and Terraform, and ensure resilient, cloud-native systems. Enjoy competitive benef...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
120000.00 - 168488.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Explore Reliability Engineer I jobs and launch a career dedicated to ensuring system integrity and operational excellence. A Reliability Engineer I is an entry-level professional focused on proactively preventing failures, optimizing performance, and maximizing the uptime of critical systems. This foundational role exists across diverse industries, from technology and software to manufacturing, energy, and industrial operations. While the specific systems vary—encompassing software applications, cloud infrastructure, or physical machinery like rotating equipment—the core mission is universal: to build and maintain resilient, efficient, and dependable operations through engineering principles. Individuals in these roles typically engage in a blend of monitoring, analysis, automation, and collaboration. Common responsibilities include assisting in the design and implementation of monitoring and alerting systems to gain visibility into system health. They analyze performance data, incident reports, and maintenance records to identify patterns and potential points of failure. A significant part of the role involves contributing to automation efforts, writing scripts to manage infrastructure, deploy applications, or streamline repetitive operational tasks, thereby reducing manual toil and human error. Reliability Engineers also participate in incident response, helping to diagnose and resolve issues, and contribute to post-incident reviews to document root causes and implement preventive measures. They often work closely with development and operations teams to advocate for reliability standards, such as scalable architecture and robust fault tolerance, throughout a system's lifecycle. Typical skills and requirements for Reliability Engineer I positions include a strong foundational understanding of engineering concepts, often supported by a bachelor's degree in computer science, engineering, or a related technical field. Key technical proficiencies often include scripting or programming languages (like Python, Bash, or Shell), familiarity with operating systems, and an introductory knowledge of relevant domain tools. For software-centric roles, this might mean basic knowledge of cloud platforms (AWS, Azure, GCP), containerization (Docker, Kubernetes), and CI/CD pipelines. For industrial roles, understanding mechanical systems, statistical analysis, and reliability methodologies (like Root Cause Analysis or Failure Mode and Effects Analysis) is crucial. Regardless of the domain, successful candidates demonstrate a problem-solving mindset, a passion for automation, keen analytical abilities, and effective communication skills to collaborate across teams. Pursuing Reliability Engineer I jobs is ideal for those who enjoy the intersection of development and operations, possess a meticulous attention to detail, and derive satisfaction from building systems that users and businesses can depend on every day. It is a career path built on continuous learning and offers a clear trajectory for growth into more senior engineering and specialist positions.

Filters

×
Category
Location
Work Mode
Salary