CrawlJobs Logo
Briefcase Icon
Category Icon

Reliability Engineer United States, San Francisco Jobs

8 Job Offers

Filters
Site Reliability Engineer
Save Icon
Join our SRE team to scale Atlassian's Cloud products. You'll own caching infrastructure, tooling, and automation across key US tech hubs. We seek engineers with 1+ years in cloud (AWS/GCP/Azure), Linux, and backend coding (Java/Go/Python). Enjoy health coverage, paid volunteer days, and wellness...
Location Icon
Location
United States , San Francisco; Austin; Mountain View; Washington DC; Seattle; New York
Salary Icon
Salary
116700.00 - 187400.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Principal Site Reliability Engineer
Save Icon
Join our SRE team as a Principal Engineer in San Francisco or Mountain View. You will advocate for reliability, scaling cloud services on AWS/GCP/Azure, and driving complex initiatives. We seek an expert with 8+ years in Java and 5+ in cloud & high-availability systems. Enjoy equity, bonuses, and...
Location Icon
Location
United States , San Francisco; Mountain View
Salary Icon
Salary
170800.00 - 274300.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Atlassian as a Senior Site Reliability Engineer in San Francisco. Architect and automate large-scale cloud infrastructure using Python, Terraform, and AWS to enhance performance for enterprise customers. This role requires expertise in CI/CD, monitoring, and Linux/Windows systems. We offer h...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
180960.00 - 230900.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Senior Site Reliability Engineer
Save Icon
Join Checkr as a Senior Site Reliability Engineer in Denver or San Francisco. You will design core observability tools, drive platform adoption, and ensure reliability across AWS/Azure environments. This role requires 6+ years of Python/GoLang experience, Kubernetes, and incident response experti...
Location Icon
Location
United States , Denver; San Francisco
Salary Icon
Salary
138000.00 - 191000.00 USD / Year
https://checkr.com Logo
Checkr
Expiration Date
Until further notice
Site Reliability Engineer
Save Icon
Join our team as a Site Reliability Engineer in New York City or San Francisco. You will optimize platform uptime, manage CI/CD pipelines, and enhance observability using AWS, Kubernetes, and Terraform. We offer unlimited PTO, comprehensive insurance, catered meals, and a competitive equity packa...
Location Icon
Location
United States , New York City; San Francisco
Salary Icon
Salary
160000.00 - 300000.00 USD / Year
hebbia.ai Logo
Hebbia
Expiration Date
Until further notice
Senior Software Engineer - Observability and Reliability
Save Icon
Join our San Francisco team as a Senior Software Engineer focused on Observability and Reliability. You will build critical platforms for metrics, logging, tracing, and alerting using Go and Kubernetes. We seek a collaborative engineer with 5+ years of experience, a product mindset for infrastruc...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
170000.00 - 215000.00 USD / Year
sigmacomputing.com Logo
Sigma Computing
Expiration Date
Until further notice
Director of Engineering & Reliability
Save Icon
Lead engineering excellence and reliability for hyperscale AI data centers at Crusoe. This Director role in San Francisco sets technical standards and governs MEP systems across a growing 50-400 MW portfolio. You will implement reliability programs, optimize performance, and lead a team to ensure...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
216000.00 - 260000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Site Reliability Engineer, Cloud Infrastructure
Save Icon
Join Quizlet as a Site Reliability Engineer in San Francisco. Enhance system reliability and scalability using your software engineering skills in Python, Go, or PHP. You'll automate infrastructure, work with CI/CD and Terraform, and ensure resilient, cloud-native systems. Enjoy competitive benef...
Location Icon
Location
United States , San Francisco
Salary Icon
Salary
120000.00 - 168488.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Explore a dynamic and critical career path with Reliability Engineer jobs, a profession dedicated to ensuring systems and assets operate with maximum uptime, performance, and efficiency. Reliability Engineers are the guardians of operational integrity, applying engineering principles and data-driven analysis to prevent failures, optimize performance, and implement sustainable processes. This field broadly splits into two key domains: IT/Software Reliability and Industrial/Physical Asset Reliability, both united by the core mission of building and maintaining resilient systems. In the technology sector, often titled Site Reliability Engineer (SRE), professionals blend software engineering and systems administration to create scalable and highly reliable software platforms. Their general responsibilities include designing and automating infrastructure deployment, building robust monitoring and alerting systems, and managing incident response through on-call rotations. They focus on key service level indicators (SLIs) and objectives (SLOs) to measure and improve user experience. Typical tasks involve writing code for automation, conducting post-incident reviews (blameless postmortems), and collaborating with development teams to embed reliability into the software lifecycle from the start. Common requirements for these roles include proficiency in programming/scripting (e.g., Python, Go), expertise in cloud platforms (AWS, Azure, GCP), container orchestration (Kubernetes), and infrastructure-as-code tools, alongside a strong grasp of CI/CD pipelines and observability stacks. Conversely, in industrial settings like manufacturing, energy, or oil and gas, Reliability Engineers focus on physical assets such as rotating equipment, electrical systems, and production machinery. Their work is centered on predictive and preventive maintenance strategies. General duties involve analyzing equipment performance data, conducting Root Cause Failure Analysis (RCFA), developing maintenance procedures, and managing reliability-centered maintenance (RCM) programs. They use statistical analysis and reliability modeling to predict asset lifecycles, recommend improvements, and manage risk. Typical skills include a strong mechanical or electrical engineering foundation, knowledge of condition monitoring technologies (vibration analysis, thermography), familiarity with Computerized Maintenance Management Systems (CMMS), and expertise in process safety and lifecycle cost analysis. Across both domains, successful Reliability Engineers are systematic problem-solvers with a proactive mindset. They possess strong analytical skills to interpret complex data, excellent communication skills to collaborate across teams and justify investments, and a relentless focus on continuous improvement. Whether ensuring a global web service remains online or a refinery operates safely and efficiently, Reliability Engineer jobs are foundational to modern operational excellence. For those passionate about building systems that don't fail and optimizing performance through engineering, this profession offers a challenging and impactful career with opportunities spanning virtually every industry.

Filters

×
Category
Location
Work Mode
Salary