Staff Site Reliability Engineer Jobs in the United States

11 Job Offers

Sr Staff / Principal Site Reliability Engineer - Network & Security Operations
Seeking a Sr Staff / Principal Site Reliability Engineer for Network & Security Operations in Santa Clara, CA. You will lead IaC automation using Terraform & Ansible, manage cloud orchestration on GCP/GKE, and design global compute infrastructure. Requires 8+ years of SRE experience, expert firew...
Location: United States, Santa Clara
Salary: 154000.00 - 249500.00 USD / Year
Company: Palo Alto Networks
Expiration Date: Until further notice

Staff Site Reliability Engineer
Location: United States, Oakland
Salary: 196033.00 - 245041.50 USD / Year
Company: Fivetran
Expiration Date: Until further notice

Member of Technical Staff, Site Reliability Engineer (HPC)
Join Microsoft's AI mission as an HPC Site Reliability Engineer in Mountain View. Ensure the reliability and efficiency of large-scale distributed AI infrastructure powering cutting-edge model training. Leverage your expertise in Kubernetes, cloud platforms, and automation within a high-performan...
Location: United States, Mountain View
Salary: 139900.00 - 274800.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice

Staff Site Reliability Engineer, Storage
Location: United States, San Francisco; Sunnyvale
Salary: 204000.00 - 247000.00 USD / Year
Company: Crusoe
Expiration Date: Until further notice

Staff Site Reliability Engineer, Managed AI
Location: United States, San Francisco; Sunnyvale
Salary: 204000.00 - 247000.00 USD / Year
Company: Crusoe
Expiration Date: Until further notice

Staff Site Reliability Engineer
Location: United States, Lehi
Salary: 242050.00 USD / Year
Company: Sunrun
Expiration Date: Until further notice

Staff Software Engineer - Site Reliability
Join Ironclad as a Staff Site Reliability Engineer in San Francisco or NYC. Build a scalable cloud platform using Kubernetes, AWS, and Terraform. Ensure enterprise-grade reliability while mentoring a team. Enjoy top-tier health coverage, generous leave, and flexible PTO.
Location: United States, San Francisco; New York City
Salary: 210000.00 - 235000.00 USD / Year
Company: Ironclad
Expiration Date: Until further notice

Staff Site Reliability Engineer
Join Replit as a Staff Site Reliability Engineer to ensure the reliability and scalability of our global platform. You will architect observability, lead incident response, and drive automation using Kubernetes and cloud-native tech. This US-based role offers competitive salary, equity, health be...
Location: United States
Salary: 220000.00 - 325000.00 USD / Year
Company: Replit
Expiration Date: Until further notice

Site Reliability Engineer Staff
Join our team as a Site Reliability Engineer Staff in San Juan. You will design and enhance cloud infrastructure using AWS/GCP, Terraform, and Kubernetes. Develop robust CI/CD pipelines, strengthen security, and ensure system reliability with tools like Prometheus and Grafana. This role requires ...
Location: United States, San Juan
Salary: Not provided
Company: Hewlett Packard Enterprise
Expiration Date: Until further notice

Staff Site Reliability Engineer
Join our team as a Staff Site Reliability Engineer in Sunnyvale. You will own critical internal infrastructure, migrating systems and ensuring high availability. We seek expertise in cloud platforms (AWS/Azure/GCP), IaC (Terraform/Ansible), and building automated, reliable systems. Drive automati...
Location: United States, Sunnyvale
Salary: 175000.00 - 250000.00 USD / Year
Company: Figure
Expiration Date: Until further notice

Staff Site Reliability Engineer
Lead our infrastructure reliability strategy as a Staff Site Reliability Engineer. Architect large-scale, fault-tolerant AWS systems using Terraform and ECS expertise. Drive technical initiatives, mentor engineers, and tackle complex operational challenges. This remote US role offers a discretion...
Location: United States
Salary: 151040.00 - 188800.00 USD / Year
Company: Bugcrowd
Expiration Date: Until further notice

About the Staff Site Reliability Engineer role

Staff Site Reliability Engineer jobs sit at the intersection of software engineering and systems operations. A Staff Site Reliability Engineer (SRE) is a senior-level practitioner responsible for keeping large-scale, complex systems reliable, scalable, and efficient. The role goes beyond traditional system administration by applying software engineering principles to operational problems: automating manual processes and building resilient infrastructure. Staff SREs also act as force multipliers, embedding reliability practices into an organization's engineering culture and strategic direction.

The core mission of a Staff SRE is to balance rapid innovation against system stability. Common responsibilities include designing and implementing observability frameworks (logging, monitoring, and alerting) that give deep insight into system health. Staff SREs define and manage Service Level Objectives (SLOs) and error budgets, which set quantifiable reliability targets that align business and engineering goals. A significant portion of the role is proactive engineering: automating deployment, scaling, and recovery procedures to eliminate toil, and architecting systems with resilience patterns such as circuit breakers and bulkheads. Staff SREs also lead incident response, conduct rigorous, blameless post-mortem analyses to foster a culture of learning, and champion chaos engineering to uncover system weaknesses before they cause outages.
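
The relationship between an SLO and its error budget comes down to simple arithmetic. The sketch below is illustrative only: the function names, the 99.9% target, the 30-day window, and the request counts are assumptions chosen for the example, not figures from any listing on this page.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of allowed unavailability for a time-based SLO.

    slo_target is a fraction, e.g. 0.999 for "99.9% available".
    """
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be in (0, 1]")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Fraction of a request-based error budget still unspent.

    A negative result means the budget is exhausted (the SLO is being missed).
    """
    allowed_failures = (1.0 - slo_target) * total_requests
    if allowed_failures == 0:
        # A 100% target (or zero traffic) leaves no budget to spend.
        return 0.0
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # A 99.9% target over 30 days allows roughly 43.2 minutes of downtime.
    print(f"{error_budget_minutes(0.999):.1f} minutes of budget")
    # 250 failures out of 1,000,000 requests spends about a quarter of the budget.
    print(f"{budget_remaining(0.999, 1_000_000, 250):.0%} of budget remaining")
```

Teams commonly use a number like the remaining fraction to decide whether to keep shipping features or to pause and invest in reliability work; the exact policy varies by organization.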

Typical skills and requirements for these senior positions are extensive. Candidates generally possess 8+ years of experience in cloud engineering or software development, with substantial expertise in cloud-native ecosystems (e.g., AWS, GCP, Azure). Proficiency in infrastructure-as-code tools like Terraform, container orchestration with Kubernetes, and CI/CD pipelines is standard. Strong programming or scripting skills in languages such as Python or Go are essential for building automation and tooling. Beyond technical acumen, a successful Staff SRE demonstrates exceptional soft skills: the ability to influence and mentor across teams, define strategic roadmaps, communicate complex concepts clearly, and navigate ambiguous, high-pressure environments. They are customer-focused problem-solvers who translate operational data into engineering improvements, systematically reducing technical debt while advocating for scalable architectures.

Ultimately, Staff Site Reliability Engineer jobs are for those who view operational excellence as a software challenge. These leaders don't just keep the lights on; they design systems that are inherently more reliable and empower entire organizations to build and ship software with confidence. If you are passionate about building resilient, automated systems and driving cultural change toward DevOps and SRE principles, exploring Staff SRE opportunities could be your next career-defining move.