CrawlJobs Logo
Briefcase Icon
Category Icon

Principal Site Reliability Engineer Jobs (Remote work)

5 Job Offers

Filters
Principal Site Reliability Engineer (AI-first SRE)
Save Icon
Lead the AI-driven reliability transformation at Groupon as a Principal SRE. Architect self-healing systems using AI/ML, GCP/AWS, and Kubernetes to ensure 99.9%+ availability. Leverage your 10+ years of experience to build predictive, intelligent platforms in a transformative, remote-friendly env...
Location Icon
Location
Peru
Salary Icon
Salary
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Principal Site Reliability Engineer
Save Icon
Lead the evolution to AI-driven resilience as a Principal SRE at Groupon. Architect self-healing systems on GCP/AWS with Kubernetes and Terraform, leveraging AIOps for predictive reliability. This role in Colombia offers a chance to shape global platform stability with cutting-edge tech and signi...
Location Icon
Location
Colombia
Salary Icon
Salary
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Principal Site Reliability Engineer
Save Icon
Lead the evolution to AI-driven resilience as a Principal SRE at Groupon. Architect self-healing systems on GCP/AWS with Kubernetes and Terraform, leveraging AIOps for predictive operations. This role in Ecuador offers a chance to shape global platform reliability with cutting-edge tech and signi...
Location Icon
Location
Ecuador
Salary Icon
Salary
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Principal Site Reliability Engineer
Save Icon
Lead the CVML Platform team as a Principal SRE, architecting a secure, cost-effective hybrid infrastructure for robotics. Integrate edge devices, on-prem, and cloud (AWS, K8s) using Terraform, Python, and Go. Optimize performance and stability while collaborating cross-functionally in the autonom...
Location Icon
Location
United States
Salary Icon
Salary
166000.00 - 293000.00 USD / Year
bluerivertechnology.com Logo
Blue River Technology
Expiration Date
Until further notice
Principal Site Reliability Engineer (AI-first SRE)
Save Icon
Lead the AI-driven reliability transformation at Groupon as a Principal SRE. You will architect self-healing systems using AI/ML, GCP/AWS, and Kubernetes to ensure 99.9%+ availability. This role requires 10+ years of experience, expertise in AIOps, and offers a chance to shape scalable, predictiv...
Location Icon
Location
Salary Icon
Salary
Not provided
groupon.com Logo
Groupon
Expiration Date
Until further notice
Pursue Principal Site Reliability Engineer Jobs and step into a role that sits at the strategic apex of software engineering and IT operations. A Principal Site Reliability Engineer (SRE) is a senior-level expert and technical leader responsible for architecting, building, and advocating for highly scalable, resilient, and efficient software systems. This is not merely an operational role; it is a strategic position focused on engineering solutions that prevent problems before they occur, thereby ensuring that critical services meet their reliability and performance objectives, often defined by Service Level Objectives (SLOs). Professionals in these jobs are the bridge between development teams and operational needs, instilling a culture of reliability and continuous improvement across an entire organization. The common responsibilities of a Principal SRE are extensive and leadership-oriented. Typically, they involve designing and implementing the overall reliability architecture for complex, distributed systems. This includes developing strategies to incorporate SRE principles—such as error budgets, toil automation, and blameless post-mortems—into the product lifecycle from the very beginning. A key duty is the deep analysis of system performance and availability, using advanced data analytics and often prototyping machine learning models for anomaly detection and trend forecasting. They are champions of automation, creating sophisticated systems to manage infrastructure as code, automate common operational procedures, and streamline incident response. Furthermore, Principal SREs are tasked with evaluating and integrating new technologies, creating organization-wide standards for software design and development, and conducting rigorous capacity planning to ensure systems scale efficiently and cost-effectively. The typical skills and requirements for these high-level jobs are demanding, reflecting the seniority and breadth of the position. A strong background in software engineering is paramount, with expert-level proficiency in languages like Java, Go, or Python. Deep, hands-on experience with major public cloud platforms (AWS, GCP, Azure) and their service offerings is essential. Candidates are expected to have a comprehensive understanding of distributed systems architecture, networking fundamentals, and modern database technologies (both SQL and NoSQL). Beyond technical prowess, exceptional leadership and communication skills are non-negotiable. Principal SREs must be able to influence technical and business strategy, drive large, cross-organizational initiatives to completion, and effectively mentor and coach other engineers. They are problem-solvers at their core, with a proven history of innovation and a passion for building systems that are not just functional, but fundamentally robust and elegant. If you are seeking Principal Site Reliability Engineer Jobs, you are looking for a career-defining role where your technical vision and leadership will directly shape the technological backbone of a business.

Filters

×
Countries
Category
Location
Work Mode
Salary