
Site Reliability Engineering (SRE) Jobs (Remote work)

2 Job Offers

Principal Site Reliability Engineer (AI-first SRE)
Lead the AI-driven reliability transformation at Groupon as a Principal SRE. Architect self-healing systems using AI/ML, GCP/AWS, and Kubernetes to ensure 99.9%+ availability. Leverage your 10+ years of experience to build predictive, intelligent platforms in a transformative, remote-friendly env...
Location
Peru
Salary
Not provided
Groupon
Expiration Date
Until further notice
Principal Site Reliability Engineer (AI-first SRE)
Lead the AI-driven reliability transformation at Groupon as a Principal SRE. You will architect self-healing systems using AI/ML, GCP/AWS, and Kubernetes to ensure 99.9%+ availability. This role requires 10+ years of experience, expertise in AIOps, and offers a chance to shape scalable, predictiv...
Location
Salary
Not provided
Groupon
Expiration Date
Until further notice
Explore Site Reliability Engineering (SRE) jobs, a field where software engineering meets operations to build scalable, reliable, and efficient systems. SRE is a discipline that applies a software engineering mindset to infrastructure and operations problems. Site Reliability Engineers bridge development and IT operations, ensuring that services are highly available, performant, and resilient. Their core mission is to systematically eliminate manual operational work through automation while keeping the focus on end-user experience and system health.

The typical responsibilities of an SRE are multifaceted. A primary duty is ensuring service reliability and availability, usually measured against Service Level Objectives (SLOs) and governed by error budgets. This involves designing, building, and maintaining monitoring, alerting, and observability platforms that give deep insight into system behavior. SREs also work proactively on capacity planning, performance tuning, and disaster recovery strategies. A significant portion of their work is automation: writing software to automate repetitive tasks, manage infrastructure as code, and streamline deployment pipelines. When incidents occur, SREs lead the response and conduct thorough post-mortems and root cause analysis to implement permanent fixes and prevent future outages. They also collaborate closely with development teams to advocate for reliability best practices from the initial design phase, often by developing tools and frameworks that improve the entire software development lifecycle.

To succeed in SRE jobs, a specific blend of skills is required. A strong software engineering background is fundamental, with proficiency in programming languages like Python, Go, or Java, and scripting in Bash or PowerShell. Deep knowledge of modern infrastructure is essential, including cloud platforms (AWS, GCP, Azure), containerization with Docker, and orchestration with Kubernetes. Experience with Infrastructure as Code tools like Terraform, Ansible, or Puppet is standard. SREs must have a solid grasp of networking, operating systems (Linux/Unix), and database management. Equally important are the analytical and problem-solving skills needed to diagnose issues in complex distributed systems. Familiarity with the full CI/CD pipeline and a commitment to a DevOps culture of collaboration and shared responsibility are crucial. Soft skills such as effective communication, a proactive mindset, and a commitment to blameless post-mortems are highly valued in this collaborative, high-stakes field.

For those passionate about building robust systems, solving intricate puzzles, and writing code to automate infrastructure, Site Reliability Engineering offers a challenging and rewarding career path. SRE jobs sit at the heart of modern digital enterprises, making them a strong fit for anyone who wants to directly affect product stability and user satisfaction. Discover your next opportunity in this essential tech discipline.
