CrawlJobs Logo
Briefcase Icon
Category Icon

Filters

×

Senior Software Engineer, Site Reliability Jobs

2 Job Offers

Filters
Senior Software Engineer - Site Reliability Engineer/SRE
Save Icon
Join our team in Sunnyvale as a Senior SRE. You will ensure the reliability of software-defined vehicle platforms using your expertise in Azure, Terraform, and monitoring tools. This role requires strong DevOps fundamentals, on-call experience, and modern SRE practices. Relocation support is avai...
Location Icon
Location
United States , Sunnyvale
Salary Icon
Salary
152100.00 - 232900.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Senior Software Engineer, Site Reliability
Save Icon
Join Babylist as a Senior Site Reliability Engineer on our Platform team. You will ensure system stability and scalability using AWS, Terraform, and Kubernetes. This remote role in the US/Canada offers strong benefits, including comprehensive health insurance and a supportive, AI-forward environm...
Location Icon
Location
United States; Canada
Salary Icon
Salary
186818.00 - 224183.00 USD; CAD / Year
babylist.com Logo
Babylist
Expiration Date
Until further notice

About the Senior Software Engineer, Site Reliability role

Explore Senior Software Engineer, Site Reliability jobs and discover a critical career at the intersection of software development and IT operations. Professionals in this role, often referred to as SREs (Site Reliability Engineers), are the architects of system resilience, dedicated to building and scaling highly reliable, efficient, and automated platforms. Their core mission is to balance the need for rapid innovation with the imperative of system stability, ensuring that services are available, performant, and capable of meeting user demand. This is not merely an administrative role; it is a software engineering discipline applied to infrastructure and operational challenges.

Typically, individuals in these positions spend a significant portion of their time on engineering tasks aimed at automating operational work, eliminating manual toil, and preventing future issues. Common responsibilities include designing, building, and maintaining cloud infrastructure using Infrastructure as Code (IaC) principles with tools like Terraform or CloudFormation. They develop and optimize robust Continuous Integration and Continuous Deployment (CI/CD) pipelines to enable safe and rapid software delivery. A major focus is on implementing comprehensive observability through monitoring, logging, and alerting systems to gain deep insights into system health. SREs also establish Service Level Objectives (SLOs) and Error Budgets to quantitatively manage reliability. Furthermore, they are integral in capacity planning, performance analysis, and conducting post-incident reviews to foster a culture of continuous learning and improvement.

The typical skill set for Senior Software Engineer, Site Reliability jobs is broad and deep. It requires strong software engineering fundamentals, often in languages like Go, Python, or Java, coupled with profound expertise in cloud platforms such as AWS, Google Cloud, or Microsoft Azure. Proficiency with containerization (Docker) and orchestration systems (Kubernetes) is standard. A solid understanding of networking, distributed systems, and database fundamentals is essential. Beyond technical prowess, successful SREs possess exceptional problem-solving and debugging skills to troubleshoot complex system issues. They must have strong communication and collaboration abilities to work effectively with development teams, advocating for reliability best practices. Experience with on-call rotations and incident management is a common requirement, emphasizing the role's responsibility for live system support. For those passionate about building scalable systems and solving complex puzzles, Senior Software Engineer, Site Reliability jobs offer a challenging and impactful career path where code meets production reality.

Filters

×
Countries
Category
Location
Work Mode
Salary