At Instructure, we believe in the power of people to grow and succeed throughout their lives. Our goal is to amplify that power by creating intuitive products that simplify learning and personal development, facilitate meaningful relationships, and inspire people to go further in their education and careers. We do this by giving smart, creative, passionate people opportunities to create awesome. By focusing on our values (ownership, openness, simplicity, equality, relationships, and learning) and using a modern stack that includes AWS, Docker, Kubernetes, Vault, Terraform, Ruby, and Go, we are pushing the boundaries of managing services at scale.
We are seeking an experienced and highly skilled Senior Site Reliability Engineer to join our growing team. In this role, you will be a key contributor to building and maintaining a highly reliable, scalable, and performant infrastructure on AWS. You will apply a software engineering mindset to solve operational challenges, automate everything, and ensure that our platform remains stable and responsive as we scale. The ideal candidate has a deep understanding of the AWS ecosystem, a strong background in automation, and a passion for operational excellence.
Job Responsibilities:
Partnering with engineering teams to automate and optimize service availability, scalability, performance, monitoring, and alerting
Developing software to automate the toil required to keep our SaaS offerings highly available, scalable, and resilient, while delivering quickly
Building and maintaining platforms and services that are force multipliers used by internal teams
Developing and maintaining methodologies for iteratively deploying Instructure’s cloud-based architecture
Building resilient and self-scaling systems so that you can sleep through the night
Pushing the boundaries of cloud automation and gaining hands-on experience running services on AWS
Requirements:
A desire for ownership and a love of problem-solving and learning
Experience managing a 24/7 SaaS with amazing uptime
Experience with at least one cloud platform (Amazon Web Services [preferred], Google Cloud, or Microsoft Azure)
Programming experience in at least one modern programming language (Python, Go, TypeScript / Node.js, Java, or Ruby)
Experience with containers (Docker) or container orchestration (Kubernetes, ECS, Mesos)
Deep understanding of configuration management, automation, and infrastructure-as-code tools (Ansible, Terraform, or Puppet)