Join Esker as a Site Reliability Engineer. We’re seeking a Site Reliability Engineer to join our US-based SRE team of 10 engineers, part of a global SRE organization of 35+ professionals responsible for the reliability, scalability, and performance of our multi-tenant SaaS platform. In this role, you’ll work hands-on with large-scale production systems, collaborating with experienced engineers while helping drive automation, observability, and reliability improvements across the platform. This is an exciting opportunity for someone moving from systems administration or DevOps into SRE, or for an early-career SRE looking to deepen their expertise across a broad and modern technology stack—all while supporting a platform used by millions of users worldwide.
Job Responsibilities:
Improve platform reliability and availability by participating in our on-call rotation, learning from real production incidents, and helping evolve our incident response and post‑incident practices
Increase delivery speed and consistency through infrastructure and deployment automation using Terraform, Ansible, and Azure DevOps
Reduce time to detect and resolve issues by building actionable monitoring, alerting, and dashboards that improve visibility into system health and performance
Enable scalable growth by designing and operating infrastructure that supports a rapidly expanding global customer base across Azure and on‑premises environments
Empower engineering and support teams by delivering self-service tools, automation, and guardrails that allow safe and efficient production operations
Strengthen a culture of reliability and continuous improvement through agile collaboration, knowledge sharing, blameless postmortems, and cross‑team initiatives
Requirements:
3+ years in a Site Reliability, DevOps, or related role supporting production systems
2+ years of hands-on experience operating and maintaining Linux systems in live production environments (Windows Server experience is a plus)
Bachelor's degree in Computer Science or related field preferred (equivalent experience considered)
Practical experience with Infrastructure as Code tools such as Terraform, Ansible, Chef, Puppet, or similar
A solid foundation in networking concepts, including TCP/IP, DNS, firewalls, and proxying
Experience working with relational and/or NoSQL databases in production environments
Scripting proficiency in one or more languages such as Python, Bash, or PowerShell
What we offer:
Student loan repayment assistance
Flexible work schedule, summer hours, and work from home options
Profit sharing options
Paid time off for community outreach and volunteer opportunities
Yearly stipend for employee wellness, hobbies, or educational activities