This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The DevOps team are responsible for managing, monitoring and optimizing our SaaS platform, i.e. ensuring system Availability, Durability, Reliability, Resiliency and Fault Tolerance. They support and work closely with our in-house developers, providing a world class developer experience. As Xelix continues to expand, so have the demands on our DevOps team. We are seeking an additional mid-level DevOps Engineer to own delivery of the initiatives on our 2026 Tech Roadmap, for example: Improve our CI/CD process to allow quicker, more frequent and easier deployments; Reduce our AWS Cloud Spend by rightsizing, replacing services with others or developing a strategy for using spot instances; Ensure our platform scales to support our client growth; Remove manual or tedious process across the business through smart scripting and automations; Maintain platform uptime SLAs by designing reliable, resilient architecture with intelligent monitoring and alerting techniques. Our platform runs on Amazon Web Services, managed with Terraform so familiarity with common AWS Services (RDS, S3, ECS, EC2 etc.) is essential for this role. Xelix handles large volumes of sensitive commercial data, making security crucial across all our activities. In this role you will build on our ISO:27001 and SOC 2 accreditations by helping to implement and maintain security best practices.
Job Responsibility:
Operating production AWS workloads
Provision, upgrade and maintain AWS infrastructure using Terraform
Release engineering: Maintain and improve CI/CD pipelines through Jenkins / Github Actions and other technology
Help design solution architecture for new business features
Manage cloud costs by planning usage, rightsizing and ensuring unused infrastructure is removed
Support the Development and AI Engineering Teams - help troubleshoot their issues
Bring observability through dashboards, alerting and log aggregation