This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a DevOps Engineer III, you will be part of the L3 support team for Operations across Edge/on‑prem and cloud, owning complex incidents end‑to‑end: triage, deep‑dive debugging, root‑cause analysis, remediation, and follow‑ups. Strong Linux administration (RHEL primarily, plus Ubuntu) and OpenShift/Kubernetes expertise are essential. To reduce Ops toil, you will build targeted automations (Python, Bash, Ansible) and automate new and existing SOPs used by Operations. You will execute safe deployments and upgrades via GitOps and IaC pipelines (Flux, Ansible, Terraform) on AKS and GKE—coordinating validation and rollback plans—and contribute to the maintenance of existing GitLab CI/CD pipelines together with the DevOps engineering teams. You will design and continuously refine Alertmanager rules and standardize actionable Grafana dashboards with Operations, ensuring effective use of Prometheus metrics and logs (Grafana Alloy, Thanos). Beyond day‑to‑day operations, you’ll apply deep DevOps, CI/CD, and infrastructure automation expertise, drive best practices, share knowledge through workshops and mentoring, write and maintain documentation and SOPs (Standard Operating Procedure), test infrastructure, and collaborate across teams to optimize systems and workflows.
Job Responsibility:
Part of the L3 support team for Operations across Edge/on‑prem and cloud, owning complex incidents end‑to‑end: triage, deep‑dive debugging, root‑cause analysis, remediation, and follow‑ups
Build targeted automations (Python, Bash, Ansible) and automate new and existing SOPs used by Operations
Execute safe deployments and upgrades via GitOps and IaC pipelines (Flux, Ansible, Terraform) on AKS and GKE—coordinating validation and rollback plans
Contribute to the maintenance of existing GitLab CI/CD pipelines together with the DevOps engineering teams
Design and continuously refine Alertmanager rules and standardize actionable Grafana dashboards with Operations
Apply deep DevOps, CI/CD, and infrastructure automation expertise, drive best practices, share knowledge through workshops and mentoring, write and maintain documentation and SOPs, test infrastructure, and collaborate across teams to optimize systems and workflows
Designs and maintains CI/CD pipelines using GitLab CI/CD
Implements Infrastructure as Code (IaC) with tools like Terraform
Oversees advanced CI/CD pipeline setups, including GitOps with Flux CD
Automates complex workflows and enhances infrastructure scalability
Troubleshoots and optimizes Kubernetes cluster operations
Integrates monitoring solutions for observability
Writes and maintains system operations documentation
Keeps up-to-date on best practices and new technologies
Conducts, designs, and executes staging/UAT/production and mass service deployment scenarios
Collaborates on technical architecture and system design
Reproduces and simulates application incidents to create debug reports and coordinate delivery of application fixes
Evaluates existing components or systems to determine integration requirements
Interacts with cross-functional management on high profile technical operations while providing clear feedback and leadership to support teams
Authoring knowledgebase articles and driving internal knowledge sharing
Work in off-routine hours occasionally
Work with customers and travel to international customer or partner locations high-profile
Collaborate with the Ops teams for troubleshooting and solving L3 tickets, create automations to reduce and optimize workload
Work closely with the wider DevOps engineering teams, your manager, developers and QA engineers
Collaborate with the team to ensure the security of our cloud and edge solutions
Requirements:
4+ years in DevOps-related roles with a strong focus on automation
Proficient in DNS, routing, container communication, firewalls, reverse-proxying, load-balancing, edge to cloud communication and troubleshooting
Strong system administration skills are required for deploying and troubleshooting OS level outages and containerized Edge application in customer network
Extensive experience with Azure (or GCP), including fully automated infrastructure and deployment
Experience with monitoring and optimizing cloud costs
Proven experience in implementing and managing CI/CD pipelines (GitLab CI/CD preferred) and excellent knowledge of Git and associated workflows
Proven experience with monitoring, logging, and alerting tools and stacks
Excellent scripting skills in Bash and Python
Advanced knowledge of Kubernetes and Openshift, including cluster management, orchestration and auto-scaling, deployments using Helm charts and GitOps
Proven experience with microservices architecture and related deployment strategies
Expertise with Terraform modules
Deep experience with Ansible, including writing complex playbooks, roles, and using Ansible Vault for secrets management
Strong understanding of DevSecOps principles and experience implementing security best practices within CI/CD pipelines
Strong analytical and problem-solving abilities
Excellent presentation, oral, and written communication skills
Fluent business English is a requirement
A passionate advocate for determining and delivering solutions with a high level of customer satisfaction
Demonstrated interest in learning and a strong desire to expand knowledge
Capable of engaging in technical discussions with stakeholders and leading DevOps projects
Nice to have:
Experience in using Service Mesh solution like Istio
Experience in using Tracing solutions like Grafana Tempo, Jaeger