This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Own the foundations that all services rely on. Build and operate a secure, scalable Kubernetes platform with automated infrastructure, observability, and lifecycle management. Enable product teams to ship faster by reducing platform friction and strengthening engineering reliability.
Job Responsibility:
Own the foundations that all services rely on
Build and operate a secure, scalable Kubernetes platform with automated infrastructure, observability, and lifecycle management
Enable product teams to ship faster by reducing platform friction and strengthening engineering reliability
Ensure a scalable and secure platform foundation is in place and continuously improved, with standardized Kubernetes building blocks (Istio, ingress, secrets management) enabling reliable and consistent service delivery
Ensure infrastructure provisioning is fast, repeatable, and auditable, through reusable Terraform modules supporting both AWS (VPC, IAM, KMS, compute, storage) and on-prem environments (vSphere/VxRail)
Ensure system health and reliability are transparent and actionable, with well-defined SLOs/SLIs, unified metrics, logs, and traces, and alerting that minimizes noise while surfacing real issues early
Support product teams to ship faster with fewer platform-related blockers, supported by clear architectural guidance, hands-on enablement, and improved engineering maturity in Kubernetes, networking, identity, and observability
Ensure platform operations are predictable and low-touch, with automated cluster lifecycle management (bootstrap, upgrades, scaling) that reduces manual effort, risk, and operational toil
Requirements:
5+ years in DevOps/SRE/Platform roles operating production services
Deep hands-on with Kubernetes (install/upgrade, control plane, CNI, ingress, storage) and Linux systems
Understanding of AWS fundamentals (networking, IAM, compute, storage)
Experience with IaC tools like Terraform and comfort working with platform APIs/CLIs
Proficiency in at least one backend language (for example Python or Go) for automation/tooling
Intellectual Firepower: Rapidly comprehends, structures and synthesizes complex information, draws accurate conclusions, and communicates them with clarity
Passion & Work Ethic: Brings sustained motivation, resilience, and high personal standards to every challenge
Ownership & Action: Assumes full accountability for outcomes, acting decisively, and ensuring commitments are delivered
Team Player: Works collaboratively across teams, contributing to shared success, and engaging in constructive debate
Integrity & Growth Mindset: Operates with transparency and humility, learns from setbacks, and actively seeks opportunities to grow
Nice to have:
Experience with Rancher, Istio, External Secrets/Vault, and supply-chain hardening (SBOM)
Operating clusters in restricted or air-gapped environments
FusionAuth/SSO integration and policy-as-code
Background in regulated or mission-critical domains