This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two.
Job Responsibility:
Substrate operation — own the Kubernetes cluster plus Keycloak (identity), Vault (secrets), MinIO (object storage), Harbor (registry), Kong (gateway) — from bootstrap to day-2 operations
SLO framework — define, publish, and defend SLOs for every tier-1 service
own error budgets and burn-rate alerting
Incident response — build the on-call rotation, paging, runbook library, and post mortem culture
lead incident command during P1/P2 events
Release operations — co-own the blue-green / canary release model with L6 Delivery
sign off production-bound releases
Air-gap operations — ensure every operational runbook works in a fully offline environment — no assumption of external dependencies
Lead the Platform squad — technically lead 1 Infrastructure Engineer, 1 Observability Engineer, 2 DevOps Engineers
set standards for infra-as-code and automation
Requirements:
Bachelor's degree in computer science or related field
5–8 years in SRE or production-engineering roles running distributed systems at scale
Deep Kubernetes expertise — operators, RBAC, network policy, storage, upgrades
Hands-on with Keycloak / Vault / MinIO / Harbor / Kong or equivalent identity/secrets/storage/registry/gateway stacks
Strong Linux fundamentals and at least one systems language (Go, Rust) or shell/Python for tooling
Proven SLO/SLI authorship and error-budget-driven decision-making
Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Loki, Tempo)
Calm, clear communication during incidents
strong post-mortem writing
Hands-on with infra-as-code — Helm, Kustomize, Terraform
Nice to have:
Prior experience running air-gapped or on-prem platforms for regulated customers
Cilium/Istio service-mesh operation
GitOps delivery with ArgoCD or Flux
FinOps / cost-attribution experience
Certified Kubernetes Administrator (CKA) or equivalent