This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a Senior DevOps engineer to build and maintain the infrastructure supporting the next-generation blockchain for stablecoins. You will work closely with 4 of our wonderful DevOps engineers, valuing technical ambition and ownership.
Job Responsibility:
Own the reliability roadmap: lead the architectural evolution of our production systems toward higher availability, graceful degradation, and predictable failure recovery
Redesign for resilience: identify brittleness in existing infrastructure and drive systematic improvements, such as chaos engineering, redundancy patterns, and blast-radius reduction
Elevate observability: transform our monitoring from system health to risk awareness, ensuring we catch issues before they escalate to incidents
Strengthen operational rigour: improve our runbooks, incident response protocols, and post-mortem practices to compound reliability gains over time
Mentor and uplift the team: share hard-won lessons from operating at scale, raise the collective bar, and nurture a culture of ownership
Partner across engineering: work with protocol, client, and backend teams to bake reliability in at the design phase
Requirements:
10+ years of operating distributed systems at scale, ideally at companies where uptime was existential: big tech, HFT, exchanges, or blockchain infrastructure
system design depth: architected platforms for fault tolerance
fluent in Kubernetes, Terraform, and cloud infrastructure
led reliability transformations before, taking production systems from 'good enough' to 'financial-grade'
lead through influence, not authority
Nice to have:
Production blockchain experience: running L1/L2 infrastructure, validator operations, or RPC networks at scale
Background in HFT, trading systems, or fintech where latency and uptime directly impact P&L
Experience with formal SRE practices: error budgets, SLOs, capacity planning, and incident management at scale
Familiarity with Rust-based systems, consensus protocols, or indexing infrastructure
Track record of building reliability culture: not just systems, but the practices and mindsets that sustain them
What we offer:
Above market salary plus token compensation
Premium health insurance for you and your family fully covered by Plasma
Monthly wellness budget, whether for the gym, therapy, sauna & massage
A beautiful London HQ with gym access and daily food
All the tools and tech you need to operate at your best
Visa sponsorship and relocation support if you are joining the London office from abroad