Our client is looking to fill the role of Senior Distributed Systems Engineer. This role exists to design and harden distributed systems that operate reliably in imperfect conditions. You will build and maintain loosely coupled, resilient systems that function across unreliable, high-latency, or low-bandwidth environments. Success looks like systems that degrade gracefully, recover intelligently, and continue operating when ideal conditions disappear. You are the engineer who ensures that data flows, services coordinate, and distributed components behave predictably, even when the network doesn’t.
Job Responsibilities:
Design for Real-World Networks:
Architect distributed systems that tolerate latency, bandwidth constraints, and intermittent connectivity
Design loosely coupled services that avoid unnecessary cross-system fragility
Implement fault-tolerant communication and state synchronization strategies
Build Resilient Runtime Systems:
Develop systems that prioritize correctness, durability, and recovery
Implement retry logic, backpressure, caching strategies, and eventual consistency patterns
Ensure systems degrade gracefully instead of failing catastrophically
Own Integration Boundaries:
Design and maintain APIs and communication layers between distributed components
Identify and eliminate tight coupling across services
Diagnose and resolve cross-system failures in production environments
Operate Close to the Metal (When Necessary):
Write efficient, production-grade code in Rust or Go
Optimize performance and memory usage in resource-constrained environments
Debug network behavior, concurrency issues, and distributed coordination challenges
Requirements:
Extensive experience building loosely coupled distributed systems
Experience working in environments with low bandwidth, high latency, or unreliable communication links
Understanding of failure modes in distributed architectures and how to mitigate them
Comfort reasoning about consistency models, coordination strategies, and state replication
Experience writing production systems in Rust or Go
A preference for solving systems-level problems over building thin CRUD-style services
Deep care for durability, reliability, and operational behavior
Nice to have:
Experience with edge computing or constrained network environments
Familiarity with message queues, event-driven systems, or streaming architectures
Background in AI/HPC-adjacent distributed systems
Experience debugging production networking issues
Exposure to consensus algorithms or distributed coordination primitives