This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We're looking for a technical OMS Platform Reliability Lead to own the health and stability of our Fluent Commerce Order Management ecosystem. This is a systems engineering role — you'll lead our RUN support team and drive the shift from reactive support to self-healing, automated operations.
Job Responsibility
Design automated Order Replay mechanisms to resolve sync failures without manual intervention
Build observability dashboards (Splunk, Datadog, or New Relic) to monitor GraphQL performance, API latency, and webhook success rates
Serve as the ultimate escalation point for incidents requiring Java debugging or complex GraphQL analysis
Lead Root Cause Analysis across application logs and event-driven architecture
Partner with E-commerce, architecture, and Fluent Commerce engineers on roadmap and platform upgrades
Mentor the RUN team on GraphQL optimization and Java debugging
Requirements
5+ years in OMS Technical Operations or Platform Engineering