Job Description:
Your Mission Call of Duty is one of the most iconic and successful video game franchises in the world, delivering unforgettable experiences to millions of players every day. At the heart of that experience is fair play—and that’s where the Ricochet Anti-Cheat team comes in. Our mission is to detect and eliminate cheating quickly and at scale, ensuring every player enjoys a level playing field. As a Senior Data Reliability Engineer, you will play a critical role in building and operating the data systems that power Call of Duty Anti-Cheat, processing petabytes of game telemetry with high reliability, integrity, and trust. What you bring to the table You will design, deploy, and operate large-scale, reliable data systems that support telemetry ingestion, enrichment, and analytics for Anti-Cheat. Working closely with Machine Learning, Security, and Game Engineering teams, you’ll help enable automated data pipelines—from ingest to insight—that directly inform anti-cheat actions in production. This role blends data engineering, reliability engineering, and operational excellence. You’ll define GitOps-based workflows for securely deploying data pipelines and application stacks, build deep observability into everything you own, and ensure the accuracy and validity of data that Anti-Cheat systems depend on. Priorities can often change in a fast-paced environment like ours, so this role includes, but is not limited to, the following responsibilities: Create the ML Data pipeline used for our models including building the ML templates that are used, the observability of our models, the metrics and KPIs used to monitor their efficacy, and the automated retraining required as the data drifts. Design and operate large-scale, highly-available data pipelines and platforms for high-volume game telemetry Ensure the integrity, trustworthiness, and quality of Anti-Cheat data Partner closely with Machine Learning teams to support batch, streaming, online inference workflows, automated testing of ML artifacts, and observability and maintenance of automated deployment pipelines Define and maintain GitOps workflows for secure, automated testing, integration, and deployment Build comprehensive observability (metrics, logs, dashboards, alerts) into data pipelines and services Own operational excellence, including incident response, root-cause analysis, and post-mortems Contribute to deployment and release strategies such as canary, blue/green, and shadow deployments