Senior Data Reliability Engineer

Canada, Vancouver · Employment contract · Job Posted May 04, 2026

Job Description

Your Mission

Call of Duty is one of the most iconic and successful video game franchises in the world, delivering unforgettable experiences to millions of players every day. At the heart of that experience is fair play, and that’s where the Ricochet Anti-Cheat team comes in. Our mission is to detect and eliminate cheating quickly and at scale, ensuring every player enjoys a level playing field. As a Senior Data Reliability Engineer, you will play a critical role in building and operating the data systems that power Call of Duty Anti-Cheat, processing petabytes of game telemetry with high reliability, integrity, and trust.

What you bring to the table

You will design, deploy, and operate large-scale, reliable data systems that support telemetry ingestion, enrichment, and analytics for Anti-Cheat. Working closely with Machine Learning, Security, and Game Engineering teams, you’ll help enable automated data pipelines, from ingest to insight, that directly inform anti-cheat actions in production.

This role blends data engineering, reliability engineering, and operational excellence. You’ll define GitOps-based workflows for securely deploying data pipelines and application stacks, build deep observability into everything you own, and ensure the accuracy and validity of data that Anti-Cheat systems depend on.

Priorities can often change in a fast-paced environment like ours, so this role includes, but is not limited to, the following responsibilities:

Job Responsibility

  • Create the ML Data pipeline used for our models including building the ML templates that are used, the observability of our models, the metrics and KPIs used to monitor their efficacy, and the automated retraining required as the data drifts
  • Design and operate large-scale, highly-available data pipelines and platforms for high-volume game telemetry
  • Ensure the integrity, trustworthiness, and quality of Anti-Cheat data
  • Partner closely with Machine Learning teams to support batch, streaming, online inference workflows, automated testing of ML artifacts, and observability and maintenance of automated deployment pipelines
  • Define and maintain GitOps workflows for secure, automated testing, integration, and deployment
  • Build comprehensive observability (metrics, logs, dashboards, alerts) into data pipelines and services
  • Own operational excellence, including incident response, root-cause analysis, and post-mortems
  • Contribute to deployment and release strategies such as canary, blue/green, and shadow deployments

Requirements

  • 10+ years of programming experience
  • Extensive experience working in Python; familiarity with Go
  • Strong experience with data technologies such as SQL, Spark, and Airflow
  • Hands-on experience building observability systems using tools like OpenTelemetry, Prometheus, Loki, and Grafana
  • Experience with dashboarding and alerting for production systems
  • Secure automation of testing and deployments using GitHub Actions / Workflows (GitOps)
  • Experience with Linux system administration in production environments
  • Cloud-native deployment experience using Kubernetes, Helm, and ArgoCD
  • Experience supporting model deployments (batch and online APIs); bonus for streaming or near-real-time systems
  • Master’s degree or equivalent professional experience in Data Engineering, Data Science, Computer Science, Mathematics, Statistics, or a related field
  • Familiarity with production release strategies including canary, blue/green, and shadow deployments
  • Ability to communicate clearly with both engineers and non-technical partners, translating ambiguous needs into actionable technical designs

Nice to have

  • Experience coding in Rust
  • Experience with incident response and operational ownership, including leading post-mortems
  • Cloud networking experience in Azure
  • Background in security, anti-abuse, or anti-cheat data systems
  • Production experience with deep learning systems, including model serving, optimization, monitoring, and retraining triggers

What we offer

  • Medical, dental, vision, health savings account or health reimbursement account, healthcare spending accounts, dependent care spending accounts, life and AD&D insurance, disability insurance
  • 401(k) with Company match, tuition reimbursement, charitable donation matching
  • Paid holidays and vacation, paid sick time, floating holidays, compassion and bereavement leaves, parental leave
  • Mental health & wellbeing programs, fitness programs, free and discounted games, and a variety of other voluntary benefit programs like supplemental life & disability, legal service, ID protection, rental insurance, and others
  • If the Company requires that you move geographic locations for the job, then you may also be eligible for relocation assistance

Similar Jobs for Senior Data Reliability Engineer

8 matching positions

Senior Platform Engineer - Data Reliability

The Feedzai Platform Data Reliability play a pivotal role in managing core data ...
Location: Portugal
Salary: Not provided
Feedzai
Expiration Date: Until further notice
Requirements
  • A bachelor's degree in Computer Science, Information Systems, or the equivalent combination of education, experience, and training
  • 4+ years of experience in data reliability, platform engineering, or operating data services at scale
  • Proficiency in programming languages such as Go, Java, or similar
  • Hands-on experience with Container Technologies and Orchestration (e.g., Docker, Kubernetes)
  • Valuable experience with data streaming and messaging platforms, like Kafka, Elasticsearch, RabbitMQ
  • Familiarity with CI/CD pipelines and tools such as Jenkins, Gitlab, or similar
  • Demonstrated commitment to staying updated with industry trends and emerging technologies, showcasing a proactive approach to continuous learning
  • Demonstrated knowledge of best practices in security, ensuring the implementation of secure coding standards
  • Experience working with Cloud Providers, with a preference for AWS Cloud
  • Expertise in utilizing monitoring and observability stacks like Grafana and Prometheus
Job Responsibility
  • Build and maintain Kubernetes Operators, including deployment, monitoring, operations, and analytics tools developed by the team
  • Engage in development tasks using Go, Java, or similar languages
  • Operate services such as Kafka, Elasticsearch, RabbitMQ, Redis, Relational databases and Couchbase at an enterprise scale
  • Contribute to the self-healing capabilities of applications in our enterprise environments
  • Develop playbooks associated with actionable alerts to streamline response procedures
  • Work with AI-assisted development tools (e.g. Cursor) as part of your daily workflow to ship faster and iterate effectively
  • Maintain and enhance our Infrastructure as Code (IaC) to efficiently manage end-to-end lifecycle operations (monitoring, alerting, security, cost optimization, configuration, backup, etc.) in production environments
  • Utilize your experience and problem solving skills to help prevent and investigate production issues

Senior Site Reliability Engineer - Data Pipeline

Bloomreach is building the world’s premier agentic platform for personalization....
Location: Czechia
Salary: Not provided
Bloomreach
Expiration Date: Until further notice
Requirements
  • You can articulate how your contributions have transformed the way engineers work and think by fostering a strong DevOps/SRE culture
  • You can demonstrate how impactful your work as an SRE or DevOps Engineer can be in connection to business success
  • You understand the importance of the "you build it, you run it" principle and you love the feeling of ownership
  • You are mindful of the costs associated with running our service, which translates into effective vertical and horizontal pod autoscaling and detailed telemetry insights
  • You believe infrastructure as code is the only thing that can bring stability to chaos
  • Terraform is your daily bread, and Helm deployments are your second-best friend
  • You use telemetry data and metrics to provide feedback to engineers on how the application and services behave
  • You can navigate complex service architectures using distributed debugging
  • You have experience with Python and a solid grasp of engineering practices
  • You don’t hesitate to participate in a 24/7 on-call support rotation
Job Responsibility
  • You build and maintain an ecosystem where engineers can safely and efficiently develop, debug, and operate their services running in GCP and Kubernetes, using Dataflow, Dataproc, and Python with Go
  • You make sure the services have a high level of observability, enabling us to provide quality service for our customers
  • You ensure services can scale vertically and horizontally based on current load and on operational and telemetry data (OTEL, Prometheus, VictoriaMetrics)
  • You ensure the team has enough insight into the health of our services (Grafana, alerting, PagerDuty)
  • You help the team fulfill the security requirements of ISO and SOC 2 audits by enforcing security principles such as key distribution, key rotation, service-level authorisation and authentication, data encryption in transit, data isolation, resource limitations, quality of service, and audit logs (mainly via Envoy proxies)
  • You contribute to our tooling, so we have tools in place for debugging, troubleshooting, and performance testing
  • You automate manual and semi-manual deployment and instance setup steps
  • You are hands-on with L3 support and incident resolution
  • You ensure CI pipelines have linters, security scans, and code smell detection, enabling engineers to produce quality MRs
What we offer
  • A great deal of freedom and trust
  • We have defined our 5 values and the 10 underlying key behaviors that we strongly believe in
  • We believe in flexible working hours to accommodate your working style
  • We work virtual-first with several Bloomreach Hubs available across three continents
  • We organize company events to experience the global spirit of the company and get excited about what's ahead
  • We encourage and support our employees to engage in volunteering activities - every Bloomreacher can take 5 paid days off to volunteer
  • We have a People Development Program, with personal development workshops on various topics run by experts from inside the company
  • Our resident communication coach Ivo Večeřa is available to help navigate work-related communications & decision-making challenges
  • Our managers are strongly encouraged to participate in the Leader Development Program
  • Bloomreachers utilize the $1,500 professional education budget on an annual basis to purchase education products (books, courses, certifications, etc.)
  • Fulltime

Senior Site Reliability Engineer - Data Pipeline

The Data Pipeline team is a backend-focused engineering team that is built on st...
Location: Slovakia
Salary: 3500.00 EUR / Month
Bloomreach
Expiration Date: Until further notice
Requirements
  • You can articulate how your contributions have transformed the way engineers work and think by fostering a strong DevOps/SRE culture
  • You can demonstrate how impactful your work as an SRE or DevOps Engineer can be in connection to business success
  • You understand the importance of the "you build it, you run it" principle and you love the feeling of ownership
  • You are mindful of the costs associated with running our service, which translates into effective vertical and horizontal pod autoscaling and detailed telemetry insights
  • You believe infrastructure as code is the only thing that can bring stability to chaos
  • Terraform is your daily bread, and Helm deployments are your second-best friend
  • You use telemetry data and metrics to provide feedback to engineers on how the application and services behave
  • You can navigate complex service architectures using distributed debugging
  • You have experience with Python and a solid grasp of engineering practices
  • Experience with Go or with ETL pipelines is a big advantage
Job Responsibility
  • Build and maintain an ecosystem where engineers can safely and efficiently develop, debug, and operate their services running in GCP and Kubernetes, using Dataflow, Dataproc, and Python with Go
  • Make sure the services have a high level of observability, enabling us to provide quality service for our customers
  • Ensure services can scale vertically and horizontally based on current load and on operational and telemetry data (OTEL, Prometheus, VictoriaMetrics)
  • Ensure the team has enough insight into the health of our services (Grafana, alerting, PagerDuty)
  • Help the team fulfill the security requirements of ISO and SOC 2 audits by enforcing security principles such as key distribution, key rotation, service-level authorisation and authentication, data encryption in transit, data isolation, resource limitations, quality of service, and audit logs (mainly via Envoy proxies)
  • Contribute to our tooling, so we have tools in place for debugging, troubleshooting, and performance testing
  • Automate manual and semi-manual deployment and instance setup steps
  • Be hands-on with L3 support and incident resolution
  • Ensure CI pipelines have linters, security scans, and code smell detection, enabling engineers to produce quality MRs
What we offer
  • A great deal of freedom and trust
  • Flexible working hours
  • Work virtual-first with several Bloomreach Hubs available across three continents
  • Company events
  • 5 paid days off to volunteer
  • People Development Program
  • Communication coach available
  • Leader Development Program
  • $1,500 professional education budget annually
  • Employee Assistance Program with counselors
  • Fulltime

Senior Data Engineer - Data Platform

We are looking for a Senior Data Engineer - Data Platform to join our Data & AI ...
Location: France, Paris
Salary: Not provided
Doctolib
Expiration Date: Until further notice
Requirements
  • More than 7 years of experience as Site Reliability Engineer, Data Ops, Data Platform Engineer or in a similar role, with a proven track record of building and maintaining complex data infrastructures
  • Strong proficiency in data engineering and infrastructure tools and technologies, such as stream and events processing (Kafka, PubSub, Firehose) and Kubernetes
  • Expertise in programming languages like Python
  • Familiarity with cloud infrastructure and services, preferably AWS, Azure, or GCP, and experience with infrastructure-as-code tools such as Terraform
  • Excellent problem-solving skills with a focus on identifying and resolving data infrastructure bottlenecks and performance issues
Job Responsibility
  • Design and implement a scalable and reliable data infrastructure that supports the collection, processing, storage, and analysis of large-scale datasets while pushing security and privacy best practices
  • Build and maintain data pipelines that efficiently extract, transform, and load data from various sources into our data warehouse
  • Implement automation and orchestration tools to streamline infrastructure provisioning, data workflows, reduce manual effort, and improve operational efficiency
  • Monitor data platform for performance and reliability, identify and troubleshoot issues, and implement proactive solutions to ensure data quality and availability
  • Streamline and monitor platform costs, identify optimizations and saving opportunities while collaborating with data engineers, data scientists, and other stakeholders
What we offer
  • Free comprehensive health insurance for you and your children
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy
  • Up to 14 days of RTT
  • A subsidy from the work council to refund part of the membership to a sport club or a creative class
  • Lunch voucher with Swile card
  • Fulltime

Senior Data Engineer – Data Engineering & AI Platforms

We are looking for a highly skilled Senior Data Engineer (L2) who can design, bu...
Location: India, Chennai, Madurai, Coimbatore
Salary: Not provided
OptiSol Business Solutions
Expiration Date: Until further notice
Requirements
  • Strong hands-on expertise in cloud ecosystems (Azure / AWS / GCP)
  • Excellent Python programming skills with data engineering libraries and frameworks
  • Advanced SQL capabilities including window functions, CTEs, and performance tuning
  • Solid understanding of distributed processing using Spark/PySpark
  • Experience designing and implementing scalable ETL/ELT workflows
  • Good understanding of data modeling concepts (dimensional, star, snowflake)
  • Familiarity with GenAI/LLM-based integration for data workflows
  • Experience working with Git, CI/CD, and Agile delivery frameworks
  • Strong communication skills for interacting with clients, stakeholders, and internal teams
Job Responsibility
  • Design, build, and maintain scalable ETL/ELT pipelines across cloud and big data platforms
  • Contribute to architectural discussions by translating business needs into data solutions spanning ingestion, transformation, and consumption layers
  • Work closely with solutioning and pre-sales teams for technical evaluations and client-facing discussions
  • Lead squads of L0/L1 engineers—ensuring delivery quality, mentoring, and guiding career growth
  • Develop cloud-native data engineering solutions using Python, SQL, PySpark, and modern data frameworks
  • Ensure data reliability, performance, and maintainability across the pipeline lifecycle—from development to deployment
  • Support long-term ODC/T&M projects by demonstrating expertise during technical discussions and interviews
  • Integrate emerging GenAI tools where applicable to enhance data enrichment, automation, and transformations
What we offer
  • Opportunity to work at the intersection of Data Engineering, Cloud, and Generative AI
  • Hands-on exposure to modern data stacks and emerging AI technologies
  • Collaboration with experts across Data, AI/ML, and cloud practices
  • Access to structured learning, certifications, and leadership mentoring
  • Competitive compensation with fast-track career growth and visibility
  • Fulltime

Senior Data Engineer / Data Analyst

N-iX is a global software development service company. Our customer is the Europ...
Location: Ukraine
Salary: Not provided
N-iX
Expiration Date: Until further notice
Requirements
  • 4+ years of experience in analytics, sales operations, revenue insights, or data-driven consulting
  • Strong SQL skills (joins, window functions, performance tuning)
  • Production experience with Apache Airflow
  • Python skills for pipeline development
  • Ability to build and maintain database views (PostgreSQL, Snowflake, Redshift)
  • Solid ETL/ELT understanding
  • Clear communication with non-technical stakeholders
  • GitHub experience (branches, PRs, reviews)
  • English level - at least Upper-Intermediate, both spoken and written
Job Responsibility
  • Work with Salesforce objects and relationships to ensure correct ingestion, transformation, and integration into our reporting environment
  • Design, build, and maintain data pipelines using Apache Airflow (DAG creation, scheduling, monitoring, troubleshooting)
  • Write, optimize, and maintain SQL queries, stored procedures, and functions for data transformation and extraction
  • Create and manage database views to support analytics, reporting, and downstream applications
  • Ensure data quality, consistency, and reliability across pipelines and views (validation checks, monitoring)
  • Support QuickSuite dataset adjustments (new fields, logic changes, view extensions)
  • Document data flows, data models, and pipeline logic for long-term maintainability and handover
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

Senior Data Engineer Lead / Architect - Senior Vice President

At Citi Services - Global Trade Technology Organization, we are on a mission to ...
Location: India (Pune, Maharashtra; Chennai, Tamil Nadu)
Salary: Not provided
Citi
Expiration Date: Until further notice
Requirements
  • 10+ years of professional experience in data engineering, with a proven track record of designing and building large-scale data systems
  • 3+ years in a technical leadership or architect role, with experience mentoring junior and senior engineers
  • Expert-level proficiency in at least one programming language (Python or Scala preferred) and exceptional SQL skills
  • Proven hands-on experience with Python or Scala for data manipulation, scripting, machine learning, and backend development
  • Deep, hands-on experience with a major cloud platform (AWS, GCP, or Azure) and its data ecosystem (e.g., S3/GCS, Redshift/BigQuery, EMR/Dataproc, Kinesis/Dataflow)
  • Extensive hands-on experience with modern big data technologies and Data streaming (like Hadoop, Hive, Impala, Apache Spark, Kafka, or Flink)
  • Proficiency with workflow orchestration tools such as Airflow, Dagster, or Prefect
  • Proficiency in designing and implementing microservices architectures, RESTful APIs, and event-driven systems with 'Data as a Product' Principle
  • Solid understanding of data modeling concepts and database design for both analytical (OLAP) and transactional (OLTP) workloads
  • Deep understanding and hands-on experience with relational databases (e.g., PostgreSQL, Oracle), NoSQL databases (e.g., MongoDB, Cassandra), data warehousing, and big data technologies (e.g., Spark, Kafka)
Job Responsibility
  • Architect & Design: Design, architect, and oversee the development of robust, scalable, and reliable data infrastructure, including data lakes, data warehouses, and real-time streaming platforms on the cloud
  • Build & Code: Act as a senior individual contributor and hands-on technical leader. Write clean, maintainable, and high-performance code for data ingestion, transformation, and serving layers (e.g., using Python, Scala, SQL, and Spark)
  • Lead & Mentor: Lead a team of data engineers, providing technical guidance, mentorship, and career development support. Foster a collaborative and inclusive team environment
  • Champion Culture: Define, document, and champion data engineering best practices across the organization, including CI/CD, data quality, testing frameworks, observability, and code review standards
  • Drive Strategy: Partner with leadership, product managers, data scientists, and analysts to understand data needs and develop a long-term data strategy and roadmap
  • Innovate & Evaluate: Stay at the forefront of data engineering technologies. Evaluate, prototype, and recommend new tools and frameworks to continuously improve our data platform
  • Ensure Governance: Implement and enforce robust data governance, security, and privacy policies in partnership with our security and compliance teams
  • Fulltime

Software engineer 2 / Senior Software engineer - Azure Data

Microsoft's Azure Data engineering team is leading the transformation of analyti...
Location: India, Bangalore
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
  • Experience with the Azure stack including Storage, Compute, Networking, Fabric, Purview, Synapse, AKS, DevOps, Data Factory, or Power BI
  • Experience with big data technologies such as Spark, Kafka, Hadoop, or HBase
  • Experience building data lake or data engineering products, tools, or pipelines
  • Familiarity with container-based architectures (Docker, Kubernetes)
  • Ability to debug complex distributed systems on Linux and/or Windows platforms
Job Responsibility
  • Write extensible, maintainable code in C#, Java, Scala, or Python for Fabric Materialized Lake View services and HDInsight components
  • Use AI tools and coding best practices across the development lifecycle
  • Design data refresh, scheduling, and query optimisation features with minimal supervision
  • Review code from teammates for correctness, test coverage, security risks, and adherence to team standards
  • Coach junior engineers through code reviews
  • Debug complex issues in distributed systems running on Azure, Linux, and Windows
  • Run live site operations on a rotational, on-call basis
  • Integrate logging and instrumentation to gather telemetry on system health, performance, reliability, and security
  • Work with product managers, technical leads, and partners across geographies to define customer requirements for Materialized Lake View features
  • Fulltime