We’re looking for a Senior Staff Engineer, Online Datastores, to help us build and operate the next generation of reliable, high-performance datastores at Webflow, starting with Apache Druid and expanding to systems like MongoDB and Postgres. In this role you will set standards for availability, scalability, and real-time analytics at global scale.
Job Responsibilities:
Ensure high reliability, uptime, and query performance for Apache Druid clusters on EC2
Lead monitoring, alerting, troubleshooting, and incident response using observability tools such as Datadog and CloudWatch
Manage data lifecycle: tiering strategies (hot, warm, default), retention policies, deep storage (S3), local SSD caches, and Aurora MySQL metadata store
Operate and tune ZooKeeper clusters while ensuring overall cluster stability, coordination, and service discovery
Optimize performance and cost efficiency by right-sizing clusters, scaling instances, and balancing ingestion throughput with query workloads
Define and drive standards for online datastore operations, including multi-region deployments, failover strategies, SLAs/SLOs, and best practices for real-time analytics workloads
Provide technical leadership and mentorship, collaborating with data and product teams to define governance, reliability practices, and long-term strategy across datastores (starting with Druid, and over time MongoDB and Postgres)
Requirements:
Hands-on experience setting up and managing Apache Druid clusters (Broker, Router, Coordinator/Overlord, Historical, MiddleManager), including upgrades, troubleshooting, and tuning
Deep knowledge of Druid operations: indexing, segment management, data partitioning, tiered storage, query optimization, and cluster right-sizing for performance/cost tradeoffs
Proficiency in Java (JDK 8+), Linux/Unix systems, and scripting/automation (Bash, Python) for deployments, maintenance, and performance tuning
Solid understanding of distributed systems concepts such as replication, failover, consensus and coordination (ZooKeeper), and multi-region deployment strategies
Familiarity with the broader data ecosystem: streaming ingestion (Kafka/MSK, Flink CDC), cloud storage (S3) and related data infrastructure
Experience establishing and meeting SLAs/SLOs, and defining standards and best practices for real-time analytics workloads
Experience with observability and incident management: metrics collection, dashboards, and alerting (Datadog, Prometheus, Grafana, CloudWatch, or equivalent)
Knowledge of security and governance practices including authentication, role-based access, encryption, and audit logging
Demonstrated technical leadership: mentoring engineers, driving architecture discussions, and collaborating with data, infra, and product teams
Business-level fluency in reading, writing, and speaking English
What we offer:
Ownership in what you help build. Every permanent Webflower receives equity (RSUs) in our growing, privately held company
Health coverage that actually covers you. Comprehensive medical, dental, and vision plans for full-time employees and their dependents, with Webflow covering most premiums
Support for every stage of family life. 12 weeks of paid parental leave for all parents and 6+ weeks of additional paid leave for birthing parents. Plus inclusive care for family planning, menopause, and midlife transitions
Time off that’s actually off. Flexible vacation, paid holidays, and a sabbatical program to help you recharge and come back inspired
Wellness for the whole you. Access to mental health resources, therapy and coaching
Invest in your future. A 401(k) with 100% employer match (up to $6,000/year) in the U.S., and support for retirement savings globally
Monthly stipends that flex with your life. Localized support for work and wellness expenses — from Wi-Fi to workouts
Bonus for building together. All full-time, permanent, non-commission employees are eligible for our annual WIN bonus program