Senior Engineering Manager - Metrics Platform

Confluent

Location:
India

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We’re not just building better tech. We’re rewriting how data moves and what the world can do with it. With Confluent, data doesn’t sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them. It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together. One Confluent. One Team. One Data Streaming Platform.

About the Metrics Platform Team:

The Confluent Metrics Platform team's mission is to provide a best-in-class observability foundation that enables customers to monitor, analyze, and optimize their real-time data streaming infrastructure at cloud scale. Our charter is to deliver Realtime Metrics and Insights through the Confluent Cloud Metrics API, empowering businesses to make data-driven decisions about their streaming workloads. We are a critical component of Confluent's observability systems, serving as the primary interface through which customers understand the health, performance, and behavior of their Kafka clusters, connectors, ksqlDB applications, and Schema Registry deployments. Our technology powers monitoring dashboards, alerting systems, and capacity planning tools for thousands of customers running mission-critical streaming applications.

About the Role:

As a Senior Manager, Engineering for the Metrics Platform team, you will build, lead, and grow a high-performing engineering organization responsible for one of Confluent's most critical services. This role demands a unique blend of deep technical expertise and strong leadership: you must both drive the strategic vision for a large-scale, real-time analytics platform and execute flawlessly on operational excellence.

Your immediate focus will be on:

  • Scaling for Growth: Leading the technical strategy to scale our metrics infrastructure to handle 10x data volume over the next 2 years
  • API Evolution: Driving the roadmap for new metrics datasets, query capabilities, and integration patterns
  • Operational Excellence: Ensuring 99.99%+ availability, sub-second query performance, and seamless incident response
  • Cross-Team Collaboration: Partnering with multiple teams across Telemetry, Cloud Infrastructure, and Product to deliver end-to-end observability solutions

Job Responsibility:

  • Define and execute the multi-year technical roadmap for the Metrics Platform, including Data infrastructure cluster evolution, data retention strategies, and query optimization
  • Build, mentor, and grow a world-class engineering team
  • Partner with Product Management to define and prioritize the Metrics API roadmap based on customer needs and business impact
  • Align with Confluent's broader observability strategy across Cloud and Platform offerings
  • Establish metrics and KPIs to measure system performance, system reliability, and customer satisfaction

Requirements:

  • 14+ years of overall experience in software development and engineering
  • 4+ years of engineering management experience, leading productive, high-performing teams
  • Experience operating large-scale distributed systems in production environments (preferably cloud-native)
  • Demonstrated ability to hire and retain top engineering talent, provide impactful coaching, and drive high-performance results
  • Proven track record of shipping features consistently and meeting aggressive deadlines with a high degree of urgency
  • Exceptional prioritization skills with the ability to balance short-term execution with a long-term strategic vision for technical evolution
  • Exceptional communication and collaboration skills, with a focus on building a positive, inclusive team culture aligned with organizational goals
  • Solid fundamentals in distributed systems design, replication protocols, and high-availability production operations
  • Deep familiarity with Kafka or similar high-scale event streaming platforms (Pulsar, Flink, etc.) in cloud environments
  • Experience operating complex architectures across large public clouds (AWS, GCP, Azure) or private cloud-native infrastructures
  • Strong engineering background with a hands-on approach to technology and a passion for architectural deep-dives

Nice to have:

  • Direct experience with Apache Druid in production at scale
  • Familiarity with Prometheus, OpenMetrics, or OpenTelemetry ecosystems
  • Experience in SaaS or platform engineering organizations

What we offer:

  • Remote-First Work
  • Robust Insurance Benefits
  • Flexible Time Away
  • The Best Teammates
  • Experience Ambassadors
  • Open and Honest Culture
  • Well-Being and Growth

Additional Information:

Job Posted:
April 11, 2026

Work Type:
Remote work

Similar Jobs for Senior Engineering Manager - Metrics Platform

Senior Engineering Manager, Platform Engineering (Developer Experience)

Everlaw is seeking a Senior Engineering Manager, Platform to lead teams focused ...
Location:
United States, Oakland, California
Salary:
219000.00 - 277000.00 USD / Year
Everlaw
Expiration Date:
Until further notice
Requirements:
  • At least 5 years as a senior engineer building developer productivity tools and/or highly available platform services (e.g., storage, pub-sub, search, caching, observability) and/or deep experience with infrastructure/cloud technologies (e.g., Terraform, Kubernetes, Docker)
  • 3+ years of experience directly managing software engineers and/or technical leads, including hiring, coaching, performance management, and growing a high-performing team
  • 2+ years of experience building and leading developer experience or platform teams/programs that deliver internal platforms and tooling with measurable productivity outcomes (e.g., faster builds/tests, improved CI/CD lead times, higher deployment frequency)
  • Experience managing scalable database infrastructure (e.g., Postgres, MySQL or equivalent)
  • Can communicate at the right altitude with both technical and non-technical stakeholders, and you’ve led cross-functional roadmaps with Engineering Operations, Security Engineering, DevOps, Product, and Design
  • Authorized to work in the United States. Please note that currently, Everlaw is not sponsoring employment visas.
Job Responsibility:
  • Lead platform teams that build and evolve core internal platforms and developer tooling—spanning build/test infrastructure, CI/CD, and developer workflows—to improve engineer productivity and time-to-value
  • Collaborate closely with Engineering Operations, Security Engineering, DevOps, Product, and Design to synthesize requirements and prioritize impactful investments
  • Drive roadmapping, resourcing, and execution for critical platform areas that make it better and cheaper to develop, test, and release software
  • Establish and use developer efficiency metrics (e.g., build/test times, deploy lead time, change failure rate) to identify bottlenecks and plan ambitious improvements to workflows
  • Ensure operational excellence for platform services and tooling with clear SLOs, robust observability, and incident/bug management practices
  • Coach and develop engineers and leads; provide actionable feedback, elevate technical execution, and foster an inclusive, high-accountability culture
  • Partner with Engineering Operations to improve processes for alignment, goal setting, empowerment, and cross-team execution across Engineering
  • Communicate effectively with both technical and non-technical stakeholders, adjusting altitude from strategy to technical deep dives as needed.
What we offer:
  • Medical
  • Dental
  • Wellness program
  • Paid parental leave
  • Professional development
  • Fully stocked kitchen
  • Equity program
  • 401(k) retirement plan with company matching
  • Health, dental, and vision
  • Flexible Spending Accounts for health and dependent care expenses
Job Type: Full-time

Senior Platform Owner - Platform Engineering

Enables self-service of BT product teams through the planning, execution, and de...
Location:
United States
Salary:
111200.00 - 185400.00 USD / Year
Sysco
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s Degree in CS or equivalent degree or 5 years related work experience in a technical field
  • 8+ years experience as a Platform Owner, Developer or similar technical roles with 5+ years of experience in an agile environment
  • Strong understanding of Scrum, Lean, XP, Kanban and other agile development frameworks
  • Proven experience managing design and delivery of highly technical and complex large-scale platforms
  • Sound knowledge of emerging technologies focusing on cloud products, SaaS platforms, or Enterprise Applications
  • Strong platform engineering skills with deep understanding of platform vision and priorities
  • Demonstrated leadership and people management skills and ability to make decisions and set clear priorities
  • Proven ability to recognize value and manage investments to maximize value delivered
  • Strong stakeholder management and negotiation skills
Job Responsibility:
  • Product Vision & Strategy: Collaborate with engineering leaders to establish, promote, and refine the long-term vision for the PaaStry+ ecosystem
  • Own the Gateway Products: Oversee the end-to-end lifecycle for mission-critical gateways
  • Pioneer Agentic AI in Platform Engineering: Drive our AI-powered platform initiative
  • Define the Platform Ecosystem Roadmap: Maintain a unified, prioritized, and actionable backlog for the entire platform
  • Go-to-Market & Adoption: Design and deliver a robust developer adoption strategy for the platform
  • Customer Success & Feedback: Establish ongoing feedback mechanisms to track satisfaction and ensure your roadmap addresses key developer pain points
  • Value Metrics & KPIs: Define and focus on the metrics that matter
  • Evangelism & Communication: Be the public representative and principal evangelist for PaaStry+
Job Type: Full-time

Senior Engineering Manager - AI/ML

Hewlett Packard Enterprise is looking for a Senior Engineering Manager - AI/ML t...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in computer science, engineering, data science, artificial intelligence, machine learning, or closely related quantitative discipline
  • 7-15 years’ experience including 5 or more years of people management experience
  • Advanced Degree (Master’s or Ph.D.) strongly preferred
  • Strong problem-solving and analytical skills, with the ability to identify business opportunities, formulate strategies, and execute projects effectively
  • Excellent communication and presentation skills, with the ability to convey complex technical concepts to technical and non-technical stakeholders
  • Proven ability to manage multiple projects and priorities in a fast-paced environment, ensuring timely delivery and high-quality results
  • Experience with cloud platforms, big data technologies, and distributed computing frameworks is a plus
  • Strong understanding of data privacy, security, and ethical considerations in AI and machine learning
  • Strong technical expertise in AI and machine learning algorithms, models, and tools, with proficiency in programming languages such as Python or R
  • Demonstrated leadership and management skills, with experience in leading and mentoring AI and machine learning professional teams.
Job Responsibility:
  • Develop software algorithms to structure, analyze and leverage structured and unstructured data
  • Use machine learning and statistical modeling techniques to improve product/system performance, data management, quality, and accuracy
  • Apply, optimize, and scale deep learning technologies and algorithms
  • Document procedures for installation and maintenance
  • Perform testing and debugging
  • Define and monitor performance metrics
  • Translate customer requirements and industry trends into AI/ML products and systems improvements
  • Develop and drive the organization’s AI and machine learning strategy
  • Identify new opportunities for AI and machine learning applications
  • Oversee complex AI and machine learning projects from conception to deployment
What we offer:
  • Comprehensive suite of benefits supporting physical, financial, and emotional wellbeing
  • Personal and professional development programs
  • Career growth opportunities
  • Inclusive work environment.
Job Type: Full-time

Senior Manager, Software Engineering (Orchestration Services)

The Data and Storage Services team is responsible for handling all of Affirm’s D...
Location:
United States
Salary:
232000.00 - 310000.00 USD / Year
Affirm
Expiration Date:
Until further notice
Requirements:
  • Solid leadership and interpersonal skills
  • 10+ years of experience in managing multiple diverse and inclusive teams and delivering large cross-functional technical programs
  • Proven track record in stakeholder management, ownership, and successful delivery
  • Expertise in managing large-scale, geographically distributed compute and data processing systems, including data lake solutions and Workflow Orchestration frameworks
  • Expertise in scaling frameworks like Spark, Flink, and Kafka on Kubernetes and cloud providers like AWS, leveraging storage systems such as AWS S3 and Apache Iceberg
  • Capable of mentorship, cross-functional project execution, and individual contribution
  • Strong interpersonal, written, and verbal communication skills with a growth mindset
  • Experience in the data infrastructure domain and a passion for leading technical teams and contributing to Open Source solutions
  • Bachelor’s degree in Computer Science, related technical field, or equivalent practical experience
Job Responsibility:
  • Lead a team of engineers and managers with empathy while fostering a high-performance, ownership-driven & inclusive culture to develop frameworks, systems, and tools for new Affirm products
  • Oversee teams managing storage services including relational, key-value, and analytics storage infrastructure and tools at scale
  • Understand and drive business and engineering metrics, promoting a culture of reliability, security, and productivity
  • Collaborate with tech leads, program managers, and other engineering managers on security, tools, architecture, planning, and delivery of multiple concurrent projects
  • Work across the engineering organization and with internal and external partners
  • Provide leadership and growth opportunities to team members, mentor engineers, recruit, and represent Affirm hiring brands
  • Lead technical decisions, projects, and roadmaps within the Batch and Streaming teams, shaping Affirm’s strategy for managing and serving orchestration workloads
  • Collaborate with peers, leadership, and stakeholders across platform engineering and product engineering organizations
  • In collaboration with tech leads, develop a multi-year roadmap to design and implement frameworks, services, and tools for new Affirm products and business needs
  • Guide, tutor, and aid in the professional growth of junior and senior engineers within the team
What we offer:
  • Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
  • Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
  • Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
  • ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
Job Type: Full-time

Senior Manager, Software Engineering (Orchestration Services)

The Data and Storage Services team is responsible for handling all of Affirm’s D...
Location:
Canada
Salary:
206000.00 - 256000.00 CAD / Year
Affirm
Expiration Date:
Until further notice
Requirements:
  • Solid leadership and interpersonal skills
  • 10+ years of experience in managing multiple diverse and inclusive teams and delivering large cross-functional technical programs
  • Proven track record in stakeholder management, ownership, and successful delivery
  • Expertise in managing large-scale, geographically distributed compute and data processing systems, including data lake solutions and Workflow Orchestration frameworks
  • Expertise in scaling frameworks like Spark, Flink, and Kafka on Kubernetes and cloud providers like AWS, leveraging storage systems such as AWS S3 and Apache Iceberg
  • Capable of mentorship, cross-functional project execution, and individual contribution
  • Strong interpersonal, written, and verbal communication skills with a growth mindset
  • Experience in the data infrastructure domain and a passion for leading technical teams and contributing to Open Source solutions
  • Bachelor’s degree in Computer Science, related technical field, or equivalent practical experience
Job Responsibility:
  • Lead a team of engineers and managers with empathy while fostering a high-performance, ownership-driven & inclusive culture to develop frameworks, systems, and tools for new Affirm products
  • Oversee teams managing storage services including relational, key-value, and analytics storage infrastructure and tools at scale
  • Understand and drive business and engineering metrics, promoting a culture of reliability, security, and productivity
  • Collaborate with tech leads, program managers, and other engineering managers on security, tools, architecture, planning, and delivery of multiple concurrent projects
  • Work across the engineering organization and with internal and external partners
  • Provide leadership and growth opportunities to team members, mentor engineers, recruit, and represent Affirm hiring brands
  • Lead technical decisions, projects, and roadmaps within the Batch and Streaming teams, shaping Affirm’s strategy for managing and serving orchestration workloads
  • Collaborate with peers, leadership, and stakeholders across platform engineering and product engineering organizations
  • In collaboration with tech leads, develop a multi-year roadmap to design and implement frameworks, services, and tools for new Affirm products and business needs
  • Guide, tutor, and aid in the professional growth of junior and senior engineers within the team
What we offer:
  • Health care coverage - Affirm covers all premiums for all levels of coverage for you and your dependents
  • Flexible Spending Wallets - generous stipends for spending on Technology, Food, various Lifestyle needs, and family forming expenses
  • Time off - competitive vacation and holiday schedules allowing you to take time off to rest and recharge
  • ESPP - An employee stock purchase plan enabling you to buy shares of Affirm at a discount
Job Type: Full-time

Senior Data Engineer - Platform Enablement

SoundCloud empowers artists and fans to connect and share through music. Founded...
Location:
United States, New York; Atlanta; East Coast
Salary:
160000.00 - 210000.00 USD / Year
SoundCloud
Expiration Date:
Until further notice
Requirements:
  • 7+ years of experience in data engineering, analytics engineering, or similar roles
  • Expert-level SQL skills, including performance tuning, advanced joins, CTEs, window functions, and analytical query design
  • Proven experience with Apache Airflow (designing DAGs, scheduling, task dependencies, monitoring, Python)
  • Familiarity with event-driven architectures and messaging systems (Pub/Sub, Kafka, etc.)
  • Knowledge of data governance, schema management, and versioning best practices
  • Understanding of observability practices: logging, metrics, tracing, and incident response
  • Experience deploying and managing services in cloud environments, preferably GCP or AWS
  • Excellent communication skills and a collaborative mindset
Job Responsibility:
  • Develop and optimize SQL data models and queries for analytics, reporting, and operational use cases
  • Design and maintain ETL/ELT workflows using Apache Airflow, ensuring reliability, scalability, and data integrity
  • Collaborate with analysts and business teams to translate data needs into efficient, automated data pipelines and datasets
  • Own and enhance data quality and validation processes, ensuring accuracy and completeness of business-critical metrics
  • Build and maintain reporting layers, supporting dashboards and analytics tools (e.g. Looker, or similar)
  • Troubleshoot and tune SQL performance, optimizing queries and data structures for speed and scalability
  • Contribute to data architecture decisions, including schema design, partitioning strategies, and workflow scheduling
  • Mentor junior engineers, advocate for best practices and promote a positive team culture
What we offer:
  • Comprehensive health benefits including medical, dental, and vision plans, as well as mental health resources
  • Robust 401k program
  • Employee Equity Plan
  • Generous professional development allowance
  • Creativity and Wellness benefit
  • Flexible vacation and public holiday policy where you can take up to 35 days of PTO annually
  • 16 paid weeks for all parents (birthing and non-birthing), regardless of gender, to welcome newborns, adopted and foster children
  • Various snacks, goodies, and 2 free lunches weekly when at the office
Job Type: Full-time

Senior Platform Engineer

As a Senior Platform Engineer on Dedrone’s Infrastructure Services team, you wil...
Location:
United Kingdom, London
Salary:
Not provided
Axon
Expiration Date:
Until further notice
Requirements:
  • 3+ years of relevant experience in cloud infrastructure, developer tooling, backend engineering, or platform/DevOps roles
  • Proficiency with modern cloud and automation tooling, including CI/CD pipeline development, AWS services (EC2, ECS/EKS, S3, IAM, CloudWatch), Infrastructure-as-Code (Terraform or AWS CDK), and containerization using Docker and orchestration tools such as ECS, EKS, or Kubernetes
  • Strong engineering fundamentals, including backend development experience (e.g., Java, Go, Python), as well as Linux, Bash, and scripting skills
  • Experience implementing observability practices—including metrics, logging, and tracing
  • A collaborative approach grounded in Axon’s values—showing ownership, candor, customer success, the courage to boldly go, and the ambition to aim far and win right
Job Responsibility:
  • Own, design, and optimize CI/CD pipelines supporting Dedrone’s distributed product ecosystem, reducing build times, deployment friction, and manual overhead
  • Build, automate, and maintain AWS infrastructure using Infrastructure-as-Code (Terraform or AWS CDK), ensuring scalable, secure, and reusable cloud environments
  • Maintain and evolve backend services owned by the team
  • Architect, optimize, and secure Docker images and container workflows; support orchestration environments (such as ECS, EKS, Kubernetes)
  • Strengthen and expand Dedrone’s observability stack (metrics, logging, tracing, and alerting), leveraging tools such as Grafana and CloudWatch
  • Establish and promote engineering best practices across development standards, CI/CD patterns, infrastructure templates, and reusable tooling
  • Partner closely with product engineering teams to understand bottlenecks, reduce toil, and increase overall developer velocity
  • Contribute to security-focused improvements across the platform, including CI/CD, IAM, secrets management, and cloud resource hardening
  • Collaborate with Axon platform, infrastructure, and security teams to align workflows and adopt shared best practices
What we offer:
  • Competitive base salary and RSUs
  • Comprehensive pension plan with matching contribution
  • Private health insurance & cash plans
  • 30 days paid holiday + UK public holidays
  • Enhanced maternity/paternity leave
  • GymPass subscription
  • Life assurance & income protection
  • Career growth support and wellness resources
Job Type: Full-time

Senior ML Platform Engineer

At WHOOP, we're on a mission to unlock human performance and healthspan. WHOOP e...
Location:
United States, Boston
Salary:
150000.00 - 210000.00 USD / Year
Whoop
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s Degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • 5+ years of experience in software engineering with a focus on ML infrastructure, cloud platforms, or MLOps
  • Strong programming skills in Python, with experience in building distributed systems and REST/gRPC APIs
  • Deep knowledge of cloud-native services and infrastructure-as-code (e.g., AWS CDK, Terraform, CloudFormation)
  • Hands-on experience with model deployment platforms such as AWS SageMaker, Vertex AI, or Kubernetes-based serving stacks
  • Proficiency in ML lifecycle tools (MLflow, Weights & Biases, BentoML) and containerization strategies (Docker, Kubernetes)
  • Understanding of data engineering and ingestion pipelines, with ability to interface with data lakes, feature stores, and streaming systems
  • Proven ability to work cross-functionally with Data Science, Data Platform, and Software Engineering teams, influencing decisions and driving alignment
  • Passion for AI and automation to solve real-world problems and improve operational workflows
Job Responsibility:
  • Architect, build, own, and operate scalable ML infrastructure in cloud environments (e.g., AWS), optimizing for speed, observability, cost, and reproducibility
  • Create, support, and maintain core MLOps infrastructure (e.g., MLflow, feature store, experiment tracking, model registry), ensuring reliability, scalability, and long-term sustainability
  • Develop, evolve, and operate MLOps platforms and frameworks that standardize model deployment, versioning, drift detection, and lifecycle management at scale
  • Implement and continuously maintain end-to-end CI/CD pipelines for ML models using orchestration tools (e.g., Prefect, Airflow, Argo Workflows), ensuring robust testing, reproducibility, and traceability
  • Partner closely with Data Science, Sensor Intelligence, and Data Platform teams to operationalize and support model development, deployment, and monitoring workflows
  • Build, manage, and maintain both real-time and batch inference infrastructure, supporting diverse use cases from physiological analytics to personalized feedback loops for WHOOP members
  • Design, implement, and own automated observability tooling (e.g., for model latency, data drift, accuracy degradation), integrating metrics, logging, and alerting with existing platforms
  • Leverage AI-powered tools and automation to reduce operational overhead, enhance developer productivity, and accelerate model release cycles
  • Contribute to and maintain internal platform documentation, SDKs, and training materials, enabling self-service capabilities for model deployment and experimentation
  • Continuously evaluate and integrate emerging technologies and deployment strategies, influencing WHOOP’s roadmap for AI-driven platform efficiency, reliability, and scale
What we offer:
  • Equity
  • Benefits
Job Type: Full-time