CrawlJobs Logo

Cloud Engineer - Product Metrics

United States 141000.00 - 208000.00 USD / Year · Job Posted February 20, 2026
Apply Position
Job Link Share

Job Description

The Product Metrics team owns the collection, storage, and serving of metrics collected from customers' ClickHouse instances. As a part of the team you will be responsible for designing, building, operating and maintaining components of the petabyte-scale platform that stores trillions of records and processes millions of new events every second, owning its reliability, performance and availability. Our stack is built using Golang, runs in Kubernetes, and is, of course, stored in ClickHouse. The team’s responsibilities include gathering and processing data for the internal billing and accounting system as well as customer-facing dashboards that provide our customers with immediate insights and analytics.

Job Responsibility

  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running

Requirements

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform

Nice to have

  • Experience with ClickHouse
  • Experience writing Kubernetes operators or controllers

What we offer

  • Flexible work environment
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – opportunities to engage with colleagues at company-wide offsites

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Cloud Engineer - Product Metrics

8 matching positions

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location
Location
United States
Salary
Salary:
141000.00 - 208000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location
Location
Canada
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right
New

Cloud Engineer

Location
Location
United States , Lehi
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related discipline, or equivalent practical experience
  • 10+ years of hands-on cloud engineering experience, including significant depth in AWS environments
  • Required AWS certification
  • advanced AWS certifications are strongly preferred
  • Strong experience with infrastructure as code and automation tools such as Terraform, CloudFormation, Ansible, or AWS CDK
  • Proficiency with AWS services including EC2, VPC, Auto Scaling, Route 53, and DynamoDB
  • Experience supporting CI/CD and DevOps practices with tools such as GitHub Actions, Azure DevOps, JIRA, or similar platforms
  • Solid scripting skills in languages such as Python or PowerShell for automation and operational tasks
  • Ability to work independently, deliver against defined requirements, and collaborate effectively in technical discussions with senior engineers and architects
Job Responsibility
Job Responsibility
  • Manage and enhance AWS core infrastructure components, including VPCs, Route 53, CDNs, load balancers, and EC2 services, to support secure and highly available operations
  • Build and refine multi-zone disaster recovery and resiliency solutions, and validate recovery readiness through recurring testing and review
  • Respond to production issues with cross-functional teams by diagnosing failures, identifying root causes, and implementing long-term corrective measures
  • Develop and maintain observability practices using tools such as CloudWatch and Datadog, including logging, metrics, dashboards, and alerting
  • Evaluate cloud environments for performance, scalability, and cost improvements, and recommend changes that strengthen operational efficiency
  • Create and support infrastructure automation using Terraform, CloudFormation, or AWS CDK to standardize provisioning and configuration management
  • Improve deployment workflows by contributing to CI/CD pipelines and automation processes using tools such as GitHub Actions, Harness, or similar platforms
  • Maintain clear technical documentation for infrastructure designs, configurations, procedures, and support practices
  • Partner with developers, system administrators, and security stakeholders to align cloud architecture with business and technical requirements
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
Read More
Arrow Right

Azure Cloud Engineer

Location
Location
India , Miracle Heights
Salary
Salary:
Not provided
miraclesoft.com Logo
Miracle Software Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Azure
  • Rubrik
  • Cloud Monitoring
  • ITIL
  • Networking
Job Responsibility
Job Responsibility
  • Maintain Microsoft Azure cloud infrastructure and services to ensure operational stability and system availability
  • Manage and monitor Rubrik backup environments, ensuring successful backup execution, recovery readiness, and compliance with operational standards
  • Perform routine cloud support activities, infrastructure maintenance, and system health checks across production environments
  • Monitor alerts, incidents, dashboards, and infrastructure performance metrics to proactively identify and address issues before service disruption occurs
  • Troubleshoot and resolve Azure infrastructure, networking, storage, and compute-related issues within defined support timelines
  • Participate in North America after-hours operational support, escalation calls, and critical incident response activities
  • Coordinate with internal infrastructure, application, and operations teams to support issue resolution and service restoration efforts
  • Support documentation, escalation records, procedural guidelines, and environment-related knowledge artifacts
  • Follow ITIL processes, governance standards, and cloud operations best practices to support reliable and secure infrastructure management
Read More
Arrow Right

Senior Software Engineer (Cloud ETL & Data)

We are seeking a data-centric Senior Software Engineer to design, build, and evo...
Location
Location
Lithuania , Vilnius; Kaunas
Salary
Salary:
6500.00 EUR / Month
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Graduate or postgraduate degree in Computer Science, Software Engineering, or equivalent experience
  • 7+ years of professional experience in software engineering with exposure to distributed or cloud based systems
  • Strong experience with Azure, microservices, containers, and Kubernetes
  • Hands on experience building ETL pipelines, workflow based systems, or event driven architectures
  • Solid proficiency in an object-oriented language, with a preference for C# .NET
  • Solid understanding of observability, CI/CD, reliability, and cloud operations
  • Strong problem solving skills and the ability to deliver production quality software.
Job Responsibility
Job Responsibility
  • Design and build robust, scalable ETL pipelines for parsing, validating, and transforming diverse engineering data formats
  • Develop and implement strategies for schema management and versioning within data synchronization workflows
  • Architect solutions that guarantee deterministic execution, fault tolerance, and transactional consistency for all data operations
  • Build distributed, event-driven, and task-oriented systems using microservices, messaging, and containerized workloads on Microsoft Azure
  • Implement resilience patterns such as retries, circuit breakers, and rate limiting to ensure high availability
  • Design and implement concurrency control, idempotency, and conflict-resolution patterns in distributed data workflows
  • Build and maintain comprehensive observability, including structured logging, metrics, and distributed tracing
  • Collaborate with architects on high-level design and implementation decisions
  • Mentor junior engineers through code reviews and technical guidance
  • Contribute to shared engineering standards and documentation.
What we offer
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company providing solutions for architecture, engineering, and construction
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing through global initiatives and resource groups
  • A company committed to making a real difference by advancing the world’s infrastructure for better quality of life
  • Training and professional development opportunities (certifications programs, conferences etc.)
  • Additional annual leave days and extra paid days for different occasions
  • Health insurance package and accidents insurance 24/7
  • Referral program with bonuses
  • Extra paid day for volunteering in the organization of your choice
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (Cloud ETL & Data)

We are seeking a data-centric Senior Software Engineer to design, build, and evo...
Location
Location
Lithuania , Vilnius; Kaunas
Salary
Salary:
Not provided
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Graduate or postgraduate degree in Computer Science, Software Engineering, or equivalent experience
  • 7+ years of professional experience in software engineering with exposure to distributed or cloud based systems
  • Strong experience with Azure, microservices, containers, and Kubernetes
  • Hands on experience building ETL pipelines, workflow based systems, or event driven architectures
  • Solid proficiency in an object-oriented language, with a preference for C# .NET
  • Solid understanding of observability, CI/CD, reliability, and cloud operations
  • Strong problem solving skills and the ability to deliver production quality software
Job Responsibility
Job Responsibility
  • Data & ETL Leadership:Design and build robust, scalable ETL pipelines for parsing, validating, and transforming diverse engineering data formats
  • Develop and implement strategies for schema management and versioning within data synchronization workflows
  • Architect solutions that guarantee deterministic execution, fault tolerance, and transactional consistency for all data operations
  • Software & Systems Design:Build distributed, event-driven, and task-oriented systems using microservices, messaging, and containerized workloads on Microsoft Azure
  • Implement resilience patterns such as retries, circuit breakers, and rate limiting to ensure high availability
  • Design and implement concurrency control, idempotency, and conflict-resolution patterns in distributed data workflows
  • Build and maintain comprehensive observability, including structured logging, metrics, and distributed tracing
  • Collaboration & Mentorship:Collaborate with architects on high-level design and implementation decisions
  • Mentor junior engineers through code reviews and technical guidance
  • Contribute to shared engineering standards and documentation
What we offer
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing
  • A company committed to making a real difference by advancing the world’s infrastructure
  • Training and professional development opportunities (certifications programs, conferences etc.)
  • Additional annual leave days and extra paid days for different occasions (marriage, moving day, bereavement leave etc.)
  • Health insurance package and accidents insurance 24/7
  • Referral program with bonuses
  • Extra paid day for volunteering in the organization of your choice
  • Fulltime
Read More
Arrow Right

Cloud Engineer

We are hiring a hands-on AWS Cloud Engineer with strong Linux and networking fun...
Location
Location
United States
Salary
Salary:
Not provided
ewaycorp.com Logo
eWay Corp
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on experience supporting production workloads on AWS
  • Strong knowledge of AWS core services including VPC, EC2, RDS, S3, CloudWatch, CloudFormation, ELB/ALB, Auto Scaling, EBS, EFS, and WAF
  • Solid understanding of AWS networking concepts including subnets, route tables, NAT, DNS, security groups, and NACLs
  • Strong Linux systems administration skills including performance tuning and troubleshooting at OS and network layers
  • Experience with web servers such as Apache and Nginx
  • Working knowledge of MySQL, MariaDB, or similar relational databases
  • Strong troubleshooting capability across infrastructure, application connectivity, and performance issues
  • Familiarity with firewall concepts and traffic filtering
  • Ability to prioritize and manage multiple incidents in parallel
Job Responsibility
Job Responsibility
  • Own L2 support for production AWS environments operating under 24x7x365 SLA commitments
  • Respond to incidents, perform deep troubleshooting, and drive root cause analysis with a focus on permanent resolution
  • Monitor and maintain cloud infrastructure to ensure availability, performance, and cost efficiency
  • Troubleshoot issues across multiple layers including AWS networking, Linux OS, web servers, and database connectivity
  • Support and optimize AWS services including compute, storage, database, and load balancing layers
  • Collaborate with DevOps, application, and security teams to resolve issues and improve system reliability
  • Implement and support backup, disaster recovery, and high availability configurations
  • Analyze logs and metrics using CloudWatch and other tools to proactively identify risks and performance bottlenecks
  • Contribute to infrastructure automation using CloudFormation or similar IaC tools
  • Maintain accurate documentation of environments, incidents, and standard operating procedures
What we offer
What we offer
  • Career Development, Training & certification assistance
  • Medical insurance cover for self, spouse, and children
  • Provident Fund
  • Paid Time off (Maternity, Sick days, Holidays and Earned Leave)
  • Weekends off
  • Flexible work hours and public holidays
  • Loyalty bonus
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - Cloud Infrastructure & Observability

Location
Location
India , Bengaluru
Salary
Salary:
Not provided
roku.com Logo
Roku
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years in software engineering with a track record of architecting distributed systems or platforms at scale
  • Strong hands‑on experience in Golang and one scripting language (e.g., Python or Shell)
  • Experience operating observability at pb-scale ingestion and hundreds of millions of series
  • Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
  • Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
  • strong proficiency with service mesh technologies (Istio/Envoy), infrastructure‑as‑code (Terraform) and experience in multi‑cloud (AWS, GCP)
  • Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
  • Proven experience integrating security as part of infrastructure and platform development
  • Exceptional cross‑functional communication
  • effective collaboration with both technical and non‑technical stakeholders
Job Responsibility
Job Responsibility
  • Architect and lead Roku’s observability platform across metrics, logs, and traces
  • evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
  • Extend and harden open‑source observability systems
  • overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
  • Implement features such as pre‑aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
  • Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
  • augment and automate CI/CD flows and onboarding
  • Integrate security into infrastructure and platform services
  • ensure robust multi‑tenant, multi‑cluster, and multi‑cloud designs
  • Contribute improvements back to open source and CNCF‑aligned projects
What we offer
What we offer
  • Global access to mental health and financial wellness support and resources
  • healthcare (medical, dental, and vision)
  • life, accident, disability, commuter, and retirement options (401(k)/pension)
  • time off in accordance with local leave policies
  • Fulltime
Read More
Arrow Right