CrawlJobs Logo

Cloud Engineer - Product Metrics

clickhouse.com Logo

ClickHouse

Location Icon

Location:
United States

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

141000.00 - 208000.00 USD / Year

Job Description:

The Product Metrics team owns the collection, storage, and serving of metrics collected from customers' ClickHouse instances. As a part of the team you will be responsible for designing, building, operating and maintaining components of the petabyte-scale platform that stores trillions of records and processes millions of new events every second, owning its reliability, performance and availability. Our stack is built using Golang, runs in Kubernetes, and is, of course, stored in ClickHouse. The team’s responsibilities include gathering and processing data for the internal billing and accounting system as well as customer-facing dashboards that provide our customers with immediate insights and analytics.

Job Responsibility:

  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running

Requirements:

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform

Nice to have:

  • Experience with ClickHouse
  • Experience writing Kubernetes operators or controllers
What we offer:
  • Flexible work environment
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – opportunities to engage with colleagues at company-wide offsites

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Cloud Engineer - Product Metrics

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location
Location
United States
Salary
Salary:
141000.00 - 208000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location
Location
Canada
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely within the team to deliver new features, iterate and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be a part of on-call rotation and take ownership of the services you're running
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Morpheus Cloud Support Engineer

As a Morpheus Cloud Support Engineer, you will provide technical assistance and ...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 5 years of proven experience as a Cloud Support Engineer or in a similar position
  • At least 5 years of experience in Morpheus Cloud Management Platform
  • Bachelor’s degree in computer science, Information Technology, or a related field
  • Strong understanding of cloud systems, including VMware, KVM, AWS, and Azure
  • Experience with cloud infrastructure as code (IaC) technologies such as Terraform or CloudFormation
  • Experience with containerization and orchestration systems such as Docker and Kubernetes
  • Excellent problem-solving and troubleshooting abilities
  • Strong communication skills, with the ability to clearly convey technical information to both technical and non-technical stakeholders
  • Hands-on experience in Morpheus Cloud Management Platform
  • Proficiency with Windows Server, Ubuntu, RHEL, HPE VME, Centos
Job Responsibility
Job Responsibility
  • Provide technical assistance with cloud infrastructure and services in Morpheus CMP
  • Monitor and maintain infrastructure systems to guarantee their availability and performance
  • Troubleshoot and address issues with cloud infrastructure
  • Work with the development and operations teams to optimize cloud solutions
  • Assist with the deployment and setup of cloud resources
  • Develop and maintain comprehensive documentation for cloud systems, including architecture diagrams, operational procedures, and troubleshooting guides
  • Analyze cloud system performance metrics and logs to identify trends, forecast needs, and recommend improvements or upgrades
  • Collaborate with Product Managers, Developers, Operations to understand requirements, use cases and transform them into tests
  • Handle P1 situations in Cloud Infra
  • Provide technical and architectural leadership for the infrastructure Engineering teams and Operations roles
What we offer
What we offer
  • Comprehensive suite of benefits for physical, financial, and emotional wellbeing
  • Career development programs
  • Inclusive work culture
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - Cloud Global Services

We are currently hiring a Staff Software Engineer in our Cloud Global Services t...
Location
Location
United States
Salary
Salary:
170000.00 - 260000.00 USD / Year
temporal.io Logo
Temporal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Rich experience as an 'Arranger' and/or 'Builder/Enhancer' in large-scale distributed systems design (reliability, scalability)
  • Operational experience in large-scale distributed systems environments
  • Experience developing highly concurrent systems
  • Demonstrated experience writing concurrent code (pref. Go, Java) in production as Advanced or Expert levels
  • Knowledge and experience of reliability to ensure the high reliability of the Temporal system
  • Ideas and actions to improve the velocity of the team
Job Responsibility
Job Responsibility
  • Design and implement core backend service features for Temporal's high-availability services in-region, cross-region, and re: cross-cloud failover
  • Clearly document design choices and operational knowledge to successfully deploy and run services with those features
  • Provide appropriate service level logs and metrics to make features operational for cloud service setup
  • Provide appropriate alerts, dashboards, and runbooks for production
What we offer
What we offer
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $500 / Year Professional Memberships
  • Fulltime
Read More
Arrow Right

Developer Productivity Engineer

We are looking for a developer productivity engineer to take responsibility for ...
Location
Location
United States
Salary
Salary:
180000.00 - 320000.00 USD / Year
hightouch.com Logo
Hightouch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Engineer with a passion for solving hard technical problems
  • Motivated by high ownership and comfortable in a fast-paced, startup environment
  • Driven significant improvement in the productivity of a 50+ person development team by making high-leverage changes to their build/test/deploy processes
  • Strong development fundamentals and comfortable driving framework-level improvements across multiple teams
  • Can dive into feature code, including complicated backend code, as needed
  • Experience could be from a developer-productivity team/function, organically becoming "the build person" at a fast-growing startup, or personally delivering code that improved cross-team metrics related to developer productivity (e.g., DORA metrics, coverage, etc.)
  • Strong computer science and development fundamentals
Job Responsibility
Job Responsibility
  • Take responsibility for our monorepo and the "path to production" for over 50 engineers
  • Own the build/test/deploy of our software and how each team fits into it
  • Detangle our build/test/deploy patterns so teams can move fast and not block each other
  • Investigate/implement a tool like turbo repo to speed builds and separate concerns
  • Drive excellence in testing
  • Improve top-down and team-level views into test coverage
  • Support an ever growing matrix of data sources, data destinations, and enrichment patterns
  • Help each team understand where they have gaps
  • Support our multi-region and multi-cloud backend
  • Extend it to launch Hightouch in new regions to support data residency requirements
What we offer
What we offer
  • Meaningful equity compensation in the form of ISO options
  • Offer early exercise and a 10 year post-termination exercise window
  • Fulltime
Read More
Arrow Right

Cloud Engineer

We’re looking for a Cloud Engineer to join our growing engineering team. This ro...
Location
Location
Salary
Salary:
Not provided
solvedex.com Logo
Solvedex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exposure to a wide variety of technologies beyond only Node.js or JavaScript frameworks
  • Ability to work across multiple layers of the stack and multiple languages
  • Experience operating real systems in production
  • First-hand handling of outages, critical bugs, incidents, and distributed systems misbehavior
  • Demonstrated ability to design and ship large, complex engineering projects
  • Experience engineering for systems with millions of users or large-scale traffic
  • Takes full responsibility—from ideation to deployment—and pushes projects forward independently
  • Has improved business or engineering metrics in measurable ways (performance, reliability, cost efficiency, etc.)
  • Focuses on delivering business value, not just features
  • Bachelor’s degree or higher in Computer Science, Computer Engineering, or another STEM discipline
Job Responsibility
Job Responsibility
  • Design, build, and operate scalable, resilient, and high-performance cloud-based systems
  • Own projects from conception to delivery, demonstrating strong engineering judgment and accountability
  • Troubleshoot and resolve production issues, outages, and complex distributed system failures
  • Ship large and technically complex projects, providing clear explanations of design decisions, constraints, and architecture
  • Work on systems that support large-scale user bases (1M+ users)
  • Continuously improve engineering and business metrics through measurable technical contributions
  • Collaborate with cross-functional teams to identify opportunities and deliver meaningful business value
  • Contribute to or initiate technical improvements, automation, and tooling for operational excellence
  • Demonstrate ongoing learning, exploring new technologies, and challenging assumptions
  • Fulltime
Read More
Arrow Right

Cloud Scale Test Engineer

As a Cloud Scale Test Engineer, you will ensure the delivery of high performance...
Location
Location
United States , San Jose
Salary
Salary:
90400.00 - 208500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master's degree in computer science, engineering, information systems, or related field
  • 1-4 years of experience
  • Strong Python Coding Skills
  • Passion for AI-driven automation and process optimization
  • Experience working on cloud platforms like AWS/Azure/Google Cloud
  • Experience with Observability platforms like Prometheus, Grafana. Open Search, New Relic etc
  • Knowledge of distributed tracing and debugging in cloud-native environments
  • Proficiency in GIT, Jira, Jenkins, and CI/CD tool
  • Basic networking knowledge
  • Experience in testing containerized applications and Kubernetes-based environments
Job Responsibility
Job Responsibility
  • Design, develop, and maintain robust test automation frameworks for cloud-scale distributed systems
  • Architect performance, load, and stress tests to validate system resilience under high traffic conditions
  • Build fault-injection and chaos engineering strategies to assess the reliability of distributed services
  • Develop and execute end-to-end integration, API, and system-level tests across microservices-based architectures
  • Implement continuous testing pipelines within CI/CD workflows to accelerate deployment cycles
  • Collaborate closely with development, SRE, and infrastructure teams to ensure quality best practices are embedded within the SDLC
  • Analyze system logs, telemetry data, and observability metrics to identify and mitigate potential failures before they impact production
  • Drive automation of security testing, API contract validation, and infrastructure testing
  • Participate in diagnosing critical production issues related to system reliability and performance
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Cloud Security Site Reliability Engineer

This role sits within the Cloud Security team responsible for Private and Public...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 4+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets
  • Experience of k8s and container technologies such as Docker, Openshift and EKS
  • Experience with public cloud technologies such as AWS, GCP or Azure
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation
  • Actively owning production level incidents till resolution.
What we offer
What we offer
  • Equal opportunity employer
  • Accessibility support for persons with disabilities.
  • Fulltime
Read More
Arrow Right