Cloud Engineer - Product Metrics

United States · 141000.00 - 208000.00 USD / Year · Job Posted February 20, 2026

Job Description

The Product Metrics team owns the collection, storage, and serving of metrics collected from customers' ClickHouse instances. As part of the team, you will be responsible for designing, building, operating, and maintaining components of a petabyte-scale platform that stores trillions of records and processes millions of new events every second, and you will own its reliability, performance, and availability. Our stack is built using Golang, runs in Kubernetes, and, of course, stores its data in ClickHouse. The team’s responsibilities include gathering and processing data for the internal billing and accounting system, as well as for customer-facing dashboards that give our customers immediate insights and analytics.

Job Responsibility

  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely with the team to deliver new features, then iterate on and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be part of the on-call rotation and take ownership of the services you run

Requirements

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform

Nice to have

  • Experience with ClickHouse
  • Experience writing Kubernetes operators or controllers

What we offer

  • Flexible work environment
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – opportunities to engage with colleagues at company-wide offsites

Similar Jobs for Cloud Engineer - Product Metrics

8 matching positions

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location: United States
Salary: 141000.00 - 208000.00 USD / Year
Company: ClickHouse (clickhouse.com)
Expiration Date: Until further notice
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely with the team to deliver new features, then iterate on and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be part of the on-call rotation and take ownership of the services you run
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Employment type: Full-time

Senior Cloud Engineer - Product Metrics

The Product Metrics team owns the collection, storage, and serving of metrics co...
Location: Canada
Salary: Not provided
Company: ClickHouse (clickhouse.com)
Expiration Date: Until further notice
Requirements
  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • 2+ years of software application development experience using Golang
  • Experience with at least one of the major Cloud Service Providers such as AWS, GCP or Azure
  • Experience with storing, shipping, and retrieving large volumes of data efficiently using technologies such as ClickHouse
  • Experience with technologies such as Kubernetes, Helm, ArgoCD, Temporal as well as infrastructure-as-code tools such as Terraform
Job Responsibility
  • Take an active part in determining the roadmap for the Product Metrics team
  • Work closely with the team to deliver new features, then iterate on and improve them
  • Design, build, operate, and maintain business-critical petabyte-scale systems
  • Be responsible for the performance, reliability, availability and cost-efficiency of the Product Metrics systems
  • Mentor and support other team members, participate in design discussions and collaborate with the team
  • Be part of the on-call rotation and take ownership of the services you run
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Senior Software Engineer, Cloud Development

The AI Platform team is responsible for building the foundational infrastructure...
Location: Canada
Salary: Not provided
Company: Mozilla (mozilla.org)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree with 4–6 years of relevant industry experience, or Master's degree with significant hands-on experience building and operating production systems, or equivalent work experience
  • Strong, modern Python skills, with experience writing clean, maintainable code and working with a fast toolchain (dependency management, linting, formatting, type checks, pre-commit), building both libraries and CLIs that output structured data
  • Advanced experience with database deployment and management; bonus points for familiarity with Postgres
  • Proven experience deploying and operating workloads in cloud environments, including production-grade infrastructure on GCP and GKE (artifact registries, managed caches, networking and internal load balancing, VPC, DNS, and separation of nonprod and prod)
  • Hands-on experience with Kubernetes and Helm, writing charts that deploy across environments with per-environment configuration and progressive feature rollout
  • Experience with Terraform for provisioning infrastructure across environments, including schema validation and PR-level plan review
  • Experience designing and running scalable APIs that hold up under load, including health and readiness checks, auth, and clean startup and shutdown
  • Experience with Grafana or similar tools for metrics, dashboards, and reading application and infrastructure health together during rollouts
  • Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems
  • Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams
Job Responsibility
  • Design, build, and operate core platform services and APIs used to deploy and serve production workloads at scale
  • Own service reliability end-to-end, driving improvements in availability, scalability, performance, and operational excellence
  • Lead efforts to optimize backend services for throughput, latency, and cost efficiency across distributed infrastructure
  • Design and manage Kubernetes-based workloads, including GitOps deployment pipelines, environment configuration, and resource utilization optimization
  • Own and improve critical parts of the service lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of backend services and pipelines
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable new product features
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews
What we offer
  • Generous performance-based bonus plans to all eligible employees
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting
  • Quarterly all-company wellness days
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
Employment type: Full-time

Cloud Engineer

Location: United States, Lehi
Salary: Not provided
Company: Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related discipline, or equivalent practical experience
  • 10+ years of hands-on cloud engineering experience, including significant depth in AWS environments
  • AWS certification required; advanced AWS certifications are strongly preferred
  • Strong experience with infrastructure as code and automation tools such as Terraform, CloudFormation, Ansible, or AWS CDK
  • Proficiency with AWS services including EC2, VPC, Auto Scaling, Route 53, and DynamoDB
  • Experience supporting CI/CD and DevOps practices with tools such as GitHub Actions, Azure DevOps, JIRA, or similar platforms
  • Solid scripting skills in languages such as Python or PowerShell for automation and operational tasks
  • Ability to work independently, deliver against defined requirements, and collaborate effectively in technical discussions with senior engineers and architects
Job Responsibility
  • Manage and enhance AWS core infrastructure components, including VPCs, Route 53, CDNs, load balancers, and EC2 services, to support secure and highly available operations
  • Build and refine multi-zone disaster recovery and resiliency solutions, and validate recovery readiness through recurring testing and review
  • Respond to production issues with cross-functional teams by diagnosing failures, identifying root causes, and implementing long-term corrective measures
  • Develop and maintain observability practices using tools such as CloudWatch and Datadog, including logging, metrics, dashboards, and alerting
  • Evaluate cloud environments for performance, scalability, and cost improvements, and recommend changes that strengthen operational efficiency
  • Create and support infrastructure automation using Terraform, CloudFormation, or AWS CDK to standardize provisioning and configuration management
  • Improve deployment workflows by contributing to CI/CD pipelines and automation processes using tools such as GitHub Actions, Harness, or similar platforms
  • Maintain clear technical documentation for infrastructure designs, configurations, procedures, and support practices
  • Partner with developers, system administrators, and security stakeholders to align cloud architecture with business and technical requirements
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan

Azure Cloud Engineer

Location: India, Miracle Heights
Salary: Not provided
Company: Miracle Software Systems (miraclesoft.com)
Expiration Date: Until further notice
Requirements
  • Azure
  • Rubrik
  • Cloud Monitoring
  • ITIL
  • Networking
Job Responsibility
  • Maintain Microsoft Azure cloud infrastructure and services to ensure operational stability and system availability
  • Manage and monitor Rubrik backup environments, ensuring successful backup execution, recovery readiness, and compliance with operational standards
  • Perform routine cloud support activities, infrastructure maintenance, and system health checks across production environments
  • Monitor alerts, incidents, dashboards, and infrastructure performance metrics to proactively identify and address issues before service disruption occurs
  • Troubleshoot and resolve Azure infrastructure, networking, storage, and compute-related issues within defined support timelines
  • Participate in North America after-hours operational support, escalation calls, and critical incident response activities
  • Coordinate with internal infrastructure, application, and operations teams to support issue resolution and service restoration efforts
  • Support documentation, escalation records, procedural guidelines, and environment-related knowledge artifacts
  • Follow ITIL processes, governance standards, and cloud operations best practices to support reliable and secure infrastructure management

Senior Software Engineer (Cloud ETL & Data)

We are seeking a data-centric Senior Software Engineer to design, build, and evo...
Location: Lithuania, Vilnius; Kaunas
Salary: 6500.00 EUR / Month
Company: Bentley Systems (bentley.com)
Expiration Date: Until further notice
Requirements
  • Graduate or postgraduate degree in Computer Science, Software Engineering, or equivalent experience
  • 7+ years of professional experience in software engineering with exposure to distributed or cloud-based systems
  • Strong experience with Azure, microservices, containers, and Kubernetes
  • Hands-on experience building ETL pipelines, workflow-based systems, or event-driven architectures
  • Solid proficiency in an object-oriented language, with a preference for C# .NET
  • Solid understanding of observability, CI/CD, reliability, and cloud operations
  • Strong problem-solving skills and the ability to deliver production-quality software
Job Responsibility
  • Design and build robust, scalable ETL pipelines for parsing, validating, and transforming diverse engineering data formats
  • Develop and implement strategies for schema management and versioning within data synchronization workflows
  • Architect solutions that guarantee deterministic execution, fault tolerance, and transactional consistency for all data operations
  • Build distributed, event-driven, and task-oriented systems using microservices, messaging, and containerized workloads on Microsoft Azure
  • Implement resilience patterns such as retries, circuit breakers, and rate limiting to ensure high availability
  • Design and implement concurrency control, idempotency, and conflict-resolution patterns in distributed data workflows
  • Build and maintain comprehensive observability, including structured logging, metrics, and distributed tracing
  • Collaborate with architects on high-level design and implementation decisions
  • Mentor junior engineers through code reviews and technical guidance
  • Contribute to shared engineering standards and documentation.
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company providing solutions for architecture, engineering, and construction
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing through global initiatives and resource groups
  • A company committed to making a real difference by advancing the world’s infrastructure for better quality of life
  • Training and professional development opportunities (certification programs, conferences, etc.)
  • Additional annual leave days and extra paid days for different occasions
  • Health insurance package and 24/7 accident insurance
  • Referral program with bonuses
  • Extra paid day for volunteering in the organization of your choice
Employment type: Full-time

Senior Software Engineer (Cloud ETL & Data)

We are seeking a data-centric Senior Software Engineer to design, build, and evo...
Location: Lithuania, Vilnius; Kaunas
Salary: Not provided
Company: Bentley Systems (bentley.com)
Expiration Date: Until further notice
Requirements
  • Graduate or postgraduate degree in Computer Science, Software Engineering, or equivalent experience
  • 7+ years of professional experience in software engineering with exposure to distributed or cloud-based systems
  • Strong experience with Azure, microservices, containers, and Kubernetes
  • Hands-on experience building ETL pipelines, workflow-based systems, or event-driven architectures
  • Solid proficiency in an object-oriented language, with a preference for C# .NET
  • Solid understanding of observability, CI/CD, reliability, and cloud operations
  • Strong problem-solving skills and the ability to deliver production-quality software
Job Responsibility
  • Data & ETL Leadership: Design and build robust, scalable ETL pipelines for parsing, validating, and transforming diverse engineering data formats
  • Develop and implement strategies for schema management and versioning within data synchronization workflows
  • Architect solutions that guarantee deterministic execution, fault tolerance, and transactional consistency for all data operations
  • Software & Systems Design: Build distributed, event-driven, and task-oriented systems using microservices, messaging, and containerized workloads on Microsoft Azure
  • Implement resilience patterns such as retries, circuit breakers, and rate limiting to ensure high availability
  • Design and implement concurrency control, idempotency, and conflict-resolution patterns in distributed data workflows
  • Build and maintain comprehensive observability, including structured logging, metrics, and distributed tracing
  • Collaboration & Mentorship: Collaborate with architects on high-level design and implementation decisions
  • Mentor junior engineers through code reviews and technical guidance
  • Contribute to shared engineering standards and documentation
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing
  • A company committed to making a real difference by advancing the world’s infrastructure
  • Training and professional development opportunities (certification programs, conferences, etc.)
  • Additional annual leave days and extra paid days for different occasions (marriage, moving day, bereavement leave, etc.)
  • Health insurance package and 24/7 accident insurance
  • Referral program with bonuses
  • Extra paid day for volunteering in the organization of your choice
Employment type: Full-time

Cloud Engineer

We are hiring a hands-on AWS Cloud Engineer with strong Linux and networking fun...
Location: United States
Salary: Not provided
Company: eWay Corp (ewaycorp.com)
Expiration Date: Until further notice
Requirements
  • 5+ years of hands-on experience supporting production workloads on AWS
  • Strong knowledge of AWS core services including VPC, EC2, RDS, S3, CloudWatch, CloudFormation, ELB/ALB, Auto Scaling, EBS, EFS, and WAF
  • Solid understanding of AWS networking concepts including subnets, route tables, NAT, DNS, security groups, and NACLs
  • Strong Linux systems administration skills including performance tuning and troubleshooting at OS and network layers
  • Experience with web servers such as Apache and Nginx
  • Working knowledge of MySQL, MariaDB, or similar relational databases
  • Strong troubleshooting capability across infrastructure, application connectivity, and performance issues
  • Familiarity with firewall concepts and traffic filtering
  • Ability to prioritize and manage multiple incidents in parallel
Job Responsibility
  • Own L2 support for production AWS environments operating under 24x7x365 SLA commitments
  • Respond to incidents, perform deep troubleshooting, and drive root cause analysis with a focus on permanent resolution
  • Monitor and maintain cloud infrastructure to ensure availability, performance, and cost efficiency
  • Troubleshoot issues across multiple layers including AWS networking, Linux OS, web servers, and database connectivity
  • Support and optimize AWS services including compute, storage, database, and load balancing layers
  • Collaborate with DevOps, application, and security teams to resolve issues and improve system reliability
  • Implement and support backup, disaster recovery, and high availability configurations
  • Analyze logs and metrics using CloudWatch and other tools to proactively identify risks and performance bottlenecks
  • Contribute to infrastructure automation using CloudFormation or similar IaC tools
  • Maintain accurate documentation of environments, incidents, and standard operating procedures
What we offer
  • Career Development, Training & certification assistance
  • Medical insurance cover for self, spouse, and children
  • Provident Fund
  • Paid Time off (Maternity, Sick days, Holidays and Earned Leave)
  • Weekends off
  • Flexible work hours and public holidays
  • Loyalty bonus
Employment type: Full-time