Senior Cloud Data Infrastructure Engineer

United States 133450.00 - 197200.00 USD / Year · Job Posted February 07, 2026

Job Description

The Cloud AutoScaling team is dedicated to implementing robust vertical and horizontal auto-scaling capabilities within the ClickHouse cloud environment. We seek exceptional software engineers to develop and maintain the auto-scaling infrastructure to transform ClickHouse into a fully functional serverless database solution. Collaborating closely with the core database team, we are actively working on evolving ClickHouse into a cloud-native database system. Additionally, we engage with other cloud teams to drive continuous improvements in cloud infrastructure for enhanced performance and scalability.

Job Responsibility

  • Build a cutting-edge cloud-native database platform on top of the public cloud
  • Work on autoscaling and our in-house Kubernetes operator to support seamless vertical and horizontal auto-scaling
  • Improve the metrics pipeline and build algorithms to generate better autoscaling statistics and recommendations
  • Work closely with our ClickHouse core development team and other data plane teams, partnering with them to support auto-scaling use cases as well as other internal infrastructure improvements
  • Architect and build a robust, scalable, and highly available distributed infrastructure

Requirements

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Experience building Kubernetes operators with controller-runtime
  • Production experience with programming languages like Go, C++ or Java
  • You are no stranger to PagerDuty on-call rotations and debugging in production, and you are a strong problem-solver
  • Expertise with a public cloud provider (AWS, GCP, Azure) and their infrastructure as a service offering (e.g., EC2)
  • Experience with Data Storage, Ingestion, and Transformation (Spark, Kafka or similar tools)
  • You are passionate about solving data problems at scale
  • You have excellent communication skills and the ability to work well within and across engineering teams

Nice to have

  • Experience with Python (uv, FastAPI) and data science libraries (Pandas, NumPy, etc.)

What we offer

  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Similar Jobs for Senior Cloud Data Infrastructure Engineer

8 matching positions

Senior Cloud Infrastructure Engineer

The purpose of this role is to deliver the Cloud infrastructure in accordance wi...
Location: Portugal, Lisboa
Salary: Not provided
Company: Vodafone
Expiration Date: Until further notice
Requirements
  • Degree qualified
  • At least 5 years professional experience in cloud and virtualization telco workload
  • Expert in data centre infrastructure, cloud computing and virtualization architectures (Telco)
  • Solid understanding of virtualization (Red Hat, VMware) and containerization (Red Hat/Kubernetes)
  • Solid understanding of routing and switching protocols (e.g., BGP, IS-IS, OSPF)
  • Experience with Kubernetes networking (CNIs, Service Mesh, Network Policies, Ingress/Egress)
  • Experience with Network Load Balancers and Firewalls
  • Experience designing/implementing IPv6 solutions
  • Scripting expertise is preferred (PowerCLI, Ansible, PowerShell, Python, …)
  • Process creation and execution
Job Responsibility
  • Responsible for the delivery lifecycle management of assigned solutions within Europe
  • Coordinate suppliers and engage with peers and other stakeholders to ensure the timely delivery of functionality and quality in projects
  • Develop the onboarding process and automation of VNF & CNF delivery
  • Work within the allocated budget and timelines of delivery
  • Define, address, track and report on standard and guideline adherence
  • Translate technical requirements into business language
What we offer
  • Hybrid Work Model - Flexible hybrid work model with 8-10 in-office days per month, managed by team leaders
  • Vodafone Products and Services - Employees get a mobile phone, free communication plan, data card, and various discounts on services and products
  • Recognition - Recognition programs for innovative, creative, high-potential employees and exemplary behaviors
  • Health and Well-being - Well-being Program offers nutrition and psychological consultations, webinars, workshops, and discounts on various services and products
  • Learning - Access to Communities of Practice and a customizable digital training platform with high-quality content (namely Harvard Business Publishing, Skillsoft and Speexx)
  • Local and International Mobility - Internal recruitment with local and international rotation opportunities across departments and roles

Senior Cloud Infrastructure Engineer

Virta Health is on a mission to reverse metabolic disease in one billion people....
Location: United States
Salary: 167249.00 - 216090.00 USD / Year
Company: Virta Health
Expiration Date: Until further notice
Requirements
  • 5+ years of experience in cloud infrastructure using Kubernetes and PostgreSQL
  • Development experience with Go and Python
  • Hands-on experience with query & scripting languages (e.g., Python, Bash, SQL) for automation
  • Proficiency in Infrastructure as Code (IaC) tools, specifically Terraform
  • Strong knowledge of cloud security and compliance practices
  • Familiarity with CI/CD pipelines and containerization technologies
  • Excellent problem-solving and communication skills
Job Responsibility
  • Contribute to the continued design and implementation of a scalable, resilient, and highly available cloud infrastructure platform on Google Cloud
  • Develop and maintain infrastructure automation using Terraform and Kubernetes controllers
  • Support data access patterns across services (read replicas, connection pooling, schema evolution, and safe rollout strategies)
  • Ensure that cloud infrastructure adheres to best practices for security and compliance, including monitoring for security vulnerabilities and implementing necessary measures
  • Own infrastructure patterns for operational data platforms, including backups, replication, migrations, performance tuning, and incident response
  • Maintain lightweight, auto-generated documentation of infrastructure configurations and automation procedures
  • Provide technical support and troubleshoot infrastructure-related issues, both during development and in production environments
What we offer
  • Offers Equity
  • Fulltime

Senior Cloud Engineer, Data Processing

As a Senior Cloud Engineer, Data Processing on the Sustaining Engineering L3 tea...
Location: India, Chennai
Salary: Not provided
Company: Aptiv plc
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree – Computer Science, Computer Engineering, or similar
  • 8+ years Python and/or Golang software development experience
  • Proven ability to analyze and navigate legacy codebases, including independently investigating, debugging, and resolving issues without detailed documentation
  • Demonstrated skill at navigating ambiguity and resolving issues without detailed instructions or oversight
  • Experience interfacing applications with relational and non-relational databases (e.g., MySQL, MongoDB), including CRUD operations and schema design
  • Proficient in Linux environments and shell scripting
  • Deep experience with Kubernetes, AWS, RabbitMQ, MongoDB, and MySQL in production settings
  • Familiarity with debugging tools, performance profiling, and system optimization techniques
  • Strong written and oral communication skills, with the ability to clearly document and explain technical concepts
Job Responsibility
  • Support and enhance Aptiv’s cloud-based data processing systems for automotive datalogger data, including ingest pipelines, ETL workflows, and analytics infrastructure
  • Investigate, root-cause, and resolve production issues across distributed systems, often involving complex legacy code and minimal documentation
  • Collaborate with systems analysts, engineers, and developers to troubleshoot issues, implement improvements, and ensure system reliability and performance
  • Partner with internal stakeholders to develop and execute validation plans that confirm fixes and enhancements meet operational and customer expectations
  • Stay current with evolving cloud technologies, data processing frameworks, and tooling to continuously improve support capabilities and system efficiency
What we offer
  • Higher Education Opportunities (UDACITY, UDEMY, COURSERA are available for your continuous growth and development)
  • Life and accident insurance
  • Sodexo cards for food and beverages
  • Well Being Program that includes regular workshops and networking events
  • EAP Employee Assistance
  • Access to fitness clubs (T&C apply)
  • Creche facility for working parents
  • Fulltime

Senior Cloud & Infrastructure Engineer

Are you an experienced cloud and infrastructure specialist looking for your next...
Location: United Kingdom, Worcestershire
Salary: 65000.00 GBP / Year
Company: Dynamic Search Solutions
Expiration Date: Until further notice
Requirements
  • Strong technical background in Microsoft infrastructure (Windows Server, AD, DNS/DHCP, Exchange, Microsoft 365, Azure)
  • Hands-on experience with virtualisation (VMware or Hyper-V)
  • Good knowledge of storage, networking, and backup technologies
  • Proven success in delivering data centre migrations and transformation projects
  • Awareness of security best practices in Microsoft and hybrid environments
  • Excellent client-facing and communication skills, with the ability to present solutions clearly
  • Previous experience in an MSP or consultancy setting is highly desirable
Job Responsibility
  • Design and implement Microsoft-based infrastructure solutions, including Windows Server, Active Directory, Microsoft 365, and Azure
  • Deliver consultancy and implementation services for virtualisation (VMware/Hyper-V), storage, backup, and disaster recovery
  • Lead migrations and upgrades from legacy platforms to cloud or hybrid environments
  • Assess client infrastructure, highlight risks, and recommend improvements
  • Provide technical input into solution design and pre-sales proposals
  • Collaborate with internal teams to ensure seamless delivery of projects and managed services
  • Mentor junior engineers and act as an escalation point for complex issues
What we offer
  • 25 days holiday plus bank holidays and your birthday off
  • Pension and life insurance
  • Ongoing training, certifications, and career development
  • Exposure to a wide range of clients and industries
  • Hybrid working with opportunities to contribute to both national and international projects
  • A collaborative culture that values work-life balance and professional growth
  • Fulltime

Senior Software Engineer - Cloud Infrastructure & Observability

Location: India, Bengaluru
Salary: Not provided
Company: Roku
Expiration Date: Until further notice
Requirements
  • 15+ years in software engineering with a track record of architecting distributed systems or platforms at scale
  • Strong hands‑on experience in Golang and one scripting language (e.g., Python or Shell)
  • Experience operating observability at PB-scale ingestion and hundreds of millions of series
  • Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
  • Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
  • Strong proficiency with service mesh technologies (Istio/Envoy), infrastructure‑as‑code (Terraform) and experience in multi‑cloud (AWS, GCP)
  • Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
  • Proven experience integrating security as part of infrastructure and platform development
  • Exceptional cross‑functional communication
  • Effective collaboration with both technical and non‑technical stakeholders
Job Responsibility
  • Architect and lead Roku’s observability platform across metrics, logs, and traces
  • Evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
  • Extend and harden open‑source observability systems
  • Overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
  • Implement features such as pre‑aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
  • Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
  • Augment and automate CI/CD flows and onboarding
  • Integrate security into infrastructure and platform services
  • Ensure robust multi‑tenant, multi‑cluster, and multi‑cloud designs
  • Contribute improvements back to open source and CNCF‑aligned projects
What we offer
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Time off in accordance with local leave policies
  • Fulltime

Senior Software Engineer, Data Infrastructure & AI

Fullstory Anywhere is one of Fullstory's three primary product verticals, and it...
Location: United States, Atlanta
Salary: 160000.00 - 170000.00 USD / Year
Company: Fullstory
Expiration Date: Until further notice
Requirements
  • Significant experience building and operating high-throughput data pipelines (batch and/or streaming) in a major cloud platform, including work with cloud data warehouses like BigQuery, Snowflake, or Databricks.
  • Proficiency in Go, Python, Java or a similar language.
  • Hands-on experience with data transformation tooling such as dbt, with a strong understanding of data modeling and pipeline observability.
  • Familiarity with LLM integration patterns and evaluation approaches (e.g., LangSmith, Vertex AI, or comparable frameworks), or demonstrated ability to ramp quickly in applied AI.
  • A track record of owning major system areas end-to-end: driving architectural decisions, maintaining production health, and improving reliability over time.
Job Responsibility
  • Maintain, extend, and scale Go microservices that transform and deliver Fullstory session data into customer warehouses and power the team's MCP server that enables AI agent integrations.
  • Develop and maintain dbt models and pipeline orchestration to ensure timely, fault-tolerant data migrations across hundreds of customer destinations.
  • Define evaluation frameworks for LLM outputs using tools like LangSmith and Vertex AI, ensuring AI-powered customer agents produce accurate, useful results.
  • Investigate and resolve production incidents across the data pipeline, implementing systemic fixes that prevent entire classes of failure from recurring.
  • Write technical design documents that drive consensus on architectural changes, proactively surfacing scaling bottlenecks, edge cases, and cross-team dependencies.
  • Demonstrate sound technical judgment by de-risking work through spikes, taking on tech debt deliberately, and knowing when to escalate versus dig in.
What we offer
  • Flexibility and Connection: flexible PTO policy; annual company-wide closure
  • Benefits: paid parental leave; bereavement leave, including miscarriage/pregnancy loss
  • Learning opportunities: annual learning subsidy
  • Productivity support: monthly productivity stipend
  • Fulltime

Senior Software Engineer - Cloud Infrastructure & Observability

We are building a next-generation observability and cloud platform that is high-...
Location: United Kingdom, Cambridge
Salary: Not provided
Company: Roku
Expiration Date: Until further notice
Requirements
  • Extensive experience with software engineering with a track record of architecting distributed systems or platforms at scale
  • Strong hands-on experience in Golang and one scripting language (e.g., Python or Shell)
  • Experience operating observability at PB-scale ingestion and hundreds of millions of series
  • Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
  • Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
  • Strong proficiency with service mesh technologies (Istio/Envoy), infrastructure-as-code (Terraform) and experience in multi-cloud (AWS, GCP)
  • Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
  • Proven experience integrating security as part of infrastructure and platform development
  • Exceptional cross-functional communication
  • Effective collaboration with both technical and non-technical stakeholders
Job Responsibility
  • Architect and lead Roku’s observability platform across metrics, logs, and traces
  • Evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
  • Extend and harden open-source observability systems
  • Overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
  • Implement features such as pre-aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
  • Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
  • Augment and automate CI/CD flows and onboarding
  • Integrate security into infrastructure and platform services
  • Ensure robust multi-tenant, multi-cluster, and multi-cloud designs
  • Contribute improvements back to open source and CNCF-aligned projects
What we offer
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Time off work for vacation and other personal reasons
  • Fulltime

Senior Azure Cloud Infrastructure Engineer

We are looking for an experienced Senior Azure Cloud Infrastructure Engineer to ...
Location: United States, Minneapolis
Salary: Not provided
Company: Robert Half
Expiration Date: Until further notice
Requirements
  • Extensive experience with Microsoft Azure, including compute, networking, security, identity, and monitoring
  • Proficiency in Infrastructure as Code tools such as Terraform, Bicep, or CloudFormation
  • Strong scripting skills in Python, PowerShell, or Bash
  • Solid understanding of cloud networking, security models, and compliance frameworks
  • Familiarity with CI/CD processes and tools
  • Broad knowledge of full-stack engineering, including cloud infrastructure and application design
  • Experience with automated scaling and cloud services such as AWS or Azure
  • Expertise in Ansible for configuration management and automation
Job Responsibility
  • Design and implement cloud infrastructure solutions using Microsoft Azure, ensuring scalability and security
  • Develop Infrastructure as Code using tools such as Terraform and Bicep to automate deployments and improve efficiency
  • Collaborate with cross-functional teams to integrate cloud services with networking, security, and application systems
  • Create and maintain scripts in Python, PowerShell, or Bash to streamline operational processes
  • Monitor cloud environments and establish governance frameworks to maintain compliance and performance
  • Configure CI/CD pipelines to support continuous integration and delivery
  • Apply knowledge of full-stack engineering to connect cloud systems with data services and application design
  • Analyze and improve cloud networking and security models to enhance infrastructure reliability
  • Provide technical leadership and mentorship to team members, fostering skill development and collaboration
  • Stay updated on emerging cloud technologies and recommend innovative solutions to meet business needs
What we offer
  • Medical
  • Vision
  • Dental
  • Life and disability insurance
  • Company 401(k) plan