Senior Software Engineer - Cloud Infrastructure

United Kingdom, London · Job Posted December 07, 2025

Job Description

About the Team: The Cloud Infrastructure Engineering team builds and manages the foundational blocks of the ClickHouse Cloud data plane end-to-end. This includes compute, networking, security, and a multi-cloud, multi-region architecture that provides a reliable and scalable managed ClickHouse experience for ClickHouse Cloud customers. We are looking for highly skilled and experienced cloud infrastructure software engineers to join the team and take responsibility for designing, deploying, and maintaining our infrastructure.

Job Responsibility

  • Architect and build a robust, scalable, and highly available distributed infrastructure
  • Build a cutting-edge cloud-native platform on top of the public cloud, and automate our cloud resource management
  • Work closely with our ClickHouse core database development and security teams, partnering with them to produce the SaaS offering
  • Work on routing and traffic components to improve the reliability and scalability of our cloud service
  • Systematically improve availability by applying industry and distributed systems best practices
  • Design and build security components & tooling: firewall, PKI and certificate infra, zero trust network, etc.
  • Improve performance and cost efficiency of our infrastructure

Requirements

  • 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems
  • Software development experience in Go, C/C++, Java, or another OOP language
  • Experience with cloud technologies such as AWS, Azure, or GCP, including infrastructure-as-code (IaC) tools such as Terraform or CloudFormation
  • Experience developing cloud infrastructure services, preferably with Kubernetes
  • Experience developing cloud-native edge or service mesh services, preferably with Envoy and Istio
  • Experience leading and shipping large-scope technical projects in collaboration with multiple experienced engineers
  • Understanding of network topologies, protocols, and security principles, such as VPNs, firewalls, and load balancers
  • Knowledge of cloud security best practices, including encryption, access controls, and compliance standards like SOC2 and GDPR
  • You have excellent communication skills and the ability to work well within a global team
  • You are a strong problem-solver and have solid production debugging skills
  • You are passionate about efficiency, availability, scalability and data governance
  • You thrive in a fast-paced environment, and see yourself as a partner with the business with the shared goal of moving the business forward
  • You have a high level of responsibility, ownership, and accountability

What we offer

  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Similar Jobs for

Senior Software Engineer - Cloud Infrastructure

8 matching positions

Senior Software Engineer - Cloud Infrastructure

The Cloud Infrastructure Engineering team builds and manages the foundational bl...
Location: Australia
Salary: Not provided
Company: ClickHouse
Expiration Date: Until further notice

Senior Software Engineer - Cloud Infrastructure

About ClickHouse: Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is on...
Location: Singapore
Salary: Not provided
Company: ClickHouse
Expiration Date: Until further notice

Senior Software Engineer - Cloud Infrastructure

The Cloud Infrastructure Engineering team builds and manages the foundational bl...
Location: Netherlands, Amsterdam
Salary: Not provided
Company: ClickHouse
Expiration Date: Until further notice

Senior Software Engineer - Cloud Infrastructure & Observability

Location: India, Bengaluru
Salary: Not provided
Company: Roku
Expiration Date: Until further notice
Requirements
  • 15+ years in software engineering with a track record of architecting distributed systems or platforms at scale
  • Strong hands-on experience in Golang and one scripting language (e.g., Python or Shell)
  • Experience operating observability at PB-scale ingestion and hundreds of millions of series
  • Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
  • Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
  • Strong proficiency with service mesh technologies (Istio/Envoy), infrastructure-as-code (Terraform), and experience in multi-cloud (AWS, GCP)
  • Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
  • Proven experience integrating security as part of infrastructure and platform development
  • Exceptional cross-functional communication
  • Effective collaboration with both technical and non-technical stakeholders
Job Responsibility
  • Architect and lead Roku’s observability platform across metrics, logs, and traces
  • Evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
  • Extend and harden open-source observability systems
  • Overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
  • Implement features such as pre-aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
  • Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
  • Augment and automate CI/CD flows and onboarding
  • Integrate security into infrastructure and platform services
  • Ensure robust multi-tenant, multi-cluster, and multi-cloud designs
  • Contribute improvements back to open source and CNCF-aligned projects
What we offer
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Time off in accordance with local leave policies
Employment Type: Full-time

Senior Software Engineer - Cloud Infrastructure & Observability

We are building a next-generation observability and cloud platform that is high-...
Location: United Kingdom, Cambridge
Salary: Not provided
Company: Roku
Expiration Date: Until further notice
Requirements
  • Extensive software engineering experience with a track record of architecting distributed systems or platforms at scale
  • Strong hands-on experience in Golang and one scripting language (e.g., Python or Shell)
  • Experience operating observability at PB-scale ingestion and hundreds of millions of series
  • Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
  • Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
  • Strong proficiency with service mesh technologies (Istio/Envoy), infrastructure-as-code (Terraform), and experience in multi-cloud (AWS, GCP)
  • Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
  • Proven experience integrating security as part of infrastructure and platform development
  • Exceptional cross-functional communication
  • Effective collaboration with both technical and non-technical stakeholders
Job Responsibility
  • Architect and lead Roku’s observability platform across metrics, logs, and traces
  • Evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
  • Extend and harden open-source observability systems
  • Overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
  • Implement features such as pre-aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
  • Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
  • Augment and automate CI/CD flows and onboarding
  • Integrate security into infrastructure and platform services
  • Ensure robust multi-tenant, multi-cluster, and multi-cloud designs
  • Contribute improvements back to open source and CNCF-aligned projects
What we offer
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Time off work for vacation and other personal reasons
Employment Type: Full-time

Senior Software Engineer, Cloud Infrastructure

The Airwallex Cloud Infrastructure team is a group of highly skilled, innovative...
Location: Singapore, Singapore
Salary: Not provided
Company: Airwallex
Expiration Date: Until further notice
Requirements
  • 5+ years working with large, distributed systems
  • Solid understanding of RESTful API design principles and patterns
  • Deep experience with public cloud (GCP, AWS, or Azure), Infrastructure as Code, and Kubernetes
  • Experience with Istio service mesh
  • Ability to write services in Golang (preferred) or Python
  • Bachelor’s degree (or higher) in Computer Science or similar field
  • Strong security skills and hands-on experience with security best practices
  • Strong communication and collaboration skills
  • For senior software engineers, we expect around 6 years, and at least 5 years, of engineering experience
  • You should be able to demonstrate a more in-depth understanding of project complexity and the design of code systems
Job Responsibility
  • Work on cloud platforms (like AWS, GCP, Azure)
  • Make our systems secure, reliable, and easy for our engineers to use
  • Help us scale—globally!
  • Automate everything
  • Work with awesome teammates in engineering, product, and security
  • Love solving problems and making things run smoothly
Employment Type: Full-time

Senior Principal Software Engineer (Cloud Infrastructure and Platform Engineering)

Your Career At Palo Alto Networks, Secure Cloud and AI infrastructure is the fou...
Location: United States, Santa Clara
Salary: Not provided
Company: Palo Alto Networks
Expiration Date: Until further notice
Requirements
  • BS, MS, or PhD in Computer Science or a related technical field, or equivalent experience
  • 9+ years of relevant software engineering experience, with a proven track record of technical leadership and innovation
  • Demonstrated experience defining and leading large-scale, cross-organizational technical initiatives from concept to completion
  • Experience building and scaling platforms that serve thousands of engineers in complex environments
  • Strong foundation in application and infrastructure security, including secrets management, supply chain security, and secure-by-default platform design
  • Recognized expertise in developer platforms, cloud-native infrastructure, container orchestration technologies (e.g., Kubernetes), and CI/CD
  • Deep proficiency with a major cloud platform (GCP preferred), including IAM, managed databases, networking, and Workload Identity
  • Experience designing and maintaining Infrastructure as Code (e.g., Terraform) at scale, including module architecture and state management
  • Expertise in authentication/authorization systems: OAuth 2.0, OIDC, token lifecycle management, and zero-trust patterns
  • Hands-on experience applying AI/ML/GenAI to solve complex software engineering problems
Job Responsibility
  • Define the Vision: Architect and own the technical roadmap for AI-enhanced developer tools and infrastructure in CIPE at Palo Alto Networks
  • Evaluate and Execute Solutions: Lead the design and implementation of novel systems that leverage Large Language Models (LLMs), static/dynamic analysis, and machine learning to create a world-class, intelligent developer experience
  • Drive Organization-Wide Impact: You are a builder, so you won't stop at ideation. Beyond concepts, ensure your builds show step-change improvements in key engineering metrics, including code velocity, review cycle time, test effectiveness, incident reduction, and overall feature launches
  • Lead Cross-Functional Initiatives: Spearhead complex, cross-functional projects that require influencing and aligning multiple engineering organizations and their leadership
  • Enable Secure Innovation: Develop foundational AI platforms that empower teams to prototype, deploy, and scale threat-intelligent cloud features, embedding Palo Alto Networks' security natively
  • Serve as Technical Authority: Act as the go-to expert on AI-augmented cloud platforms, mentoring senior engineers and infusing industry-leading practices into our high-stakes ecosystem
  • Innovate at Enterprise Scale: Address intricate challenges in multi-cloud environments (AWS, Azure, GCP, and OCI) supporting thousands of microservices, secure workloads, and global threat detection pipelines
What we offer
  • Restricted stock units
  • Bonus
Employment Type: Full-time

Senior Software Engineer - Together Cloud Infrastructure

Together AI is building the AI Acceleration Cloud, an end-to-end platform for th...
Location: United States, San Francisco
Salary: 160,000 - 230,000 USD / year
Company: Together AI
Expiration Date: Until further notice
Requirements
  • 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)
  • 5+ years experience writing high-performance, well-tested, production quality code
  • Demonstrated experience with building and operating high-performance and/or globally distributed micro-service architectures across one or more cloud providers (AWS, Azure, GCP)
  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members
  • Deep experience with Kubernetes internals a big plus, such as implementing non-trivial Kubernetes operators, device/storage/network plugins, custom schedulers, or patches thereon or Kubernetes itself
  • Deep experience with VMs/hypervisors a big plus, such as QEMU/KVM, cloud-hypervisor, VFIO, virtio, PCIe passthrough, KubeVirt, SR-IOV
  • Deep experience with DC networking tech + solutions a big plus, such as VLAN, VXLAN, VPN, VPC, OVS/OVN
  • Experience with Cluster API or similar a big plus
  • Experience working on high-performance compute, networking, and/or storage a big plus
  • Experience virtualizing GPUs and/or InfiniBand a big plus
Job Responsibility
  • Design, build, and maintain performant, secure, and highly available backend services/operators that run in our data centers and automate hardware management, such as InfiniBand partitioning, in-DC parallel storage provisioning, and VM provisioning
  • Design and build out the IaaS software layer for a new GB200 data center with thousands of GPUs
  • Work on a global multi-exabyte high-performance object store, serving massive datasets for pretraining
  • Build advanced observability stacks for our customers with automated node lifecycle management for fault-tolerant distributed pretraining
  • Perform architecture and research work for decentralized AI workloads
  • Work on the core, open-source Together AI platform
  • Create services, tools, and developer documentation
  • Create testing frameworks for robustness and fault-tolerance
What we offer
  • Competitive compensation
  • Startup equity
  • Health insurance
  • Other benefits
  • Flexibility in terms of remote work
Employment Type: Full-time