We’re seeking a seasoned Cloud Infrastructure Engineer with deep expertise in automation, infrastructure-as-code (IaC), and cloud platform management. You’ll design, deploy, and maintain robust cloud environments while collaborating with cross-functional teams to streamline CI/CD pipelines, enhance system reliability, and drive operational excellence.
Job Responsibilities:
Design & Build Cloud Infrastructure: Architect and manage secure, scalable cloud environments (AWS, Azure, GCP) using IaC tools like Terraform and CloudFormation
Automate Everything: Develop and maintain automation scripts to streamline deployments, monitoring, and system operations
Systems Reliability: Implement monitoring/alerting solutions (Prometheus, Grafana, Datadog) to proactively address performance bottlenecks and ensure 99.9% uptime
Security & Compliance: Enforce security policies, manage secrets (Vault, AWS KMS), and ensure compliance with industry standards (GDPR, SOC 2)
Troubleshoot & Optimize: Resolve complex infrastructure issues and lead cost-optimization initiatives for cloud resources
Collaborate & Mentor: Partner with software engineering teams to integrate DevOps practices into the SDLC and mentor junior engineers on IaC and cloud best practices
Requirements:
10+ years in DevOps, Cloud Infrastructure, or SRE roles, with hands-on experience in public cloud platforms (AWS, Azure, GCP, Heroku)
Strong experience operating and supporting production distributed systems and/or databases-as-a-service on a public cloud provider, where that service was the company's primary product
Experience designing and managing complex production environments using Kubernetes and Helm
Expertise in IaC tools (Puppet, Terraform, Ansible, CloudFormation) and configuration management
Deep understanding of networking, security, and cloud architecture best practices
Experience with monitoring tools (Prometheus, Grafana) and logging systems (ELK, Splunk)
Strong knowledge of CI/CD tools (GitHub Actions) and containerization (Docker, Kubernetes)
You like working with a small, high-caliber team with a lot of autonomy and drive, and you can iterate fast
Nice to have:
You’ve made substantial contributions to open-source projects (e.g., Puppet modules, Terraform providers)
You design and automate single-command deployments for complex, globally distributed systems to ensure consistency, reliability, and scalability across multi-cloud or hybrid environments
You fearlessly challenge the status quo and dismiss mediocre engineering as unacceptable
You have worked on large-scale distributed systems and have a good understanding of how to use tracing tools to identify bottlenecks
Experience building large-scale semantic search and/or caching systems is especially relevant
What we offer:
Medical, dental, vision, and life insurance
401(k) retirement plan
Flexible Spending Accounts (FSA) and Health Savings Accounts (HSA)