This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Software Site Reliability Engineering team (SRE) is responsible for keeping cloud-based services, streaming frameworks, NoSQL/RDBMS databases and distributed analytical platforms running in multi-cloud environments. We proactively support our Cloud infrastructure to prevent and resolve any application and infrastructure related issues. As an SRE at level 3, you will troubleshoot cloud infrastructure issues, address root cause, build automation using shell scripts and languages like Python/Go, and IaaC tools. A successful candidate has an interest in developing, operating, troubleshooting, and scaling online services and understanding of Linux infrastructures, networking principles, CI/CD tools.
Job Responsibility:
Improve the IAC Code and implement best practices
Develop tools to support the cloud platform
Optimize CI/CD pipelines
Fix vulnerabilities in the container images and standardize the process of image creation
Build AMI’s with CIS and STIG benchmarks
Debug the issue reported with the applications and infrastructure
Review new tools and technologies
Fine tune distributed systems like Apache Kafka, Cassandra, etc
Requirements:
5+ years of hands-on experience as one of the following InfraOps Engineer, DevOps, and/or SRE
Strong programming skills in Python and/or Golang
Hands-on experiencing working with Linux operating systems based on Debian or Ubuntu
Hands-on experience working with cloud services like AWS and GCP
Experience with designing and implementing cloud-based infrastructure using infrastructure as Code tools such as Terraform, Packer and Ansible
Strong experience with containerization technologies (docker, containerd) and orchestration, tools like AWS EKS and GCP GKE
Experience with implementing and maintaining CI/CD
Experience with monitoring tools like Prometheus, Grafana, ELK etc
A general background in Cloud Security
Strong experience with GitOps
Strong problem solving and debugging skills with a high sense of ownership
Excellent team player with very good oral and written communication skills
Nice to have:
Any opensource development experience
Experience with relational (SQL)
Experience with distributed systems like Apache Kafka, Cassandra, Storm, Flink
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.