We're looking for a seasoned Sr DevOps Engineer to help drive the reliability, scalability, and efficiency of our infrastructure and deployment processes. In this role, you'll tackle complex technical challenges, designing and implementing solutions that improve our system architecture and developer workflows. You'll play a pivotal role in guiding projects from design through production support, and you'll mentor junior engineers in DevOps best practices. Reporting to DevOps leadership, you'll also participate in our 24×7 support rotation to ensure our mission-critical services maintain maximum uptime.
Job Responsibilities:
Oversee the reliability, performance, and security of critical production services from design to deployment, ensuring they meet our uptime and performance targets
Collaborate with development, QA, and product teams to build and maintain resilient infrastructure and efficient deployment pipelines
Automate infrastructure provisioning and software deployments using Infrastructure as Code and CI/CD tools, reducing manual work and errors
Participate in and improve our 24×7 on-call process, swiftly troubleshooting incidents and performing root cause analysis to prevent recurrence
Document and standardize processes and configurations, sharing knowledge to raise the capabilities of the entire engineering team
Requirements:
5-7 years of experience in DevOps, SRE, or Software Engineering roles, with increasing responsibility in system design and operations
Extensive experience with containerization (Docker) and orchestration (Kubernetes) in production environments, including managing and scaling clusters
Proficiency in Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet) to automate infrastructure provisioning
Strong coding and scripting skills in languages like Python, Go, or Ruby, with the ability to build automation tools for system management
Deep knowledge of cloud platforms (AWS and/or GCP) and their services, with experience designing and operating cloud-based infrastructure at scale
Solid understanding of networking and security fundamentals in cloud and on-prem environments
Experience setting up and tuning monitoring/alerting systems (Prometheus, Grafana, etc.), and a thorough understanding of SRE best practices (SLIs, SLOs, incident management)
Strong problem-solving and communication skills, with a track record of working effectively in collaborative team environments