This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Work closely with developers, QA, and operations teams to foster a DevOps culture focused on security, reliability, and automation
Design, implement, and manage comprehensive monitoring solutions using tools like Prometheus, Grafana, ELK stack, etc.
Develop and maintain alerting systems that proactively provide insights into system health and performance
Define and track SLIs, SLOs, and SLAs for critical services and ensure continuous compliance
Automate infrastructure provisioning and management using tools such as Ansible or Terraform to eliminate manual interventions
Build and maintain CI/CD pipelines (GitLab CI) to streamline deployments and ensure system consistency
Implement automated testing and validation processes for infrastructure and applications
Leverage containerization and orchestration technologies (Docker, Kubernetes) to manage scalable, resilient, and fault-tolerant services
Use Infrastructure as Code (IaC) to automate and standardize environment provisioning and configuration management
Ensure the security and compliance of infrastructure by implementing best practices in network security, including encryption, firewall management, access controls, and intrusion detection
Perform regular security audits and vulnerability assessments to identify and mitigate risks
Monitor network traffic and optimize performance through network tuning and troubleshooting
Develop high-availability and disaster recovery solutions for mission-critical services
Conduct postmortems for major incidents, perform root cause analysis, and implement preventive measures
Collaborate with development teams to optimize applications for performance and security
Continuously improve operational processes by identifying bottlenecks, automating workflows, and enhancing security measures
Requirements:
Proven experience in an SRE, DevOps, or infrastructure engineering role with a focus on monitoring, automation, and orchestration
Deep understanding of cloud platforms such as AWS, Azure, or Google Cloud
Strong knowledge of network design, TCP/IP, DNS, routing, and network security best practices
Expertise in monitoring tools (Prometheus, ELK)
Hands-on experience with automation tools (Terraform, Ansible, Jenkins, CI/CD)
Proficiency with containerization and orchestration (Docker, Kubernetes)
Proficiency in scripting languages (Bash, Python, Go)
Familiarity with microservices architecture and distributed systems
Bachelor in Engineering/MCA/M.sc/ M.S./ MBA in Systems, IT or Insurance or Finance
Excellent verbal & non-verbal communication skills
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.