This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The role will focus on transforming Production Management using SRE principles, with responsibilities such as enhancing incident management, promoting automation, and collaborating with multiple teams. It involves leveraging AI solutions and managing critical onboarding-related applications.
Job Responsibility:
shifting Production Management towards a proactive, predictive model using Site Reliability Engineering (SRE) principles
leveraging AI to improve issues escalation and diagnosis
implementing incident response, recovery processes, and engineering operations
championing automation initiatives to streamline operational tasks
scripting for automated health checks, alerting, and remediation
collaborating with development, infrastructure, and business teams
participating in post-mortem analysis
supporting Client Onboarding applications and other integrated systems
Requirements:
2-5 years of relevant experience in 'Production management' OR 'Site Reliability Engineering'
relevant experience in Incident Management
relevant experience in Automation using SQL & Python
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.