This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking an experienced Site Reliability Engineer to support Vodafone’s strategic growth in Internet of Things (IoT) platforms. This role focuses on ensuring the stability, resilience, and performance of critical IoT systems by embedding reliability and operational excellence throughout the product lifecycle. You will work closely with development, engineering, and architecture teams in a DevOps and agile environment, contributing to system design, operational readiness, and continuous improvement while supporting high-availability production platforms.
Job Responsibility:
Drive reliability, availability, and performance across IoT platforms through proactive monitoring, automation, and operational improvements
Design, deploy, review, and troubleshoot technical integrations with multiple platforms, services, and connected devices
Implement and enhance CI/CD practices to enable high levels of operational automation and zero-touch operations
Partner with development teams to improve services through rigorous testing, release management, and operational readiness
Act as a technical subject matter expert, supporting and coaching team members to build capability across relevant technologies
Lead and support incident and problem management activities, ensuring timely resolution, root cause analysis, and preventive actions in line with agreed SLAs
Contribute to system design reviews, including HLDs and LLDs, translating architectural decisions into operational requirements
Balance feature delivery speed with platform reliability through clearly defined service level objectives
Design, implement, and continuously enhance monitoring, alerting, and observability solutions to maintain a holistic view of system health
Manage production environments through proactive capacity planning, performance optimisation, and release deployments
Collaborate with internal and external stakeholders, including vendors, to ensure effective communication and service continuity
Promote consistent documentation and knowledge sharing across the team
Requirements:
Experienced in Site Reliability Engineering, DevOps, or production support roles within complex, enterprise-scale environments
Skilled in Unix/Linux administration with strong shell scripting experience
Experienced with CI/CD tools such as Git, Jenkins, Nexus, SonarQube, and configuration or automation tools
Proficient in infrastructure as code using tools such as Terraform or CloudFormation
Comfortable working with public cloud platforms such as AWS or Azure
Able to develop using one or more high-level programming languages, including Python, Java, or JavaScript
Experienced in containerisation and orchestration technologies, including Docker and Kubernetes
Familiar with monitoring and observability tools such as Prometheus, Grafana, CloudWatch, or Centreon
Knowledgeable in microservices architecture, APIs, and web services (REST, SOAP, JSON, XML)
Experienced with relational and NoSQL data stores such as PostgreSQL, MariaDB, Redis, MongoDB, or similar technologies
Informed about mobile network architectures and protocols, with exposure to IoT platforms (ThingWorx experience is an advantage)
Comfortable working within ITIL-based operational frameworks
Fluent in English at B2 level or above
Educated to BSc or MSc level in Computer Science, Software Engineering, Telecommunications, or a related discipline
Nice to have:
ThingWorx experience is an advantage
Additional languages are an advantage
What we offer:
The opportunity to work on large-scale, business-critical IoT platforms with global reach
Exposure to modern cloud-native architectures, DevOps practices, and automation at enterprise scale
Collaboration with international teams across Vodafone Group and strategic partners
A role that blends hands-on engineering with system design, reliability strategy, and continuous improvement
A supportive environment that values learning, knowledge sharing, and professional growth