This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Site Reliability Engineering Consultant will be responsible for developing and implementing software solutions in a complex, multi-disciplinary environment. The role requires a comprehensive understanding of software development lifecycle, excellent engineering skills, and the ability to operate in a global environment. The candidate will drive continuous delivery and automation efforts while coaching team members on best practices.
Job Responsibility:
Demonstrate an in-depth understanding of Software Development Lifecycle and how it integrates within the overall technology landscape to deliver scalable, reliable and resilient applications
Ability to operate in a global environment with on-/near-/off-shore matrix reporting structures
Operate into a highly regulated environment that requires in-depth understanding of the regulatory requirements and the industry implications for our technologies
Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
Drive Continuous Delivery and Automation efforts across the supported applications by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
Foster a culture that promotes transparency and innovation for increased team productivity
Coach members of the team and outside the immediate reporting line about the best practices and recognize anti-patterns that are quickly addressed
Implement the Agile Framework through one of its implementations like SCRUM or Kanban and ensure it integrates with overall organization processes
Avidly communicate progress and project status across the organization and ensure that stakeholders are managed appropriately throughout the execution period
Requirements:
Relevant experience in a critical software development role with high business impact
Excellent engineering skills and senior architecture
Excellent working knowledge of key computer science concepts (networking, operating systems, virtualization, containerization, etc.)
Polyglot full-stack developer mentality
Excellent understanding of Software Engineering concepts like Software Development Life Cycle and GitOps
Excellent debugging and analytical skills
Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio, is highly desirable
Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, Cloud Formation, etc.) is a highly desirable
Experience of delivering software using Agile delivery methodologies is a must (SCRUM/Kanban)
Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale is desirable
Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.) is desirable
Degree in computer science/mathematics/physics or related technical subject is desirable
Experience of senior stakeholder management
Consistently demonstrates clear and concise written and verbal communication skills
Bachelors degree in computer science/mathematics/physics or related technical subject
9+ years in a site reliability engineeringrelated role with proven hands-on expertise and the capability to demonstrate technical proficiency in the following: Programming (Java, Python, or equivalent)
Containerization
Kubernetes
GitOps
High Availability Systems
Infrastructure as a code
Configuration Management
Observability (tools and implementation)
Hyperscale Systems
Middleware configuration
Nice to have:
Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio
Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, Cloud Formation, etc.)
Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale
Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.)
Degree in computer science/mathematics/physics or related technical subject
What we offer:
medical, dental, and vision insurance with an employer contribution
flexible spending or health savings account
life and AD&D insurance
short and long term disability coverage
paid time off
employee assistance
participation in a 401k program with company match