Head of SRE

France, Paris · Job Posted February 16, 2026

Job Description

Yousign is seeking an experienced Head of SRE to lead our Site Reliability Engineering team through a pivotal transformation phase. You will own the technical leadership and people management of a mature 6-person SRE team, driving our infrastructure migration to completion while shaping the future of our Platform organization. This role is critical as we complete our multi-semester infrastructure migration (90% by H1 2026, 100% by EOY 2026) and transition into a phase focused on resilience, scalability, and Platform Engineering excellence.

Job Responsibility

  • Lead, inspire, and grow a team of 6 SREs with diverse profiles (Infrastructure and Application backgrounds)
  • Create a unified SRE vision while respecting each profile's specificities and career development needs
  • Provide coaching and mentorship, including for senior/staff engineers on technical leadership
  • Manage workload, prioritize and make structured recommendations that will be discussed with Engineering management
  • Manage external resources: freelancers and specific expertise as needed
  • Embody and transmit Yousign's vision, values, and Operating Principles
  • Complete infrastructure migration successfully: Drive 90% decommissioning by H1 2026, 100% by EOY 2026
  • Define post-migration strategy: resilience (eliminate SPOFs), scalability (support growth), rationalization (observability overhaul)
  • Drive Platform Engineering vision: IDP/DevHub implementation to reduce cognitive load for product teams
  • Perform long-term capacity planning (1-2 years): project technical, budget, and headcount needs aligned with business growth
  • Participate in infrastructure strategy, architecture decisions, and technology trade-offs
  • Drive sovereign cloud strategy (OVH) and non-hyperscaler expertise
  • Present strategic recommendations to top management with clear business value articulation
  • Ensure SLA & SLO for critical B2B trust signature service (thousands of customers, millions of users)
  • Lead incident management with mature processes (on-call, runbooks, war rooms)
  • Act as credible spokesperson during crises: manage communication with internal teams, Engineering Council, and external customers
  • Establish and execute crisis communication plans maintaining trust and transparency
  • Drive blameless post-mortems and continuous improvement culture
  • Manage on-call organization and team mental load
  • Proactively identify and mitigate risks; strengthen resilience mechanisms
  • Implement disaster recovery and business continuity plans
  • Drive Platform-as-a-Product mindset: product teams as customers, focus on reducing their cognitive load
  • Minimize impact on product teams during infrastructure transformations
  • Own Build topics: Automation, CI/CD, technical framing, IDP/DevHub implementation, Developer Experience (self-service, golden paths)
  • Own Run topics: Supervision, monitoring, observability rationalization post-migration
  • Work closely with Engineering Managers to address cross-team needs
  • Diffuse DevSecOps/SRE culture within Engineering teams
  • Lead cross-functional initiatives to improve platform efficiency and reliability

Requirements

  • Proven experience with sovereign cloud (OVH, Scaleway) or traditional hosting—not just AWS/GCP/Azure with managed services
  • Real experience managing major incidents with external customer and stakeholder communication during crises
  • Minimum 3-5 years managing technical teams (5+ people)
  • Successfully led SRE transformation in a scale-up environment
  • Experience managing teams with different technical backgrounds (Infrastructure + Application, or similar hybrid profiles)
  • Ability to project over 6-12 months on technical evolution, budget, and people planning
  • Credible spokesperson in high-pressure situations with internal and external stakeholders
  • Engaging communication style that inspires and motivates teams
  • Pragmatism over dogma: 'Keep it simple' mindset, with technology as a means, not an end
  • Makes pragmatic decisions based on business context, not only theoretical best practices
  • Focuses on long-term solutions rather than 'hero mode'
  • More than 7 years of experience

What we offer

  • Reduction of working time (RTT)
  • Stock purchase plan / Stock options
  • Professional development plan
  • Mental health benefits
  • Paid volunteer time

Similar Jobs for Head of SRE

8 matching positions

Head of Support

Coralogix is a modern, full-stack observability platform transforming how busine...
Location: Israel, Ramat Gan
Salary: Not provided
Company: Coralogix
Expiration Date: Until further notice
Requirements
  • Experience in technical support, DevOps, SRE, or similar roles
  • Strong knowledge of AWS/Azure/GCP and Kubernetes ecosystems
  • Familiarity with observability tools (Kibana, Grafana, Prometheus, Datadog, Splunk, ELK)
  • Hands-on experience with Kubernetes, Docker, and distributed systems
  • Proficiency with ELK concepts, RegEx, Lucene, and PromQL
  • Proven leadership of global/multi-regional support teams (35+ people)
  • Strong incident management and escalation-handling skills
  • Ability to optimize support operations, workflows, and tooling
  • Strong analytical and data-driven decision-making abilities
  • Excellent communicator with technical and non-technical audiences
Job Responsibility
  • Lead and coach global Technical Support Engineering teams
  • Ensure high-quality support with improvements in CSAT, response/resolution times, backlog, and KPIs
  • Maintain clear global processes and standards
  • Align with regional leads for coverage across time zones
  • Act as the senior escalation point for complex issues
  • Guide engineers in root cause analysis, distributed systems, and observability
  • Oversee incident management with strong communication and collaboration
  • Maintain hands-on knowledge of Coralogix architecture and tooling
  • Drive continuous improvement to streamline workflows and reduce escalations
  • Enhance productivity through better tools, processes, and automation
  • Fulltime

Head of SRE

Yousign is seeking an experienced Head of SRE to lead our Site Reliability Engin...
Location: France, Paris
Salary: 90000.00 - 110000.00 EUR / Year
Company: Yousign
Expiration Date: Until further notice
Requirements
  • Non-hyperscaler infrastructure: Proven experience with sovereign cloud (OVH, Scaleway) or traditional hosting—not just AWS/GCP/Azure with managed services
  • Critical production management: Real experience managing major incidents with external customer and stakeholder communication during crises
  • Confirmed management experience: Minimum 3-5 years managing technical teams (5+ people)
  • SRE transformation experience: Successfully led SRE transformation in a scale-up environment
  • Diverse profile management: Experience managing teams with different technical backgrounds (Infrastructure + Application, or similar hybrid profiles)
  • Long-term vision: Ability to project over 6-12 months on technical evolution, budget, and people planning
  • Crisis communication: Credible spokesperson in high-pressure situations with internal and external stakeholders
  • Energy and dynamism: Engaging communication style that inspires and motivates teams
  • Pragmatism over dogma: "Keep it simple" mindset, with technology as a means, not an end
  • Trade-off thinking: Makes pragmatic decisions based on business context, not only theoretical best practices
Job Responsibility
  • Team Management and Leadership (40%): Lead, inspire, and grow a team of 6 SREs with diverse profiles (Infrastructure and Application backgrounds)
  • Create a unified SRE vision while respecting each profile's specificities and career development needs
  • Provide coaching and mentorship, including for senior/staff engineers on technical leadership
  • Manage workload, prioritize and make structured recommendations that will be discussed with Engineering management
  • Manage external resources: freelancers and specific expertise as needed
  • Embody and transmit Yousign's vision, values, and Operating Principles
  • Strategic Vision and Planning (25%): Complete infrastructure migration successfully: Drive 90% decommissioning by H1 2026, 100% by EOY 2026
  • Define post-migration strategy: resilience (eliminate SPOFs), scalability (support growth), rationalization (observability overhaul)
  • Drive Platform Engineering vision: IDP/DevHub implementation to reduce cognitive load for product teams
  • Perform long-term capacity planning (1-2 years): project technical, budget, and headcount needs aligned with business growth
What we offer
  • Swile card - Lunch Vouchers covered by 50% by Yousign = 10.50€
  • Alan - Health insurance: Basic coverage at €62.50/month, 50% paid by Yousign
  • Life and disability insurance: 100% covered by Yousign
  • Transportation - Hybrid workers get 50% off their public transportation passes
  • Leeto - Platform with numerous benefits such as discounts on cinema tickets, theme parks, travels, sports, etc.
  • Moka.care - 4 free therapy/coaching sessions and mental health content
  • 10 RTT days/year
  • 1 charity day per year, learning & development budget, and more
  • Fulltime

Head of Infrastructure

To prepare for a significant period of growth, Xelix is seeking a Head of Infras...
Location: United Kingdom, London
Salary: Not provided
Company: Xelix
Expiration Date: Until further notice
Requirements
  • 8+ years in infrastructure, platform, or SRE roles
  • AWS Certified Solutions Architect – Professional
  • Prior experience scaling production systems in a growing company
  • Ability to operate production systems under pressure
  • Deep hands-on experience with the AWS cloud platform
  • Strong background in reliability, observability, and incident management
  • Experience leading or mentoring engineers
Job Responsibility
  • Platform Strategy & Architecture: Own the long-term platform and infrastructure strategy
  • Design and evolve cloud architecture to support scale, resilience, and performance
  • Set standards for infrastructure, CI/CD, environments, and observability
  • Make architectural decisions and trade-offs
  • Developer Experience (DevEx): Provide infrastructure for the development team to code, test and deploy efficiently
  • Advise during design sessions to help engineers pick the right solutions for projects
  • Reliability & Operations: Own production reliability, uptime, and incident response
  • Define and enforce SLAs and SLOs
  • Lead incident response and post-incident reviews
  • Ensure monitoring, alerting, and on-call practices are effective and sustainable
What we offer
  • 27 days of annual leave (including 3 days Christmas closing), which increases by up to 3 days based on tenure, with the option to roll over, buy or sell up to 3 days
  • Hybrid working with one day a week from our dog-friendly Hoxton office
  • On-site gym and cycle to work scheme
  • Employee discount at over 100 retailers
  • Comprehensive private medical & dental cover with Vitality
  • Enhanced parental leave pay
  • Learning & development culture – £1,000 personal annual budget
  • We’re carbon-neutral and are working towards ambitious carbon reduction goals
  • Lots of team socials & activities
  • Annual team retreat
  • Fulltime

Head of Infrastructure

At Xelix, we work with some of the world’s largest companies to automate and str...
Location: United Kingdom, London
Salary: Not provided
Company: Xelix
Expiration Date: Until further notice
Requirements
  • 8+ years in infrastructure, platform, or SRE roles
  • AWS Certified Solutions Architect – Professional
  • Prior experience scaling production systems in a growing company
  • Ability to operate production systems under pressure
  • Deep hands-on experience with the AWS cloud platform
  • Strong background in reliability, observability, and incident management
  • Experience leading or mentoring engineers
Job Responsibility
  • Platform Strategy & Architecture: Own the long-term platform and infrastructure strategy
  • Design and evolve cloud architecture to support scale, resilience, and performance
  • Set standards for infrastructure, CI/CD, environments, and observability
  • Make architectural decisions and trade-offs
  • Developer Experience (DevEx): Provide infrastructure for the development team to code, test and deploy efficiently
  • Advise during design sessions to help engineers pick the right solutions for projects
  • Reliability & Operations: Own production reliability, uptime, and incident response
  • Define and enforce SLAs and SLOs
  • Lead incident response and post-incident reviews
  • Ensure monitoring, alerting, and on-call practices are effective and sustainable
What we offer
  • 27 days of annual leave (including 3 days Christmas closing), which increases by up to 3 days based on tenure, with the option to roll over, buy or sell up to 3 days
  • Hybrid working with one day a week from our dog-friendly Hoxton office
  • On-site gym and cycle to work scheme
  • Employee discount at over 100 retailers
  • Comprehensive private medical & dental cover with Vitality
  • Enhanced parental leave pay
  • Learning & development culture – £1,000 personal annual budget
  • We’re carbon-neutral and are working towards ambitious carbon reduction goals
  • Lots of team socials & activities
  • Annual team retreat
  • Fulltime

Head of Platform

At Xelix, we work with some of the world’s largest companies to automate and str...
Location: United Kingdom, London
Salary: Not provided
Company: Xelix
Expiration Date: Until further notice
Requirements
  • 8+ years in infrastructure, platform, or SRE roles
  • AWS Certified Solutions Architect – Professional
  • Prior experience scaling production systems in a growing company
  • Ability to operate production systems under pressure
  • Deep hands-on experience with the AWS cloud platform
  • Strong background in reliability, observability, and incident management
  • Experience leading or mentoring engineers
Job Responsibility
  • Own the long-term platform and infrastructure strategy
  • Design and evolve cloud architecture to support scale, resilience, and performance
  • Set standards for infrastructure, CI/CD, environments, and observability
  • Make architectural decisions and trade-offs
  • Provide infrastructure for the development team to code, test and deploy efficiently
  • Advise during design sessions to help engineers pick the right solutions for projects
  • Own production reliability, uptime, and incident response
  • Define and enforce SLAs and SLOs
  • Lead incident response and post-incident reviews
  • Ensure monitoring, alerting, and on-call practices are effective and sustainable
What we offer
  • 27 days of annual leave (including 3 days Christmas closing), which increases by up to 3 days based on tenure, with the option to roll over, buy or sell up to 3 days
  • Hybrid working with one day a week from our dog-friendly Hoxton office
  • On-site gym and cycle to work scheme
  • Employee discount at over 100 retailers
  • Comprehensive private medical & dental cover with Vitality
  • Enhanced parental leave pay
  • Learning & development culture – £1,000 personal annual budget
  • We’re carbon-neutral and are working towards ambitious carbon reduction goals
  • Lots of team socials & activities
  • Annual team retreat
  • Fulltime

Site Reliability Engineer

As Site Reliability Engineer you will contribute to the overarching implementati...
Location: Romania, Bucuresti
Salary: Not provided
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related field
  • Minimum 5 years proven work experience as a Reliability Engineer or similar role
  • Expert knowledge and hands-on experience with applications hosted on cloud platforms such as Google Cloud Platform as well as with Docker / Kubernetes in combination with Google Kubernetes Engine (GKE), Terraform or similar technology
  • Experience in resilient software development in Python/Java and the use of modern CI/CD pipelines, e.g. GitHub, GitHub Actions, Bitbucket, Helm
  • Strong experience in the setup of observability, monitoring and self-healing solutions for instance with New Relic, Splunk, Google Cloud Operations, Lightstep and Ansible
  • Very good knowledge of security standards (e.g. TLS, OAuth2, KMS, Vault, Admission Controllers, Let's Encrypt), microservice architectures, and experience with API management using Apigee or WSO2
  • Proactive attitude and collaborative team-player mindset paired with self-confidence
  • Keeping your cool and an eye for detail even in stressful, time-critical situations
  • Having a creative approach towards solving technical problems
  • Excellent communication skills in English
Job Responsibility
  • Define Service Level Objectives (SLOs), and enable an end-to-end view on customer satisfaction based on best practices for setting up Service Level Indicators (SLIs) to create effective strategies for maintaining and improving system performance and availability
  • Collaborate with Business Functional Analysts and Solution Architects to find improvements in the solution design to improve the resilience of technical solutions early on
  • Consult and guide the squad on the prioritization of reliability improvement and actively deliver them as part of the sprint
  • Implement reliability and resilience patterns hands-on, such as auto-scaling, circuit breakers, bulkheads, rate limiters, retry mechanisms, etc.
  • Actively work on service request fulfilment, incident management, and problem management to identify and reduce toil and the MTTR with engineering best practices
  • Align with the SRE chapter function and contribute to state-of-the-art SRE best practices, e.g. Distributed Tracing, OpenTelemetry, and Chaos Engineering
  • Act as a knowledge and skill multiplier for your profession by leading the Site Reliability Engineer population
  • Increase the seniority of the overall Site Reliability Engineer chapter by establishing events and procedures, and foster a culture of high standards
  • Lead the engineers in your profession and help them improve each day
What we offer
  • Smooth integration and a supportive mentor
  • Pick your working style: choose from Remote, Hybrid or Office work opportunities
  • Our projects have different working hours to suit your needs
  • Sponsored certifications, trainings and top e-learning platforms
  • Private Health Insurance – custom-made for you
  • Individual coaching sessions or accredited Coaching School
  • Epic parties or themed events – lovingly designed for our people and their families
  • Fulltime

Head of US Technology Command Centre

The Head of the US Technology Command Centre is a senior operational leadership ...
Location: United States, Whippany
Salary: 220000.00 - 300000.00 USD / Year
Company: Barclays
Expiration Date: Until further notice
Requirements
  • Significant senior‑level experience in IT Operations, Service Management, or Command Centre leadership within a complex, regulated environment
  • Proven track record of leading 24x7 operational teams, managing high‑severity incidents, and maintaining service stability at scale
  • Strong understanding of IT service management and operational control frameworks (e.g. ITIL, SRE principles, resilience and availability management)
  • Experience operating within a global, follow‑the‑sun operating model
  • Background in technology domains such as infrastructure, networks, applications, or platforms, with a strong operational and control mindset
  • Superior written and verbal communication skills, with the ability to distil complex technical issues into clear executive‑level messaging
  • Strong leadership presence, able to remain calm, decisive, and credible under pressure
  • Proven ability to influence senior stakeholders and challenge constructively where service risk exists
  • Excellent organisational skills, capable of overseeing multiple concurrent operational priorities in a dynamic environment
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related discipline (preferred)
Job Responsibility
  • Provide overall leadership and accountability for Technology Command Centre operations during the US time zone
  • Ensure continuous monitoring, control, and optimisation of live IT services across infrastructure, applications, networks, and end‑user services
  • Operate as the senior operational decision‑maker for technology incidents, service degradation, and operational risks within the region
  • Lead the end‑to‑end management of major incidents, ensuring swift triage, effective coordination, clear communications, and timely resolution
  • Provide visible leadership during severe but plausible technology events, including crisis and resilience scenarios
  • Drive root cause analysis, problem management, and service improvement actions to reduce repeat incidents and improve service stability
  • Act as a core member of the Global Command Centre leadership community, ensuring alignment to global standards, processes, tooling, and reporting
  • Ensure seamless operational handovers across time zones to maintain true 24x7 service continuity
  • Represent the US Command Centre in global governance forums and operational reviews
  • Ensure that IT services operate within agreed risk tolerance, service levels, and operational controls
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Medical, dental and vision coverage
  • 401(k)
  • Life insurance
  • Paid leave for qualifying circumstances
  • Incentive award eligibility
  • Fulltime

Engineering Manager, SRE

Abridge’s services and engineering teams are in hyperscale mode, and multiplying...
Location: United States, San Francisco
Salary: 220000.00 - 260000.00 USD / Year
Company: Abridge
Expiration Date: Until further notice
Requirements
  • 3 - 6+ years as a manager in rapidly growing organizations including at least 1 year as a manager of managers
  • Seeking an extremely challenging role that will push you beyond your limits, where failures are inevitable and not to be feared
  • Seeking a senior leadership role to develop people, environments, and impact - not ego, accolades, or ladder climbing
  • Able to ask for help, fail fast and admit defeat; get yourself and others out of their comfort zone
  • Track record of leading performance engineering including load test and chaos engineering, large scale distributed telemetry implementation, major architectural and software refactors, engineering velocity, and full stack development
  • Experience running production workloads in more than one cloud provider (at a time, or across your experience)
  • Experience managing workloads across containerized solutions, Kubernetes, and CNCF-approved tooling such as Argo, Istio, OTel, and more
  • Thought leader in platform building, with a strong desire to represent Abridge as a reliability engineering leader in the tech industry
  • Genuine passion for Abridge’s mission to improve healthcare in America and across the world
Job Responsibility
  • Visionary leadership: Scope, resource, evangelize, and execute a company-wide reliability and engineering velocity roadmap across environments and clouds, real-time streaming infrastructure under immense scale, compute as well as AI-at-edge infrastructure, and the most ambitious cloud security roadmap in the entire tech industry. Collaborate with department heads across product engineering, security, product management, commercial, and more to develop, align, and execute an extremely ambitious strategic roadmap
  • Gifted tactician: Work at the level of small tiger teams to unblock, enable, and drive execution and solutioning. Juggle several ambiguous and tricky problems at a time
  • Recruiter extraordinaire: Scale out your team to meet this roadmap - both ICs and managers. Attract top talent and hire quickly while maintaining a consistently high bar. Iterate on the hiring process, improve diversity and equity, retain and maximize the effectiveness of an extremely senior team
  • Mentor to the mentors: Develop their careers, create top-of-ladder development opportunities, and continuously raise the bar for your staff as well as your peers and leaders in their abilities and awareness. Earn their trust, lead by example, be a doctor rather than a judge for organizational and people challenges, and help establish and maintain a hivemind, de-siloed culture across all engineering pods
What we offer
  • Generous Time Off: 14 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees
  • Comprehensive Health Plans: Medical, Dental, and Vision coverage for all full-time employees and their families
  • Generous HSA Contribution: If you choose a High Deductible Health Plan, Abridge makes monthly contributions to your HSA
  • Paid Parental Leave: Generous paid parental leave for all full-time employees
  • Family Forming Benefits: Resources and financial support to help you build your family
  • 401(k) Matching: Contribution matching to help invest in your future
  • Personal Device Allowance: Tax free funds for personal device usage
  • Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits
  • Lifestyle Wallet: Monthly contributions for fitness, professional development, coworking, and more
  • Mental Health Support: Dedicated access to therapy and coaching to help you reach your goals
  • Fulltime