
Engineering Manager - Observability & Reliability Engineering Obsession

Germany, Berlin · Job Posted March 21, 2026

Job Description

We are looking for an Engineering Manager to join the OREO (Observability Reliability Engineering Obsession) team in Platform Engineering. As an Engineering Manager, your mission will be to lead the Reliability & Observability team and drive the evolution of Doctolib's observability platform, supporting the exponential growth of Doctolib services while building and empowering a world-class SRE team.

Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with product, design, and business teams.

You will lead a team of Site Reliability Engineers who are responsible for shaping Doctolib's observability strategy and ensuring our platform remains reliable, debuggable, and scalable. This role sits at the intersection of people management, technical leadership, and strategic planning, with a particular focus on building organizational capabilities around logging, metrics, tracing, and alerting. Your team also owns and operates critical transversal services that enable secure, scalable infrastructure management across the organization, including HashiCorp Vault for secrets management and Terraform Enterprise for infrastructure as code.

Job Responsibility

  • Lead, coach, and grow a team of Site Reliability Engineers, supporting their technical development and career progression
  • Create a culture of operational excellence, continuous improvement, and psychological safety within the team
  • Conduct regular 1:1s, performance reviews, and career development conversations
  • Recruit, onboard, and retain top SRE talent aligned with Doctolib's mission and values
  • Partner with SREs and senior engineers to define and evolve the observability strategy across the platform, focusing on logging, metrics, tracing, and alerting
  • Own the strategy and evolution of critical transversal services including HashiCorp Vault and Terraform Enterprise
  • Drive prioritization and roadmap planning for large-scale reliability and observability initiatives
  • Ensure alignment between team objectives and broader engineering and business goals
  • Advocate for and allocate resources toward reducing technical debt and improving developer experience
  • Own the team's on-call experience and contribute to the incident response processes, ensuring sustainable practices and continuous improvement
  • Ensure high availability and reliability of transversal services that are critical to the entire engineering organization
  • Lead postmortem reviews and drive systemic improvements to prevent recurring issues
  • Work closely with Product Managers, Engineering Managers, and architects to align observability capabilities with product and platform needs
  • Partner with security and infrastructure teams to evolve secrets management and IaC practices across the organization
  • Represent the OREO team in engineering leadership forums, architectural reviews, and strategic planning sessions
  • Foster strong partnerships with software engineering teams to improve instrumentation quality and adoption of observability best practices

Requirements

  • 5+ years of software engineering or SRE experience, with a strong technical background in cloud-native environments (preferably AWS, GCP, and/or Kubernetes-based)
  • 3+ years of engineering management experience, leading technical teams (ideally SRE, platform, or infrastructure teams)
  • Deep understanding of observability tooling and architecture (Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Prometheus, Thanos, Datadog)
  • Experience with infrastructure as code (Terraform, OpenTofu) and secrets management systems (Vault, AWS Secrets Manager)
  • Proven ability to balance technical depth with people leadership, able to mentor engineers, review technical designs, and guide architectural decisions

Nice to have

  • Experience scaling SRE or platform teams in fast-growing, high-traffic environments
  • Background in designing and operating high-scale telemetry pipelines
  • Hands-on experience with HashiCorp Vault and Terraform Enterprise in production environments
  • Hands-on experience with backend programming languages (e.g., Go, Python, Ruby)
  • Experience driving cultural and technical transformations

What we offer

  • Free comprehensive health insurance for you and your children
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy
  • Up to 14 days of RTT
  • A subsidy from the work council to refund part of the membership to a sport club or a creative class
  • Lunch voucher with Swile card

Similar Jobs for Engineering Manager - Observability & Reliability Engineering Obsession

8 matching positions

Engineering Manager - Observability & Reliability Engineering Obsession

We are looking for an Engineering Manager to join the OREO (Observability Reliab...
Location: France, Paris
Salary: Not provided
Doctolib (doctolib.fr)
Expiration Date: Until further notice
Requirements
  • 5+ years of software engineering or SRE experience, with a strong technical background in cloud-native environments (preferably AWS, GCP, and/or Kubernetes-based)
  • 3+ years of engineering management experience, leading technical teams (ideally SRE, platform, or infrastructure teams)
  • Deep understanding of observability tooling and architecture (Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Prometheus, Thanos, Datadog)
  • Experience with infrastructure as code (Terraform, OpenTofu) and secrets management systems (Vault, AWS Secrets Manager)
  • Proven ability to balance technical depth with people leadership, able to mentor engineers, review technical designs, and guide architectural decisions
Job Responsibility
  • Lead, coach, and grow a team of Site Reliability Engineers, supporting their technical development and career progression
  • Create a culture of operational excellence, continuous improvement, and psychological safety within the team
  • Conduct regular 1:1s, performance reviews, and career development conversations
  • Recruit, onboard, and retain top SRE talent aligned with Doctolib's mission and values
  • Partner with SREs and senior engineers to define and evolve the observability strategy across the platform, focusing on logging, metrics, tracing, and alerting
  • Own the strategy and evolution of critical transversal services including HashiCorp Vault and Terraform Enterprise
  • Drive prioritization and roadmap planning for large-scale reliability and observability initiatives
  • Ensure alignment between team objectives and broader engineering and business goals
  • Advocate for and allocate resources toward reducing technical debt and improving developer experience
  • Own the team's on-call experience and contribute to the incident response processes, ensuring sustainable practices and continuous improvement
What we offer
  • Free comprehensive health insurance for you and your children
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy
  • Up to 14 days of RTT
  • A subsidy from the work council to refund part of the membership to a sport club or a creative class
  • Lunch voucher with Swile card
  • Fulltime

Principal Group Engineering Manager

The Partner Engagement Platform (PEP) team builds the systems and services that ...
Location: United States, Redmond
Salary: 163000.00 - 296400.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Lead engineering strategy and execution for modernizing the partner engagement platform, including service architecture, API evolution, and long‑term scalability
  • Oversee cloud‑based service development with emphasis on reliability, performance, security, and operational excellence
  • Define platform vision, requirements, and multi‑year roadmap with Product and TPM
  • Ensure secure global workflows for engagement, build distribution, telemetry, and diagnostics
  • Drive engineering quality via observability, automation, and CI/CD
  • Advance AI capabilities for reporting, operational insights, workflow improvements, and engineering toolchains
  • Lead a multidisciplinary team across SWE, Data Engineering, TPM, and Product
  • Mentor and develop engineers and PMs through coaching and career support
  • Drive inclusive hiring and build team capability aligned to platform and AI needs
  • Foster a culture focused on impact, learning, and customer obsession
  • Fulltime

Principal Software Engineer

The Industry Solutions Engineering (ISE) team is a global engineering organizati...
Location: Netherlands, Amsterdam
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Experience building or integrating AI/ML or LLM-based solutions, prompt engineering, RAG, fine-tuning
  • Familiarity with deploying and operating AI systems in production environments
  • Understanding of model evaluation, data quality, and performance monitoring
  • Experience using cloud AI platforms (Azure ML, OpenAI, or similar)
  • OR equivalent experience.
Job Responsibility
  • Put security first: Build and ship solutions that meet enterprise security standards (threat modeling, secure coding, privacy, and compliance) from design through production
  • Translate business needs into technical solutions: Partner with stakeholders to define problem statements, success metrics, and architectural approaches that deliver measurable outcomes
  • Design and lead architecture: Own end-to-end system design for cloud and AI workloads, making sound tradeoffs across reliability, performance, cost, and maintainability
  • Deliver quickly without sacrificing quality: Use modern engineering practices (CI/CD, automated testing, observability, and progressive delivery) to iterate fast and reduce operational risk
  • Drive customer success and adoption: Work directly with customer engineering teams to deliver production-ready solutions, unblock delivery, and ensure outcomes are adopted at scale
  • Build reusable, scalable assets: Create solution accelerators, reference architectures, and code that can be reused across customers and scenarios to maximize impact
  • Operate effectively in ambiguity: Continuously learn and adapt as technologies and customer priorities evolve; bring clarity, structure, and momentum to complex engagements
  • Lead and mentor across disciplines: Provide technical direction, coach engineers, and collaborate with product, data, and security partners to deliver as one team
  • Lead complex delivery end-to-end: Coordinate multiple workstreams, manage dependencies, and raise the bar on reliability and operational excellence for services running in production
What we offer
  • Health, wellness, and financial future benefits
  • Fulltime

Senior Product Manager - Transaction Processing

As a Senior Product Manager in PPRO’s Transaction Processing Domain, you will pl...
Location: Germany, Berlin
Salary: Not provided
PPRO GmbH (ppro.com)
Expiration Date: Until further notice
Requirements
  • Product Leadership: Proven experience owning a product or feature set, leading product discovery, and executing successfully in a fast-paced environment
  • Exceptional Collaboration Skills: Proven ability to partner closely with engineering, design and external stakeholders to deliver complex products and platforms at scale, with the capability to translate technical complexity into clear, accessible concepts
  • Strong Analytical Rigour: Ability to work with data, define KPIs, and drive decisions based on insights rather than intuition, ability to effectively communicate with and influence both customer-facing and partner-facing stakeholders
  • Payments and Fintech Experience: Deep understanding of payment processing APIs, merchant checkout workflows, compliance requirements, and risk controls in financial services
  • Technical Fluency: Experience working with APIs, automation & observability tools, and technical teams to deliver scalable and efficient solutions
  • Customer-Obsessed Mindset: Ability to deeply understand customer needs and translate them into frictionless product experiences
  • Bias for Action: A pragmatic approach to decision-making, balancing long-term vision with near-term execution
  • Languages: English is a must
Job Responsibility
  • Own and deliver the product strategy for PPRO’s core payment processing platforms and APIs, driving customer engagement, feature development and market launches in close partnership with engineering and stakeholders to provide scalable, reliable and market-leading payment capabilities that create measurable customer and business value
  • Build deep user empathy with both PPRO’s merchants and payment method partners, understanding global payment industry trends and competitors' offerings to influence PPRO’s roadmap effectively
  • Partner with Commercial and Enterprise Account Management to understand evolving customer needs, benchmark against industry standards, and shape global payment products and experiences, balancing internal priorities with customer objectives to ensure aligned, high-impact delivery
  • Own and evolve PPRO’s core payment processing platform and customer-facing API suite, ensuring highly available, reliable systems that deliver high payment success rates and strong merchant business outcomes
  • Work hands-on with observability tools and data: analyze key reliability metrics, business opportunities, and success metrics related to payment acceptance rates, platform uptime & performance, and financial flow efficiency to drive meaningful outcomes
  • Define clear success metrics and KPIs to measure merchant adoption of API products, delightful merchant checkout experiences, payment method efficiency, automate reporting, and enable data-driven decision-making
  • Collaborate cross-functionally – work closely with Commercial, Compliance, Risk, Sales, and Customer Success teams to refine product requirements and translate those to global products & features
  • Lead roadmap planning and execution for key initiatives, which supports building benchmarked global products, enhancing wider platform capabilities, optimization of existing flows, and co-creation of new products with payment method partners
  • Be a thought leader in local payments – stay ahead of regulatory changes, market trends, and best practices to keep PPRO at the forefront of effective, efficient, and accessible payments across the world
What we offer
  • Hybrid working - We offer a hybrid structure with a 3 days/week on-site expectation; work from abroad policy, enabling employees to work remotely for up to another 30 days per year
  • Learning and Development - We offer a €1,000 annual budget to support your professional growth; leadership cafés, on-the-job training
  • Insurance - accident insurance, disability insurance, direct insurance (bAV) and travel insurance
  • Gym membership - PPRO helps contribute towards the costs of your gym membership
  • Enhanced Family Leave
  • Mental Health Platform - one-on-one therapy, chat therapy, therapist-led courses, guided meditations, and more
  • Pet-friendly office
  • Fulltime

Senior Platform Engineer

As a 0x Platform Engineer, your mission is to ensure our suite of products remai...
Location: United States
Salary: Not provided
0x (0x.org)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience
  • 5+ years of professional engineering experience with a strong background in platform or infrastructure engineering
  • Proficiency with infrastructure automation and scripting (TypeScript or Rust preferred)
  • Strong expertise in Kubernetes-based applications in cloud environments (EKS preferred)
  • Experience with AWS and Infrastructure as Code (Terraform)
  • Advanced understanding of modern DevOps practices (GitOps, CI/CD, OpenTelemetry)
  • Proven ability to troubleshoot distributed systems, networking, and latency issues in production environments
  • Obsessed with optimizing for low latency, high throughput, and developer productivity
  • Passion for decentralization and alignment with the 0x mission
  • Embody our core values: do the right thing, consistently ship, and create enduring value
Job Responsibility
  • Ensure uptime, reliability, and resilience across 0x’s core products and services
  • Own and improve our observability stack (Grafana, Loki, Tempo, VictoriaMetrics), driving efficiency and actionable insights
  • Design and implement resilient systems for low-latency, high-availability workloads
  • Automate engineering workflows including onboarding/offboarding, service provisioning, and infrastructure migrations
  • Manage and upgrade cloud infrastructure and Kubernetes platforms while ensuring scalability and high availability
  • Optimize build systems and CI/CD pipelines to improve developer velocity and performance
  • Collaborate across application, data, and infrastructure teams to debug issues, reduce latency, and drive cross-team automation
What we offer
  • Comprehensive insurance (medical/dental/vision/life/disability) for U.S.-based employees — 100% of base plan covered for you and dependents
  • 401k and FSA for U.S.-based employees
  • Monthly mobile phone bill, wellness, and pre-tax transportation expense
  • Covered mental health benefits (including professional therapy sessions)
  • A supportive remote environment
  • Lunch reimbursement for all employees across the globe
  • Stipend for your ideal remote / WFH set-up: headphones, and any other work gear you may need
  • 12-week paid parental leave
  • Great office conveniently located in the SF Financial District for those in the region
  • Flexible vacation: Take time when you need it (and we really mean it)
  • Fulltime

IT Training Lead

The IT Training Lead will drive technology learning and user adoption across the...
Location: United States, Delray Beach
Salary: Not provided
Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Requirements
  • Experience in IT training, instructional design, technical enablement, or learning and development
  • Strong knowledge of Microsoft 365
  • Excellent communication, facilitation, and content development skills
  • Ability to translate technical concepts into practical, user-friendly training.
Job Responsibility
  • Design, develop, and deliver IT training programs in instructor-led, virtual, and self-paced formats
  • Take lead in the Microsoft Copilot and AI training strategy, including onboarding, advanced use cases, responsible AI usage, and ongoing enablement
  • Partner with IT leadership to support new technology rollouts, system upgrades, and digital transformation initiatives
  • Create and maintain training content, including videos, guides, tutorials, and job aids
  • Identify skill gaps and develop targeted learning solutions to improve adoption and productivity
  • Gather feedback and measure training effectiveness to continuously improve programs.

K Kitchen Representative

The position includes, but is not limited to, the following essential job duties...
Location: United States, New Albany
Salary: Not provided
Circle K (https://www.circlek.com)
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity; keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan

K Kitchen Representative

Location: United States, Decatur
Salary: Not provided
Circle K (https://www.circlek.com)
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity; keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan