Senior Observability Platform Engineer

Life Science Talent

Location:
Denmark, København

Contract Type:
Not provided

Salary:
Not provided

Job Description:

We are on a mission to ensure everyone has access to medical expertise, no matter where they are. Half the world still lacks access to quality healthcare. Even in advanced systems, outcomes are uneven, and clinicians are overwhelmed. Medical knowledge grows faster than human capacity can keep up. Corti is building the infrastructure to close that gap. Our AI platform expands access to medical expertise, reducing errors, restoring time to clinicians, and making care more affordable, accessible, and human again. There is no quality healthcare without a quality dialogue, and no reliable AI without a strong foundation. Help us build both.

Job Responsibility:

  • Work closely with product and engineering to ensure that our systems are scalable, reliable and performant
  • Lead the design and operation of a self-hosted observability stack on Kubernetes
  • Build libraries to standardize logging and metrics solutions across multiple applications
  • Design and implement automation tools and processes to improve the efficiency of our development and operations teams
  • Design and architect our platform to scale as the company grows
  • Create and improve developer tooling to enhance the developer experience
  • Contribute to a strong culture of development at Corti by mentoring and sharing knowledge with people inside and outside the team

Requirements:

  • Proven experience in a Senior Platform Engineer or Senior DevOps role
  • Experienced with the LGTM stack (Loki, Grafana, Tempo, Mimir)
  • Can write and integrate libraries for application logging and observability
  • Strong focus on SLOs, SLIs, and performance monitoring
  • Understands signal-to-noise optimisation and alerting best practices
  • Balances feature enablement with infrastructure cost and retention considerations
  • Experience with containerization and orchestration tools like Docker and Kubernetes, such as architecting a multi-tenant cluster, implementing GitOps, horizontal/vertical scaling, and running a fault-resilient cluster
  • Experience with cloud platforms such as Azure, AWS or GCP, for example deploying infrastructure-as-code and configuring cloud resources to meet security and compliance standards
  • Strong knowledge of programming languages such as Golang and Python
  • Experience with managing and operating distributed, stateful systems on Kubernetes such as Redis Clusters, PostgreSQL clusters or Apache Kafka
  • Experience writing CI/CD pipelines using GitHub Actions
  • Experience with the usage and development of Kubernetes Operators
  • Excellent English verbal and written communication skills, including all-remote communication

What we offer:

Equipment provided by Corti

Additional Information:

Job Posted:
January 05, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Senior Observability Platform Engineer

Senior Software Engineer, Wikidata Platform

The Wikimedia Foundation is seeking a Senior Software Engineer to join the team ...
Salary:
141352.00 - 175725.00 USD / Year
Wikimedia Foundation
Expiration Date
Until further notice
Requirements:
  • 5+ years of experience as a backend or platform engineer working on distributed systems or data platforms
  • Deep understanding of database and knowledge graph representation technologies and standards
  • Proficiency in Java, C++, or other systems languages. Ability to set up, scale, and investigate systems is more important than expertise in a particular language.
  • Experience building and operating production-grade services with SLOs
  • Familiarity with modern observability tools (metrics, logging, tracing)
  • Understanding of graph databases, search indexes, or data processing pipelines
  • Ability to work collaboratively across disciplines and communicate clearly across technical and non-technical audiences
  • A commitment to learning, resilience, and contributing to a mission-driven engineering culture
Job Responsibility:
  • Design, build, and maintain backend systems and APIs that power Wikidata’s query infrastructure
  • Improve reliability, observability, and automation of the Wikidata Query Service and data pipelines
  • Collaborate with SRE, data engineers, and product teams to ensure stability and scalability under growing usage
  • Monitor production systems, respond to operational incidents, and proactively identify and resolve bottlenecks
  • Support platform migrations and system upgrades (e.g., triple stores, streaming ingestion)
  • Contribute to deployment automation, CI/CD workflows, and service instrumentation
  • Participate in code reviews, design discussions, and technical planning
  • Document systems and share knowledge with team members and Wikimedia’s broader technical community
Employment Type:
Fulltime

Senior Platform Engineer

Glide is looking for a Senior Platform Engineer to join our Infrastructure team ...
Salary:
Not provided
Glide
Expiration Date
Until further notice
Requirements:
  • 5+ years of experience as a platform engineer/SRE
  • 3+ years experience building and maintaining highly available and scalable distributed data sources
  • Experience with Google Cloud Platform services like Cloud SQL, Cloud Run, AlloyDB, or equivalent
  • Experience orchestrating complex systems with Kubernetes
  • Proficiency in TypeScript development
  • Strong SQL skills; can speak to covering index optimization strategies
  • Experience designing, building and running data-intensive event-driven architectures
  • You are a clear and effective communicator, be it when you write code, write emails, or explain complex technical issues to non-technical co-workers
  • Passionate and self-motivated, with a demonstrated ability to work in a fast-paced and evolving environment
Job Responsibility:
  • Managing our existing infrastructure in GCP
  • Driving our platform evolution as the complexity and sophistication of our product only increases
  • Managing our GitHub/GH Actions based build pipeline
  • Provide build, test, and runtime infrastructure to service teams
  • Ensure patterns are established (e.g., for database throttling, request rate limiting, etc…) to protect Glide’s uptime
  • Monitor infrastructure costs and coordinate improvements when necessary
  • Drive SRE tooling and best practices around observability and alerting
  • Write, review, and maintain code primarily in TypeScript
  • Write architecture briefs and proposals, carry out code experiments, and build prototypes to learn how we can achieve reliable scale with our systems
  • Provide technical leadership, mentorship, pairing opportunities, and code review to encourage the growth of others
What we offer:
  • competitive salary and benefits package
  • a supportive and dynamic remote work environment
  • opportunities for career growth
Employment Type:
Fulltime

Senior Software Engineer, Platform Observability

Everlaw is looking for a Senior Software Engineer that brings experience in buil...
Location:
United States, Oakland
Salary:
164000.00 - 208000.00 USD / Year
Everlaw
Expiration Date
Until further notice
Requirements:
  • BS or MS in Computer Science, or equivalent coursework
  • At least 3 years of experience building logging, metrics, and tracing infrastructure
  • Proficiency in coding in a language such as C, C++, C#, Java, Python, JavaScript, Go or Rust
  • Experience with Infrastructure as Code and container solutions to manage cloud environments (e.g., Terraform, Ansible, Docker)
  • At least 1 year of experience leading multi-developer efforts, including planning, technical breakdown, and coordination
  • Excellent communication and collaboration skills
  • Please note that at this time, Everlaw is not sponsoring U.S. employment visas for this role. Due to federal contract requirements, Everlaw may only hire US citizens for this position.
Job Responsibility:
  • Build observability strategies to support application and infrastructure metrics, logs, traces, dashboards, and alerts
  • Develop and maintain infrastructure as code (IaC) using tools such as Terraform and Ansible
  • Monitor usage trends to identify opportunities to optimize efficiency and performance of our metrics database and logging tools
  • Improve our on-call and incident management processes by encouraging deeper understanding, communication, and trust
  • Support developer projects by influencing design and implementation of infrastructure features as well as providing technical guidance
  • Support compliance efforts by promoting continuous documentation of our processes and involvement in audits
  • Provide technical mentorship to other engineers by both sharing your technical knowledge and becoming an expert in an area of our code base
What we offer:
  • Equity program
  • 401(k) retirement plan with company matching
  • Health, dental, and vision
  • Flexible Spending Accounts for health and dependent care expenses
  • Paid parental leave and approximately 10 days (80 hours) per year of sick leave
  • Seventeen paid vacation days plus 11 federal holidays
  • Membership to Modern Health to help employees prioritize mental health and wellness
  • Annual allocation for Learning & Development opportunities and applicable professional membership dues
  • Company-sponsored life and disability insurance
  • Work in Uptown Oakland, just steps from the BART line and dozens of restaurants and walking distance to Lake Merritt
Employment Type:
Fulltime

Senior Engineering Manager, Platform Engineering (Developer Experience)

Everlaw is seeking a Senior Engineering Manager, Platform to lead teams focused ...
Location:
United States, Oakland, California
Salary:
219000.00 - 277000.00 USD / Year
Everlaw
Expiration Date
Until further notice
Requirements:
  • At least 5 years as a senior engineer building developer productivity tools and/or highly available platform services (e.g., storage, pub-sub, search, caching, observability) and/or deep experience with infrastructure/cloud technologies (e.g., Terraform, Kubernetes, Docker)
  • 3+ years of experience directly managing software engineers and/or technical leads, including hiring, coaching, performance management, and growing a high-performing team
  • 2+ years of experience building and leading developer experience or platform teams/programs that deliver internal platforms and tooling with measurable productivity outcomes (e.g., faster builds/tests, improved CI/CD lead times, higher deployment frequency)
  • Experience managing scalable database infrastructure (e.g., Postgres, MySQL or equivalent)
  • Can communicate at the right altitude with both technical and non-technical stakeholders, and you’ve led cross-functional roadmaps with Engineering Operations, Security Engineering, DevOps, Product, and Design
  • Authorized to work in the United States. Please note that currently, Everlaw is not sponsoring employment visas.
Job Responsibility:
  • Lead platform teams that build and evolve core internal platforms and developer tooling—spanning build/test infrastructure, CI/CD, and developer workflows—to improve engineer productivity and time-to-value
  • Collaborate closely with Engineering Operations, Security Engineering, DevOps, Product, and Design to synthesize requirements and prioritize impactful investments
  • Drive roadmapping, resourcing, and execution for critical platform areas that make it better and cheaper to develop, test, and release software
  • Establish and use developer efficiency metrics (e.g., build/test times, deploy lead time, change failure rate) to identify bottlenecks and plan ambitious improvements to workflows
  • Ensure operational excellence for platform services and tooling with clear SLOs, robust observability, and incident/bug management practices
  • Coach and develop engineers and leads: provide actionable feedback, elevate technical execution, and foster an inclusive, high-accountability culture
  • Partner with Engineering Operations to improve processes for alignment, goal setting, empowerment, and cross-team execution across Engineering
  • Communicate effectively with both technical and non-technical stakeholders, adjusting altitude from strategy to technical deep dives as needed.
What we offer:
  • Medical, dental, wellness program, paid parental leave, professional development, fully stocked kitchen
  • Equity program
  • 401(k) retirement plan with company matching
  • Health, dental, and vision
  • Flexible Spending Accounts for health and dependent care expenses
Employment Type:
Fulltime

Senior Platform Engineer - CI/CD & AI Automation (AI-first)

Groupon is undergoing a critical platform transformation, modernizing its core d...
Location:
Czechia, Prague
Salary:
Not provided
Groupon
Expiration Date
Until further notice
Requirements:
  • 5+ years of dedicated experience in Platform Engineering, DevOps, or Infrastructure roles
  • Deep expertise building, scaling, and migrating CI/CD systems, with strong practical experience in Jenkins and/or GitHub Actions
  • Expertise in scripting and automation (Python, Go, or Bash)
  • Solid understanding of container technologies, Kubernetes, and cloud build systems
  • Proven experience leveraging AI tooling (e.g., Claude Code, code analysis) to meaningfully increase developer output and optimize platform work
  • Excellent communication and ability to drive technical decisions across multiple platform and product teams
Job Responsibility:
  • Platform Transformation: Lead the design, planning, and execution of the Jenkins-to-GitHub Actions migration across a large portfolio of microservices
  • Pipeline Engineering: Design and optimize high-performance, secure, and observable CI/CD workflows across GitHub Actions, Jenkins, and Kubernetes environments
  • AI-First Automation: Drive an AI-First workflow by leveraging tools (e.g., Copilot, code generation) to eliminate infrastructure toil, accelerate development, and analyze pipeline failures
  • Core Automation: Develop robust platform automation (e.g., Python, Go, Bash) to improve build efficiency, artifact caching, reliability, and repository hygiene
  • Security & Compliance: Harden CI/CD infrastructure with robust controls for secrets management, RBAC, audit logging, and secure runner design
  • Observability: Implement and enhance CI/CD observability using tools like Prometheus, Grafana, and OpenTelemetry to provide deep insights into performance and reliability
  • Technical Leadership: Mentor engineers and partner across Cloud, Security, and Developer Experience teams to define and evolve our end-to-end delivery platform architecture

Senior Observability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...
Location:
Germany, Berlin
Salary:
Not provided
Coralogix
Expiration Date
Until further notice
Requirements:
  • 5+ years of experience in Site Reliability, DevOps, or Platform Engineering with a focus on observability
  • Proven expertise with at least one major observability platform (e.g., Prometheus, Victoria Metrics, OpenSearch)
  • Hands-on experience with Kubernetes, including deep knowledge of controllers, operators, and Helm
  • Experience writing Kubernetes controllers (controller-runtime, KubeBuilder)
  • Strong programming skills in Go or Python (Rust is a plus)
  • Experience designing, scaling, and operating observability systems at enterprise scale
  • Familiarity with at least one major cloud provider (AWS, Azure, or GCP)
  • Strong understanding of distributed systems, telemetry pipelines, and instrumentation standards (e.g., OpenTelemetry)
  • Excellent communication skills with the ability to explain complex topics to diverse stakeholders
Job Responsibility:
  • Design, implement, and maintain observability features such as Alerting, SLOs, Reporting, and Synthetic Tests
  • Manage and scale OpenTelemetry Collectors and other observability agents across Kubernetes environments
  • Write and maintain Kubernetes Controllers using frameworks like controller-runtime and KubeBuilder
  • Operate and optimize the internal Coralogix account, ensuring proper usage, cost efficiency, and best practices adoption
  • Define and enforce observability guidelines and standards across the organization
  • Partner with engineering teams to embed observability by default into products and services
  • Control observability-related costs while maximizing performance, visibility, and value
  • Contribute to upstream projects such as OpenTelemetry, helping shape industry standards
  • Explore and implement cutting-edge observability technologies, including eBPF-based approaches
Employment Type:
Fulltime

Senior Distributed Systems Engineer - Ad Display Platform Engineering

The Bidding Platform organization is the core of the RTB business, processing ov...
Location:
Poland
Salary:
Not provided
RTB House
Expiration Date
Until further notice
Requirements:
  • 7+ years of hands-on experience in software engineering
  • Proficiency in programming
  • Excellent understanding of how complex IT systems work (from the hardware level, through software, to algorithmics)
  • Very good knowledge of fundamental Internet protocols and technologies (DNS, HTTP, cookies and others)
  • Good knowledge of basic methods of creating concurrent programs and distributed systems (from thread level to geo-distributed clusters level)
  • Practical ability to observe, monitor and analyse the operation of production systems (and draw valuable conclusions from it)
  • The ability to critically analyze the solutions created in terms of performance (from estimating the theoretical performance of the designed systems to detecting and removing actual performance problems in production)
  • General knowledge of issues (typical problems and methods of solving them) in the areas of 'high scalability' and 'high availability'
  • C1 level in English and Polish
Job Responsibility:
  • Implement and maintain (in all aspects, including setting up the environment, writing configuration code, and monitoring production) high-quality backend services for displaying Ads globally, focusing on extreme performance and scalability
  • Develop tools (deployment, testing platforms, web performance and reliability monitoring), and critical optimizations to drive measurable improvements in critical user performance metrics for ad rendering and display
  • Write, test, and deploy robust, efficient, and well-documented code in Java/Python, ensuring adherence to the highest coding and performance standards
  • Participate in code reviews, knowledge sharing sessions, and help implement technical standards and best practices within the team
What we offer:
  • Projects focused on extreme performance and high code quality – solid code reviews are our standard
  • Collaboration within an interdisciplinary, self-sufficient team (including DevOps, database experts, backend developers, product designers, and QA engineers)
  • Hardware and software tailored to your preferences (e.g., MacBook, AI tool licenses)
  • Flexible working conditions – no core hours, fully remote cooperation possible

Senior Software Engineer - Observability and Reliability

We are growing the engineering team and looking for engineers who have the chops...
Location:
United States, New York City
Salary:
150000.00 - 220000.00 USD / Year
Sigma Computing
Expiration Date
Until further notice
Requirements:
  • Strong Computer Science fundamentals
  • 5+ years industry experience building and maintaining high-quality software, especially software other engineers use
  • You apply a product mindset to infrastructure systems and feel accomplished enabling others
  • Desire to be a great teammate and have fun at work
  • Strong sense of craftsmanship, and a healthy academic curiosity
Job Responsibility:
  • Build observability tools and platforms, including: metrics, logging, distributed tracing, dashboarding, alerting, application performance management
  • Build with modern tools and languages like Go, OpenTelemetry and Kubernetes
  • Participate in on-call rotation and ensure uptime of services
  • Create runtime tools/processes that optimize cloud triaging and limit downtime
  • Define best practices around making our systems and services measurable
  • Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies. We expect successful candidates to be coding a majority of their time
What we offer:
  • Equity
  • Generous health benefits
  • Flexible time off policy. Take the time off you need!
  • Paid bonding time for all new parents
  • Traditional and Roth 401k
  • Commuter and FSA benefits
  • Lunch Program
  • Dog friendly office
Employment Type:
Fulltime