We are on a mission to ensure everyone has access to medical expertise, no matter where they are. Half the world still lacks access to quality healthcare. Even in advanced systems, outcomes are uneven, and clinicians are overwhelmed. Medical knowledge grows faster than human capacity can keep up. Corti is building the infrastructure to close that gap. Our AI platform expands access to medical expertise, reducing errors, restoring time to clinicians, and making care more affordable, accessible, and human again. There is no quality healthcare without a quality dialogue, and no reliable AI without a strong foundation. Help us build both.
Job Responsibilities:
Work closely with product and engineering to ensure that our systems are scalable, reliable and performant
Lead the design and operation of a self-hosted observability stack on Kubernetes
Build libraries to standardize logging and metrics solutions across multiple applications
Design and implement automation tools and processes to improve the efficiency of our development and operations teams
Design and architect our platform to scale as the company grows
Create new developer tooling and improve the existing toolset to make the developer experience better
Contribute to a strong culture of development at Corti through mentorship and knowledge sharing with people inside as well as outside the team
Requirements:
Proven experience in a Senior Platform Engineer or Senior DevOps role
Experienced with the LGTM stack (Loki, Grafana, Tempo, Mimir)
Can write and integrate libraries for application logging and observability
Strong focus on SLOs, SLIs, and performance monitoring
Understands signal-to-noise optimisation and alerting best practices
Balances feature enablement with infrastructure cost and retention considerations
Experience with containerization and orchestration tools like Docker and Kubernetes, such as architecting a multi-tenant cluster, implementing GitOps, horizontal/vertical scaling, and running a fault-resilient cluster
Experience with cloud platforms such as Azure, AWS or GCP, including deploying infrastructure-as-code and configuring cloud resources to meet security and compliance standards
Strong knowledge of programming languages such as Golang and Python
Experience managing and operating distributed, stateful systems on Kubernetes, such as Redis clusters, PostgreSQL clusters or Apache Kafka
Experience writing CI/CD pipelines using GitHub Actions
Experience using and developing Kubernetes Operators
Excellent English verbal and written communication skills, including all-remote communication