This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Software Engineer on the Apollo team, you’ll build and operate a large-scale distributed system to allow the remote operation and maintenance of Kubernetes clusters. Our mission is to extract the entire state of a cluster into a portable, high-performance artifact within minutes, enabling full and almost instant cluster reconstruction from the ground up—all while pushing the limits of speed, reliability, and scale. You’ll design and implement backup and restore solutions for Kubernetes, leveraging proprietary compression infrastructure tailored to Palantir’s unique deployment models. You’ll also build and optimize our container artifact store, which is based on the OCI (Open Container Initiative) distribution spec—the industry standard for storing and distributing container images and artifacts. You’ll own the backbone of every environment Apollo supports, from hyperscalers to Army trucks.
Job Responsibility:
Build and maintain highly available systems responsible for orchestration of software deployment across hundreds of production environments
Develop performance sensitive storage features and optimizations to store and serve Palantir products
Engage and solve problems that customers need today with a tight feedback loop for what is important and what works
Working directly with other teams to solve problems that cross the team boundary
Debug complex issues and performance problems throughout the stack, including open source
Requirements:
4+ years of professional software development experience on core infrastructure with emphasis on operational excellence
2+ years of experience contributing to the system design or architecture (architecture, design patterns, reliability and scaling) of new and existing systems
Bachelor's degree in Computer Science or equivalent
Systems programming experience with strong proficiency in golang, Java or equivalent
Some familiarity with containers (Docker) and orchestration (Kubernetes) at scale
Nice to have:
Experience working with hosted container registries such as Amazon Elastic Container Registry (ECR), Google Artifact Registry, or Azure Container Registry (ACR)
Familiarity with the OCI Distribution Spec and open source registry projects (e.g., distribution/distribution, Harbor)
Experience designing, building, and operating high-scale observability or infrastructure systems
Experience building high-quality software in a fast-paced CI/CD development environment
High empathy for developer and operators workflows and productivity
What we offer:
Medical, dental, and vision insurance
Voluntary life insurance
Basic life, AD&D and disability insurance
Commuter benefits
Relocation assistance
Take what you need paid time off
2 weeks paid time off built into the end of each year
10 paid holidays
Supportive leave of absence program
Paid leave for new parents
Subsidized back-up care for all parents
Fertility and family building benefits
Stipend to help with expenses that come with a new child