Join us and help shape the future of AI by architecting next-generation knowledge systems. The Infra team at LlamaIndex owns the foundations that our product is built upon as well as many of the tools that enable engineers to develop, ship, and observe their code. We are responsible for designing, building, and scaling core infrastructure that powers a high-volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers and power our rapidly growing product.
Job Responsibilities:
Collaborate with other engineering teams to build and maintain foundational systems that empower developers and support the company's rapid growth
Design and implement scalable infrastructure solutions for various deployment models, including SaaS, single-tenant, and private deployments
Manage and optimize cloud resources and Kubernetes clusters for cost-effectiveness and performance
Enable successful external customer deployments by maintaining clear infrastructure boundaries and principles
Optimize and improve the release and deployment processes to enhance efficiency and reliability
Ensure compliance with relevant regulations and implement robust security measures across different deployment environments
Requirements:
5+ years of engineering experience
Experience on Platform or Infrastructure teams, working on significant projects involving infrastructure components (Terraform/CDKTF, Kubernetes, Helm, test infrastructure, release management, observability, etc.)
Experience in optimizing cloud resource utilization
Proficient in tuning Kubernetes clusters and cloud resources for cost and performance efficiency
Willingness to help build LlamaIndex’s engineering culture as we grow
Ability to balance speed and pragmatism and to build the appropriate solution for each stage of the company’s growth
Nice to have:
Experience building out infrastructure from the ground up at a fast-growing startup
Experience with observability tools like Prometheus, Grafana, and New Relic
Experience with GitOps tools like ArgoCD and Flux for continuous deployment
Experience with security compliance and audits (such as SOC 2) in cloud environments
Familiarity with Python, Postgres, and multi-cloud deployments
What we offer:
Competitive base salary and equity compensation
Comprehensive medical/dental/vision coverage for you and your family
Unlimited paid time off policy
Daily catered lunch and snacks in the San Francisco office