This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are currently seeking a Staff Software Engineer to join the AI Platform team to drive the design, architecture, and production posture of Addepar’s AI Platform and our products and solutions. This team is at the center of Addepar's mission to integrate AI across our product suite and is growing quickly. This role focuses on building a scalable platform and infrastructure to power deep, rich AI capabilities and products. As the AI platform expands, you will be a central architect of the "Core Platform" layer – building the managed services, serving infrastructure, observability, and cross-platform integrations that turn experimental AI capabilities into scalable production software. This includes productionizing the cutting edge of the latest AI developments like agents, MCP, computer use, etc; alongside designing and automating the operational backbone of Addepar's AI stack.
Job Responsibility:
Design and build the production runtime for LLM-based agents and products, creating the services and infrastructure that serve autonomous agents
Develop deep application-level knowledge to proactively inform and influence requirements, constraints and best practices for implementing composable, complex AI systems
Lead the design, implementation, and automation of production infrastructure on a variety of cloud environments (Kubernetes/Databricks), to enable us to ship and scale AI features instantly
Evangelize and promote disciplined, best engineering practices to enforce strong production hygiene and culture
Initiate and lead collaborations with cross-functional teams to identify and resolve complex application or infrastructure issues, serving as a technical subject matter expert
Architect, build, and maintain advanced, automated CI/CD pipelines e.g. using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies (e.g., blue/green, canary)
Develop systems and best practices monitoring, alerting, and troubleshooting of our probabilistic and AI-driven systems and broader software stack
Requirements:
Extensive experience as a Software/Backend Engineer, with a track record of taking on increasing responsibility
Experience across the full product lifecycle: designing, implementing, shipping, scaling, operationalizing, and maintaining technology/SaaS products
Exceptional Programming skills and fundamentals in Python/Go/Java, with a proven track record of building large scale production systems
Proficient experience with diverse compute environments including microservices (K8s), Databricks and serverless architectures (e.g. AWS Lambda)
Demonstrable experience leading initiatives with infrastructure-as-code tools such as Terraform in complex, multi-account environments
Proficient experience with comprehensive monitoring and alerting stacks (e.g. Prometheus/Grafana/Sentry/cloud-native tools), with a focus on observability strategy
Excellent interpersonal and communication skills to effectively collaborate with multi-functional teams, articulate complex technical concepts, and influence outcomes
Nice to have:
Extensive experience with Databricks (Unity Catalog, Model Serving, Jobs)
Demonstrable experience writing and contributing to significant systems automation tooling or open-source projects is a strong plus
Specific experience with LLMs and agentic systems and associated technologies such as Langchain, Vector DBs, or MLFlow
Exposure to industry practices in financial services or other highly regulated data environments is a plus