This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a fullstack engineer at Inflection, you will own the platforms, systems, and services that bring our conversational AI to life at scale. You’ll collaborate across research, product, and infrastructure teams to enable rapid iteration, high reliability, and secure delivery of novel AI features to millions of users. Your work will directly impact both the pace of product development and the stability of our production systems.
Job Responsibility:
Design and implement scalable backend systems and APIs that power production LLM experiences, including agentic workflows, memory systems, and tool integrations
Build and operate high-availability infrastructure to support real-time inference, retrieval, and conversation pipelines
Develop internal platforms to improve engineering productivity—CI/CD pipelines, service templates, observability frameworks, and rollout tooling
Collaborate closely with applied research and frontend teams to rapidly prototype, ship, and iterate on end-user features
Ensure systems meet our high bar for security, uptime, and latency through incident response, load testing, monitoring, and automation
Participate in on-call rotations to maintain the reliability of the services you build
Requirements:
5+ years of professional software engineering experience, particularly in full-stack development
Prior experience in high-growth or early-stage startup environments
Strong proficiency across the modern web stack: Python, TypeScript, Node.js, and modern frontend frameworks (e.g., React, Tailwind)
Experience in designing complex architectures, including asynchronous workflows and integrations
Proven problem-solving, collaboration, and communication skills
Experience building or integrating AI/LLM-powered applications
Experience with modern cloud and workflow infrastructure, including orchestration frameworks (e.g., Temporal), containerization and Kubernetes, and CI/CD pipelines on AWS/GCP/Azure
Have a bachelor’s degree or equivalent in a related field to the offered position requirements
What we offer:
Diverse medical, dental and vision options
401k matching program
Unlimited paid time off
Parental leave and flexibility for all parents and caregivers
Support of country-specific visa needs for international employees living in the Bay Area