This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We’re looking for strong engineers (backend, frontend, or full-stack) who are excited about building agents. You’ll help shape how we build, evaluate, orchestrate, and scale LLM-powered agents in production - and define what it means to create truly lovable AI products.
Job Responsibility:
Build, tune, and scale agents that power lovable products
Add new agent skills and tools
Improve agent reasoning, orchestration, and efficiency
Design how multiple agents collaborate
Select the right models for different task types
Push the limits of what agents can reliably do in real products
Analyze agent behavior and performance
Hill-climb toward better helpfulness, safety, and reliability
Build evaluation frameworks and benchmarks
Create experimentation pipelines and feedback loops
Ensure agents perform well across real-world use cases
Requirements:
Strong engineering fundamentals
Ability to build high-quality production systems
Backend, frontend, or full-stack engineering background
Nice to have:
Have built AI agents yourself (side projects count)
Are deeply curious about how AI systems behave and improve
Have worked with LLMs or AI systems in production
Are excited about experimenting with new models and techniques
Shipped ML or AI features to real users with uptime requirements
Built evaluation systems or ML experimentation pipelines
Strong opinions on safety, latency, and helpfulness - but open to testing and learning