We’re a fast-growing startup building production-grade AI agents for enterprise customers at scale. We’re looking for Software Engineers with Applied AI experience who can own the design, build, and deployment of agentic workflows powered by Large Language Models (LLMs), from early prototypes to production systems that deliver concrete business value in enterprise workflows. In this role, you’ll work closely with customers on real-world business problems, often building first-of-their-kind agent workflows that integrate LLMs with tools, APIs, and data sources. While our pace is startup-fast, the bar is enterprise-high: agents must be reliable, observable, safe, and auditable from day one. You’ll also collaborate with product and platform teams, and help shape how agentic systems are built, evaluated, and deployed at scale.
Job Responsibilities:
Customer-Facing Technical Impact: Work closely with enterprise customers to translate high-value, ambiguous business problems into well-framed agentic problems with clear success criteria and evaluation methodologies
Provide technical leadership across the full development and evaluation lifecycle, including post-deployment iteration, for agentic workflows
Contribute to shared frameworks and patterns that enable consistent delivery across customers
Agent Design, Build, and Production Launches: Lead the design, build, and delivery of LLM-powered agents that reason, plan, and act across tools and data sources with enterprise-grade reliability and performance
Balance rapid iteration with enterprise requirements, evolving prototypes into stable, reusable solutions
Define and apply evaluation and quality standards to measure success, failures, and regressions
Debug real-world agent behavior and systematically improve prompts, workflows, tools, and guardrails
Team Mentorship & Organizational Impact: Mentor engineers across distributed teams
Drive clarity in ambiguous situations, build alignment, and raise engineering quality across the organization
Requirements:
Production Engineering: Substantial experience building, shipping, and maintaining production-grade software (Python/TypeScript)
Agentic Architectures: Hands-on experience building agents that plan and execute multi-step tasks (ReAct, Plan-and-Execute) and interact with external APIs/tools
The LLM Stack: Deep familiarity with frontier models (GPT, Claude, Gemini), RAG, vector databases (Pinecone, Weaviate, etc.), and orchestration frameworks (LangGraph, CrewAI, or custom state machines)
Rigorous Evaluation: Proven ability to move beyond 'trial and error' by building robust evaluation frameworks to measure agent accuracy, safety, and latency
Leadership & Impact: Stakeholder Mastery: Experience leading technical discussions with enterprise customers to translate ambiguous business needs into concrete technical specs
Experience mentoring distributed teams and setting the architectural standards for AI/Agentic systems
Additional Requirements: Strong written and verbal communication skills
Ability and willingness to travel up to 25% of the time (flexible)
What we offer:
An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, with offices in Toronto, New York, San Francisco, London, and Paris, plus a co-working stipend