This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Own the architecture and delivery of enterprise-grade multi-agent systems on a hybrid AWS and Open Source stack, ensuring scalability, security, and production readiness.
Job Responsibility
Own the architecture and delivery of enterprise-grade multi-agent systems on a hybrid AWS and Open Source stack, ensuring scalability, security, and production readiness
Define multi-agent system architecture (planner, executor, reviewer, memory)
Author Architecture Decision Records (ADRs) for: AWS Bedrock vs open-source models
Hosted vs self-managed inference
Design event-driven orchestration using Step Functions and/or LangGraph
Implement model routing strategies: High-complexity workflows to managed LLM platforms (e.g., Bedrock)
Cost-sensitive tasks to open-source models (e.g., Llama, Mistral)
Define fallback and failover logic
Architect: Model and prompt versioning pipelines
Evaluation pipelines
Red-teaming frameworks
Define observability metrics such as latency, token cost, and hallucination rates
Design IAM-based agent permissions and secure API/tool access
Ensure compliance with enterprise security and governance standards
Translate business problems into technical architecture
Align with security, infrastructure, and business teams
Requirements
Strong AWS experience (at least 3–4 of the following): Bedrock (or equivalent), Lambda, API Gateway, Step Functions, IAM, VPC
Experience with at least one agent framework: LangChain or LangGraph
Infrastructure as Code: Terraform or AWS CDK
Strong understanding of distributed systems, event-driven architecture, and RAG patterns
Nice to have
CrewAI or AutoGen
Observability tools such as LangSmith or equivalent