This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for an experienced engineer to build and enhance observability capabilities for agent-based solutions on AWS. This role focuses on providing deep visibility into distributed systems, improving reliability, and enabling proactive issue detection and resolution. Our customer is a multinational corporation with more than a century of history and offices in over 180 countries. Their most ambitious goal at the time is to introduce a range of Reduced-Risk Products (RRPs). The target audience is more than 1 billion consumers around the globe. IT platform hosts 700+ applications. Intellia’s mission is to help the client with the engineering of a comprehensive software ecosystem for a game-changing IoT product on the margin of innovative consumer experience and cutting-edge technology. Our teams are involved in the engineering of core platform components for best-in-class eCommerce, Digital Marketing and IoT solutions. As a DevOps engineer, you will become a part of Core Architecture Team and be responsible for the architecture, implementation of best practices in our Digital Engineering Enterprise Platform. The Platform is a set of services and internet applications that accelerate the development and delivery of software applications by taking care of common SDLC challenges. The Platform provides access and consumption for engineering teams to a set of services, technologies, practices for their development and for operating their application, ensuring a set of compliance and best practices. Project is in production for 2+ years, being supported by multiple teams. Our technical domains are: – AWS cloud, partially Azure – SSO, Organizations, Service control policies, access models. – IAAC: terraform enterprise, terratest, chalice – Serverless: lambda, step functions, wide range of misc automations, fargate – System, Application, Network and security architectures – Orchecstration: k8s (eks) – SRE activities (logging, tracing, monitoring), OpsGenie, Splunk – Hashicorp Vault – Hybrid Networking
Job Responsibility
Design and implement agent workflows using LangGraph
Build stateful, multi-step AI pipelines with complex decision logic
Orchestrate interactions between multiple agents and external systems
Integrate LLM-based components into production-grade applications
Ensure scalability and reliability of agent execution flows
Collaborate with platform teams to integrate agent workflows with AWS infrastructure
Optimize performance and cost efficiency of agent-based systems
Contribute to architecture and best practices for agentic systems
Requirements
5+ years of experience as a Software / DevOps / Platform Engineer
Strong experience with Python (FastAPI, APIs, async workflows)
Hands-on experience with LangGraph or similar agent orchestration frameworks
Experience working with LLM-based systems (OpenAI, Anthropic, etc.)
Strong knowledge of AWS (EKS, Lambda, API Gateway, etc.)
Experience with Kubernetes and Terraform
Understanding of stateful workflows and distributed systems
Experience building and integrating APIs and microservices
Familiarity with CI/CD processes and cloud-native development
Nice to have
Experience with LangChain or similar ecosystems
Understanding of prompt engineering and LLM evaluation