
Engineering Manager, AgentOps

Location: United States, San Francisco
Salary: 216200.00 - 270250.00 USD / Year
Job Posted: February 20, 2026

Job Description

The vision for the AgentOps team is to build the best Agent Development Platform in the AI industry. Agent development is in its nascent stages in a rapidly changing industry, and limited tooling makes it hard for agent developers to manage agent lifecycles. As a team with a front-row seat to what Enterprise customers need and want, we want to build an opinionated but flexible platform for all Agent operations ("AgentOps"), including building, deploying, monitoring, evaluating, and improving agents to solve customer needs. With the AI industry currently moving towards RL workflows that need verifiable rewards, we further want to gear this platform towards knowledge capture that creates compounding effects, making Agents more effective and capable. We want to create a virtuous data flywheel where agents built using this platform see continuous performance improvements and increase customer value over time.

Job Responsibility

  • Manage the engineering team and drive technical delivery
  • Work cross-functionally with both internal and external customers
  • Work across the entire product lifecycle from conceptualization through production
  • Build features end-to-end: back-end, system design, debugging and testing
  • Deliver experiments at a high velocity and level of quality to engage our customers
  • Influence the culture, values, and processes of a growing engineering team
  • Inspire and mentor engineers
  • Collaborate with cross-functional teams to define, design, and ship new product features and experiences

Requirements

  • At least 5 years of relevant experience
  • At least 2 years of experience managing engineers is preferred
  • Proven success in leading, managing, and developing high-performing Engineering teams
  • Expertise in identifying product engagement patterns and trends for large-scale consumer products
  • Track record of shipping high-quality products and features at scale
  • Desire to work in a very fast-paced environment
  • Ability to turn business and product ideas into engineering solutions
  • Excellent problem-solving skills
  • Ability to work independently or as part of a team
  • Excited to join a dynamic, hybrid team in either San Francisco or New York City.

What we offer

  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • A learning and development stipend
  • Generous PTO
  • Additional benefits such as a commuter stipend
  • Equity-based compensation


Similar Jobs for Engineering Manager, AgentOps

8 matching positions


ML Solutions Architect

As an ML Solutions Architect, you'll be the technical bridge between clients and...
Location: Colombia, Medellín; Bogotá; Cali; Barranquilla; Bucaramanga
Salary: Not provided
Company: Provectus
Expiration Date: Until further notice
Requirements
  • Solution Design: Ability to architect end-to-end ML systems for diverse business problems
  • ML Lifecycle: Deep understanding of the full ML lifecycle from data to deployment
  • System Design: Experience designing scalable, production-grade ML architectures
  • Trade-off Analysis: Ability to evaluate technical approaches (cost, performance, complexity)
  • Feasibility Assessment: Quickly assess if ML is an appropriate solution for a problem
  • Agentic Architecture: Deep understanding of agent design patterns, state management, and orchestration frameworks
  • Claude Ecosystem: Hands-on experience with Claude Code, Claude Agent SDK, and Anthropic's tool ecosystem
  • MCP Proficiency: Understanding of Model Context Protocol architecture for designing client integrations
  • Agent Frameworks: Practical knowledge of LangGraph, LangChain agents, and multi-agent orchestration patterns
  • AI-Assisted Workflows: Demonstrated experience with AI coding assistants (Cursor, GitHub Copilot, Claude Code) for rapid prototyping
Job Responsibility
  • Lead technical discovery sessions with prospective clients
  • Understand client business problems and translate them into ML solutions
  • Design end-to-end ML architectures and technical proposals
  • Create compelling technical presentations and demonstrations
  • Estimate project scope, timelines, cost, and resource requirements
  • Support General Managers in winning new business
  • Serve as the primary technical point of contact for clients
  • Manage technical stakeholder expectations
  • Present technical solutions to both technical and non-technical audiences
  • Navigate complex organizational dynamics and conflicting priorities
What we offer
  • High-visibility role working with diverse clients
  • Opportunity to shape solution offerings and practice direction
  • Work with cutting-edge ML, LLM, and agentic AI technologies
  • Global exposure across LATAM, Europe, and North America
  • Career path toward Practice Leadership or Principal Architect
  • Learning budget and conference attendance
  • Remote-first with regular client travel opportunities
  • Compensation for health insurance or sports coverage
  • Access to the latest AI tools and subscriptions for professional development
Employment type: Full-time

Ai Technical Architect

Location: United States, Auburn Hills
Salary: Not provided
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • 20+ years in software engineering with 5+ years focused on AI/ML systems
  • 3+ years hands-on experience architecting and shipping production LLM and agentic AI applications
  • Demonstrated success leading enterprise-scale AI platform builds with measurable business outcomes
  • Track record architecting scalable cloud-native systems on AWS in regulated or large-enterprise environments
  • Experience leading technical teams, mentoring engineers, and engaging executive stakeholders
  • Bachelor's or Master's degree in Computer Science, AI/ML, or a related technical field
  • Expert proficiency with LangGraph, LangChain, and agent orchestration frameworks
  • Deep experience with Amazon Bedrock, SageMaker, and Amazon Q, including Bedrock Agents and Knowledge Bases
  • Hands-on experience with Model Context Protocol (MCP), function calling, tool use, and structured output patterns
  • Strong command of prompt engineering, evaluation harnesses, fine-tuning, and model optimization
Job Responsibility
  • Design the enterprise AI platform architecture spanning the LLM API gateway, GPU and compute allocation pools, sandbox provisioning, model registry, and security gate automation
  • Define infrastructure standards, API gateway patterns, and reference architectures consumed by all AI delivery towers and partner integrations
  • Establish guardrails for token metering, rate limiting, audit logging, DLP validation, SAST, DAST, dependency scanning, and model card review embedded in CI/CD
  • Review security posture across all AI workloads with mapping to NIST AI RMF, AWS Well-Architected (including the Machine Learning Lens), and applicable enterprise compliance baselines
  • Architect multi-agent systems using LangGraph, LangChain, and Model Context Protocol (MCP) for complex workflow orchestration, planning, and tool use
  • Define patterns for ReAct, Chain-of-Thought, Tree-of-Thoughts, and agent-to-agent coordination across enterprise and customer-facing use cases
  • Design and optimize Retrieval-Augmented Generation (RAG) systems, embedding strategies, and semantic search across structured and unstructured enterprise data
  • Establish MLOps and AgentOps practices for deployment, evaluation, observability, and continuous improvement of agents and models in production
  • Architect solutions on Amazon Bedrock, Amazon SageMaker, Amazon Q, Bedrock Agents, and Bedrock Knowledge Bases
  • Define infrastructure patterns using Amazon EKS, AWS Lambda, ECS Fargate, API Gateway, EventBridge, SNS/SQS, Kinesis, S3, DynamoDB, Aurora, Redshift, Athena, OpenSearch, and Kendra
Employment type: Full-time

AI Integration Engineer

We are seeking a highly motivated AI Integration Engineer to join our team and h...
Location: United States, Annapolis Junction
Salary: 112800.00 - 257000.00 USD / Year
Company: Booz Allen Hamilton
Expiration Date: Until further notice
Requirements
  • 5+ years of experience in infrastructure engineering or system integration roles
  • 2+ years of experience supporting large-scale AI/ML systems or GPU-centric environments
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud, and their AI-focused services, including SageMaker, GCP AI Platform, and Azure Machine Learning
  • Experience with networking concepts, including TCP/IP, DNS, NGINX, load balancing, and firewalls, applied to AI model and infrastructure deployment
  • Experience integrating MLOps pipelines using tools such as MLflow, Kubeflow, TensorFlow Serving, or Vertex AI, including integration of AgentOps frameworks such as LangSmith and Arize Phoenix, to monitor autonomous decision-making paths and agent reasoning traces
  • Experience with orchestration frameworks for multi-agent systems such as LangGraph, CrewAI, or AutoGen, and managing the stateful databases required to support them, including Redis and Postgres
  • Experience working with NVIDIA GPU technologies, including CUDA, NCCL, TensorRT, and DGX systems, and container or orchestration tools such as Kubernetes, Docker, Terraform, or Pulumi
  • Ability to manage and optimize distributed, high-performance computing environments, including clusters of GPUs and cloud-based GPU instances
  • TS/SCI clearance with a polygraph
  • Bachelor's degree in CS, Computer Engineering, or Systems Engineering
Job Responsibility
  • Serve as the technical point of contact for integrating LLMs and other AI workloads across infrastructure systems, operational tools, and application pipelines
  • Architect, deploy, and maintain scalable GPU computing environments and infrastructure required for autonomous agentic workflows, including persistent state management, long-term memory systems such as Vector DBs, and multi-step reasoning traces
  • Develop, manage, and optimize CI/CD pipelines for AI deployments, ensuring smooth transitions from model development to production environments
  • Oversee network and infrastructure connectivity, ensuring seamless communication between distributed systems, GPUs, virtual machines (VMs), APIs, and Command and Control (C2) tools
  • Design and secure tool-calling environments where agents interact with external APIs, ensuring strict governance and sandboxing for autonomous actions
  • Provide diagnostic and troubleshooting expertise for AI systems, monitoring infrastructure to maintain availability, security, and performance benchmarks
  • Collaborate across engineering, data, and AI teams to align infrastructure solutions with business and operational goals
What we offer
  • Health, life, disability, financial, and retirement benefits
  • Paid leave
  • Professional development
  • Tuition assistance
  • Work-life programs
  • Dependent care
  • Recognition awards program
Employment type: Full-time

Infrastructure Engineer

As a Senior Infrastructure Engineer, you will play a pivotal role in shaping the fo...
Location: United States, New York; San Francisco
Salary: 130000.00 - 250000.00 USD / Year
Company: Distyl AI
Expiration Date: Until further notice
Requirements
  • 6+ years of professional experience in infrastructure, DevOps, or systems engineering roles, with a proven track record of delivering production-grade systems
  • Deep expertise in cloud platforms (Azure, AWS, or GCP) and container orchestration with Kubernetes
  • Strong experience with Infrastructure as Code tools (Terraform, Pulumi, etc.) and GitOps workflows
  • Hands-on experience with CI/CD systems (Flux, ArgoCD, Helm, GitHub Actions, GitLab CI)
  • Proficiency in Python and Linux for automation and tooling
  • Solid knowledge of microservices architecture, distributed systems design, networking, security, IAM, and secrets management
  • Familiarity with modern observability stacks (Prometheus, Grafana, OpenTelemetry, Datadog)
  • Experience with serverless frameworks and event-driven architectures
  • A demonstrated ownership mindset—able to take initiative, make informed decisions, and drive projects forward
  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience
Job Responsibility
  • Design & Operate Cloud-Native Infrastructure: Architect and manage resilient, scalable deployments across cloud and hybrid environments, leveraging Kubernetes, serverless frameworks, and modern microservices architectures with the infrastructure that powers them
  • Evolve Infrastructure as Code: Drive the development of modular Terraform and GitOps configurations that ensure consistency, repeatability, and speed across multi-cloud environments
  • Advance Automation & CI/CD: Drive our automation strategy by building and refining CI/CD pipelines that enable rapid, reliable deployments. You'll apply modern practices like GitOps and work with a variety of build and deployment tools, such as Gradle, Bazel, Flux, Helm, and GitHub Actions, to reduce operational overhead
  • Embed Security & Compliance: Integrate IAM, secrets management, and secure service-to-service communication into every layer of the infrastructure
  • Drive Observability & Reliability: Establish robust monitoring, logging, and alerting practices with tools like Datadog, Prometheus, Grafana, and OpenTelemetry to ensure our systems are performant, secure, and enterprise-ready
  • AgentOps & AI Tooling: Design and build the critical infrastructure for our advanced AI systems. This includes architecting AI toolchains, handling complex agent integrations and agent deployments
What we offer
  • 100% covered medical, dental, and vision for employees and dependents
  • 401(k) with additional perks (e.g., commuter benefits, in‑office lunch)
  • Access to state‑of‑the‑art models, generous usage of modern AI tools, and real‑world business problems
  • Ownership of high‑impact projects across top enterprises
  • A mission‑driven, fast‑moving culture that prizes curiosity, pragmatism, and excellence
Employment type: Full-time

Principal AI Engineer Senior Vice President

The Principal AI Engineer / SVP is a senior leader responsible for defining and ...
Location: India, Bengaluru
Salary: Not provided
Company: Citi
Expiration Date: Until further notice
Requirements
  • 14+ years of experience across analytics consulting, AI product development, AI/ML, GenAI research, and GenAI project development and production
  • World-class expertise in building GenAI-led business solutions, with a proven track record of designing and delivering business solutions, platforms, and products
  • Deep and authoritative knowledge of Generative AI, including LLMs, and extensive experience architecting and working with technology teams in building complex, production-grade Agentic solutions
  • Ability to define and articulate new architectural options and strategies to solve complex business and technical problems
  • Recognized thought leader in AI, with experience in influencing and driving technical direction across a large enterprise
  • Exceptional leadership and mentoring capabilities, with experience developing senior-level technical talent
  • Deep understanding of the financial services domain, including regulatory, compliance, and security challenges
  • Deep exposure to Agentic frameworks, AgentOps Tools, vector databases, container orchestration, API management, Foundation models, Fine Tuning frameworks, Parameter Efficient methods, Knowledge Graphs, retrieval frameworks and embedding models
  • Hands-on experience with the GenAI tech stack: knowledge management, chunking strategies, intent algorithms, RAG strategies, hybrid search algorithms, fusion and ranking, and attribution
  • Bachelor's/University degree
Job Responsibility
  • Define and own the long-term vision and strategy for Generative AI and Agentic systems across the business unit, ensuring alignment with business goals and future growth
  • Lead the research, design, and prototyping of next-generation AI platforms and "first-of-a-kind" capabilities, establishing new patterns for the organization
  • Act as the lead solution architect for the most complex and critical AI initiatives, ensuring solutions are built for scale, with the highest standards of reliability, security, and performance
  • Set the technical standards for AI development, including frameworks, tools, MLOps, and Model Governance, influencing teams across the organization
  • Mentor and develop senior technical talent (Lead and Senior Engineers), acting as a "force multiplier" to elevate the technical capabilities of the entire department
  • Lead and scale a multi-tiered organization of AI professionals, setting the vision, managing budget, and representing the AI function at the senior executive level
  • Serve as the primary technical advisor to senior leadership on all matters related to AI, including opportunities, risks, and competitive landscape
  • Drive a culture of innovation, experimentation, and technical excellence
Employment type: Full-time

Senior Audit Manager – Customer Domains

This is an opportunity to join Group Internal Audit at Bank of Ireland, the thir...
Location: Republic of Ireland, Dublin
Salary: Not provided
Company: Bank of Ireland
Expiration Date: June 26, 2026
Requirements
  • Certified Internal Auditor (CIA), Certified Information Systems Auditor (CISA), or a recognised accountancy qualification
Job Responsibility
  • Lead end-to-end audit engagements, from planning through to reporting, ensuring delivery aligns to agreed timelines, budget and quality standards
  • Assess and challenge internal controls across Retail, Mortgage and C&CB, using data and insights to form clear, evidence-based conclusions
  • Deliver concise and impactful audit reports, translating findings into practical and forward-looking actions for senior partners
  • Identify emerging risks, trends and control weaknesses, contributing to audit planning and improving audit methodologies
  • Build strong partner relationships, lead and support audit teams, and encourage collaboration to deliver high-quality outcomes
What we offer
  • Hybrid working
  • 25 days annual leave
  • Excellent pension contributions
  • 6 months paid maternity leave
  • Innovative fertility and surrogacy policy
  • Working parent and carer supports
  • Substantial health insurance contribution
  • Employee Assistance Programme
  • WebDoctor
  • Financial wellbeing coaches
Employment type: Full-time

System Operations Engineer

Georgia System Operations Corporation (GSOC) is seeking a System Operations Engi...
Location: United States, Tucker
Salary: 77880.00 - 189300.00 USD / Year
Company: Georgia System Operations
Expiration Date: Until further notice
Requirements
  • Bachelor of Science in Engineering or a related field
  • Strong analytical and problem-solving skills
  • Proficiency with Microsoft Office tools (Excel, Power BI, Word, PowerPoint)
  • Effective written and verbal communication skills
Job Responsibility
  • Provide day-ahead and real-time engineering support to ECC and SCC operations
  • Analyze system conditions using load flow models, forecasting tools, and operational data
  • Develop, maintain, and enhance operational applications, models, and analytical tools
  • Support outage coordination, system switching, and reliability planning activities
  • Ensure compliance with NERC reliability standards and operating requirements
  • Partner with operations, planning, and engineering teams to troubleshoot issues and improve system performance
  • Develop and maintain policies, procedures, and operational guidelines that support safe and efficient operations
What we offer
  • Medical
  • Dental
  • Vision
  • 401k Match
  • Parental Leave
  • Educational Assistance
  • Annual Performance Bonus
  • PTO
  • Volunteer Time Off
Employment type: Full-time

Merchandiser Stocker

The Merchandiser is responsible for providing high-quality merchandising support...
Location: United States, Noblesville; Fishers
Salary: 21.59 - 22.72 USD / Hour
Company: Keurig Dr Pepper
Expiration Date: Until further notice
Requirements
  • Ability to lift up to 50 lbs repeatedly
  • Capability to push and pull up to 100 lbs repeatedly
  • Possession of a valid driver's license
  • Proof of vehicle insurance
  • Access to a dependable and reliable vehicle
Job Responsibility
  • Providing high-quality merchandising support for Keurig Dr Pepper brands
  • Stocking and displaying products on shelves/coolers at large accounts within a set territory
  • Reporting directly to assigned stores for scheduled shifts
  • Covering routes as assigned
  • Using personal vehicle to travel among stores in territory
  • Using company-issued phone to clock in and out and track mileage for reimbursement
What we offer
  • Medical
  • Dental and Vision
  • Paid Time Off
  • 401(k) program with employer match
  • Child & Elder Care
  • Adoption Benefits
  • Paid Parental Leave
  • Fertility Benefits
  • Employee Resource Groups
  • Breastmilk Shipping Services
Employment type: Full-time