CrawlJobs Logo

Staff Software Engineer, AI Agent Platform

Geico

Location Icon

Location:
United States , Chevy Chase

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

115000.00 - 260000.00 USD / Year

Job Description:

The Geico AI Agent Platform team is seeking an exceptional Staff Software Engineer to build the next generation enterprise Agent OS and SDKs. This role combines deep technical expertise in platform engineering, application design and agentic workflows with strong leadership and mentoring capabilities. You will be responsible for designing, implementing, and maintaining scalable, reliable frontend and backend systems that enable our business, product and engineering teams to build, test and deploy their own AI agents & workflows. The candidate must have excellent communication skills and a proven track record of delivering business value via technical excellence.

Job Responsibility:

  • Architect and implement scalable multi-tenant backend systems for building AI agent workflows, including agent configuration, offline evaluation, synthetic data generation, workflow simulation, agent marketplace, etc. using Azure Kubernetes Service (AKS), FastAPI, etc., ensuring economy of scale and control cost of maintenance
  • Collaborate with Design team to architect and implement frontend experiences and workflows for onboarding both technical and non-technical stakeholders, maximizing user adoption and successful AI agent development
  • Develop observability frameworks to ensure 99.9%+ uptime for AI agent platforms through robust monitoring, alerting, and incident response procedures
  • Evaluate and (if desirable) integrate cutting-edge GenAI frameworks, libraries and vendors to maintain a state-of-the-art technology stack, including hybrid cloud solutions with AWS/GCP as backup or specialized use cases
  • Architect and implement scalable, high-performance machine learning platforms and systems capable of processing large data volumes and supporting real-time decision making and workflows
  • Oversee the end-to-end lifecycle of AI agent applications, ensuring robust testing, deployment, and ongoing monitoring
  • Ensure adherence to company production readiness standards, security protocols, and regulatory compliance throughout the development lifecycle
  • Continuously optimize platform performance, reducing latency and improving throughput for AI agent workloads
  • Design and implement backup, recovery, and business continuity plans for hosted platform applications & services
  • Design and maintain robust CI/CD pipelines for ML model deployment using Azure DevOps, GitHub Actions, and MLOps tools
  • Act as the tech lead for a sub-team, setting technical direction and ensuring consistency in design principles and best practices
  • Provide hands-on mentorship and guidance during design reviews, code assessments, and performance tuning
  • Lead by example in tackling complex technical challenges and driving system-wide architectural improvements
  • Establish and champion engineering standards for ML infrastructure, deployment practices, and operational procedures
  • Create technical documentation, runbooks, and deliver internal training sessions on platform capabilities
  • Work closely with data scientists, software engineers, and product teams to seamlessly deploy ML systems into production environments
  • Translate complex technical concepts into actionable insights for both technical and non-technical stakeholders
  • Foster a collaborative environment that encourages innovation and the sharing of best practices across teams
  • Present technical solutions and platform roadmaps to leadership and cross-functional stakeholders

Requirements:

  • Bachelor’s degree in computer science, Engineering, Mathematics, or a related field
  • an advanced degree (master’s or Ph.D.) is highly desirable
  • 6+ years of hands-on experience in designing, implementing, and maintaining multi-tenant AIML systems and platforms in production environments
  • 6+ years of experience working with cloud platforms such as Azure and AWS
  • Extensive expertise in designing and deploying large-scale data pipelines and real-time inference systems and managing the end-to-end AI Agent and/or AIML system development lifecycles, including configuration, evaluation, monitoring, observability and AuthN/AuthR considerations
  • 6+ years of experience working with common backend systems & tools (e.g, Kubernetes, Temporal, OpenSearch, PostgreSQL, Redis, Neo4J, etc.)
  • Deep understanding of Docker, container optimization, and multi-stage builds
  • Experience with Prometheus, Grafana, Open Telemetry and distributed tracing
  • 3+ years of experience building front-end web applications using frameworks such as React and/or Next.JS
  • Deep proficiency in programming languages such as Python, Java, Go, etc., with a strong emphasis on coding excellence
  • Proficiency in AIML frameworks such as TensorFlow, PyTorch, Langraph, etc.
  • Demonstrated track record of mentoring engineers and leading technical initiatives
  • Proven ability to tackle complex technical challenges, innovate through hands-on experimentation, and set technical standards
  • Excellent verbal and written communication against audience of diverse seniority levels and professional backgrounds

Nice to have:

  • Deep expertise operating and/or building AI agent platforms & capabilities like Langraph platform, Autogen, N8N, Crew.ai, etc.
  • Experience with LLM observability systems such as Langsmith, Langfuse, Arize Phoenix, etc.
  • Experience building LLM-based AI agent workflows via both no code/low code and traditional high-code development environments
  • Experience utilizing both open source (e.g. llama, Qwen, Mistral) and proprietary (e.g. GPT, Claude) LLMs for appropriate tasks
  • Understanding of AI safety principles, model governance, and regulatory compliance
  • Background in regulated industries with understanding of data privacy requirements and cybersecurity review processes
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff Software Engineer, AI Agent Platform

Staff AI Engineer

As a Staff AI Engineer on our AI Engineering team, you will be responsible for b...
Location
Location
United States
Salary
Salary:
200000.00 - 280000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of software engineering experience with a focus on production systems
  • 1.5+ years of hands-on LLM experience (2023-present) building real applications with GPT, Claude, Llama, or other modern LLMs
  • Demonstrated experience building customer-facing, scalable LLM-powered products with real user usage
  • Experience building multi-step AI agents, LLM chaining, and complex workflow automation
  • Deep understanding of prompting strategies, few-shot learning, chain-of-thought reasoning, and prompt optimization techniques
  • Expert-level Python skills for production AI systems
  • Strong experience building scalable backend systems, APIs, and distributed architectures
  • Experience with LangChain, LlamaIndex, or other LLM application frameworks
  • Proven ability to integrate multiple APIs and services to create advanced AI capabilities
  • Experience deploying and managing AI models in cloud environments (AWS, GCP, Azure)
Job Responsibility
Job Responsibility
  • Design and Deploy Production LLM Systems
  • Create sophisticated AI agents that can chain multiple LLM calls, integrate with external APIs, and maintain state across complex workflows
  • Develop and optimize prompting strategies
  • Build robust APIs and integrate AI capabilities with existing Apollo infrastructure and external services
  • Implement comprehensive evaluation frameworks, A/B testing, and monitoring systems
  • Optimize for cost, latency, and scalability across different LLM providers and deployment scenarios
  • Work closely with product teams, backend engineers, and stakeholders to translate business requirements into technical AI solutions
  • Build sophisticated multi-agent systems that can reason, plan, and execute complex sales workflows
  • Develop systems that maintain conversational context across complex multi-turn interactions
  • Build scalable large language model and agentic platforms
What we offer
What we offer
  • equity
  • company bonus or sales commissions/bonuses
  • 401(k) plan
  • at least 10 paid holidays per year, flex PTO, and parental leave
  • employee assistance program and wellbeing benefits
  • global travel coverage
  • life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
  • Fulltime
Read More
Arrow Right

Sr. Staff Software Engineer - Advanced Analytics Platform

At DISQO, we’re redefining how companies turn data into decisions. Our mission i...
Location
Location
United States , Los Angeles, Glendale
Salary
Salary:
200000.00 - 240000.00 USD / Year
disqo.com Logo
DISQO
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of professional software engineering experience
  • 5+ years architecting or building high-performance data systems or analytics platforms
  • 3+ years of product Rust experience
  • Deep expertise in Rust and strong experience in Java
  • Proven track record building large-scale data analytics or OLAP systems from the ground up
  • Deep understanding of columnar data engines, vectorized execution, and query/dataframe optimization
  • Hands-on experience with performance engineering, profiling, and hardware-aware optimization
  • Strong expertise with AWS - designing, deploying, and optimizing large-scale data and compute systems in the cloud
  • A systems-thinking mindset
  • Thrives in a fast-moving, startup environment
Job Responsibility
Job Responsibility
  • Architect and deliver a high-performance Advanced Analytics Engine
  • Design and build an Agentic AI system that leverages this Advanced Analytics Engine
  • Partner with product, engineering and data teams to power agentic AI analytics systems
  • Profile, benchmark, and optimize Rust components
  • Leverage AWS cloud services to architect scalable, reliable, and cost-efficient analytics infrastructure
  • Shape the evolution of DISQO’s broader data platform and its integration across our product ecosystem
  • Mentor and guide engineers
  • Contribute to open-source or internal frameworks that advance analytical systems and distributed computation
What we offer
What we offer
  • 100% covered Medical/Dental/Vision for employee
  • Equity
  • 401K
  • Generous PTO policy
  • Flexible workplace policy
  • Team offsites, social events & happy hours
  • Life Insurance
  • Health FSA
  • Commuter FSA (for hybrid employees)
  • Catered lunch and fully stocked kitchen
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Core AI

As a Staff AI Engineer on our Core AI team, you will be a cornerstone of FloQast...
Location
Location
United States , San Jose
Salary
Salary:
164000.00 - 246000.00 USD / Year
floqast.com Logo
FloQast
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of professional software engineering experience
  • 4+ years focused on building backend for production applications
  • Mastery of Python
  • Familiarity with some AI application frameworks, context engineering, and scalable system design for AI products
  • Expertise in designing products that integrate with multiple technologies, APIs, and data sources in cloud-native environments (AWS preferred)
  • Strong desire to develop deep hands-on experience with LLM APIs, retrieval-augmented generation (RAG), conversational AI, document processing, and MCP integrations
  • Proven ability to lead tech product initiatives, establish technical standards and communicate complex system designs to both technical and business stakeholders
Job Responsibility
Job Responsibility
  • Architect and lead development of production AI products including intelligent chatbots, document processing systems, and agentic workflows using Python and modern AI frameworks
  • Design and implement our centralized AI platform including model routing, provider management, vector search, and AI application frameworks with seamless MCP (Model Context Protocol) integrations
  • Build scalable AI products that integrate with diverse technologies including accounting systems, document repositories, and external APIs while maintaining robust monitoring and observability
  • Master context engineering and system design for AI applications, ensuring optimal information retrieval, context assembly, and multi-turn conversation management
  • Collaborate with Product, Engineering, and Security teams to ensure AI products are robust, compliant, and aligned with business objectives in the regulated accounting space
  • Provide technical leadership and mentorship to the growing AI team, establishing best practices for AI product development, deployment, and governance
What we offer
What we offer
  • Medical
  • Dental
  • Vision
  • Family Forming benefits
  • Life & Disability Insurance
  • Unlimited Vacation
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer

Help design our AI platform and develop our next generation of machine learning ...
Location
Location
United States , San Francisco
Salary
Salary:
216500.00 - 324500.00 USD / Year
gofundme.com Logo
GoFundMe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 9+ years of hands-on experience in machine learning engineering, AI development, software engineering, or related fields
  • Experience emphasizing secure, large-scale, distributed system design, AI/ML pipeline development, and implementation
  • Extensive experience designing, developing, and operating scalable backend systems
  • Experience applying software engineering best practices such as domain-driven design, event-driven architectures, and microservices
  • Deep expertise in agentic workflows, AI evaluation solutions, prompt management, and secure AI development and testing practices
  • Strong knowledge of relational and document-based databases, data storage paradigms, and efficient RESTful API design
  • Experience establishing robust CI/CD pipelines, automated testing (unit and integration), and deployment practices
  • Strong leadership skills, including effective planning and management of complex projects, mentoring of team members, and fostering a collaborative, high-performing engineering culture
  • Excellent communicator, able to articulate complex technical concepts clearly to both technical and non-technical stakeholders
  • Bachelor's degree in Computer Science, Software Engineering, or a related technical field (preferred)
Job Responsibility
Job Responsibility
  • Design and implement AI platforms to enable scalable and secure access to LLMs from multiple model providers for diverse use cases
  • Design and implement agentic workflows, agentic tool ecosystems, and LLM prompt management solutions
  • Design, build, and optimize scalable model training, fine tuning, and inference pipelines, ensuring robust integration with production systems
  • Influence technical strategy and approach to developing embedding stores, vector databases, and other reusable assets
  • Lead initiatives to streamline ML and AI workflows, improve operational efficiency, and establish standardized procedures to achieve consistent, high-quality results across our AI systems
  • Design and develop backend services and RESTful APIs using Python and FastAPI, integrating seamlessly with ML pipelines and services
  • Take operational responsibility for team-owned services, including performance monitoring, optimization, troubleshooting, and participation in an on-call rotation
  • Collaborate with both technical and non-technical colleagues, including data and applied scientists, software engineers, product managers, and business stakeholders, to deliver reliable and scalable ML-driven products
  • Coach and mentor fellow ML engineers, promoting a culture of collaboration, continuous improvement, and engineering excellence within the team
  • Employ a diverse set of tools and platforms including Python, AWS, Databricks, Docker, Kubernetes, FastAPI, Terraform, Snowflake, Coralogix, and GitHub to build, deploy, and maintain scalable, highly available machine learning infrastructure
What we offer
What we offer
  • Competitive pay
  • Comprehensive healthcare benefits
  • Financial assistance for things like hybrid work, family planning
  • Generous parental leave
  • Flexible time-off policies
  • Mental health and wellness resources
  • Learning, development, and recognition programs
  • Fulltime
Read More
Arrow Right

Software Engineer

Pomelo Care is a multi-disciplinary team of clinicians, engineers and problem so...
Location
Location
United States , New York City
Salary
Salary:
135000.00 - 190000.00 USD / Year
pomelocare.com Logo
Pomelo Care
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong foundation in computer science, typically from a bachelor's or master's degree, or equivalent professional experience
  • At least two years of experience building and shipping high-quality software using languages like Java, C#, Kotlin, Python, Go, etc.
  • Experience developing applications with modern front-end frameworks like React or Vue.js
  • Experience delivering and maintaining software systems in a production environment
  • Keeping up with trends in LLM coding agents to multiply your engineering impact
  • Enjoy tackling complex problems and avoid unnecessary complexity
Job Responsibility
Job Responsibility
  • Write clean, elegant code to ship new features and keep our platform humming
  • Collaborate on designing and implementing new systems that enable our clinical team to deliver personalized, data-driven care
  • Contribute to our new AI-powered tools, building agents that help our clinicians work more effectively and focus on the patient in front of them
  • Enhance our mobile experience, ensuring parents and families feel connected and supported throughout their journey
  • Work closely with product managers, designers and clinical staff to understand user needs and translate them into technical solutions
What we offer
What we offer
  • Competitive healthcare benefits
  • Generous equity compensation
  • Unlimited vacation
  • Membership in the First Round Network (a curated and confidential community with events, guides, thousands of Q&A questions, and opportunities for 1-1 mentorship)
  • Fulltime
Read More
Arrow Right

Software Engineer

Pomelo Care is a multi-disciplinary team of clinicians, engineers and problem so...
Location
Location
United States , San Francisco
Salary
Salary:
135000.00 - 190000.00 USD / Year
pomelocare.com Logo
Pomelo Care
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong foundation in computer science, typically from a bachelor's or master's degree, or equivalent professional experience
  • At least two years of experience building and shipping high-quality software using languages like Java, C#, Kotlin, Python, Go, etc.
  • Experience developing applications with modern front-end frameworks like React or Vue.js
  • Experience delivering and maintaining software systems in a production environment
  • Keeping up with trends in LLM coding agents to multiply your engineering impact
  • Enjoy tackling complex problems and avoid unnecessary complexity
Job Responsibility
Job Responsibility
  • Write clean, elegant code to ship new features and keep our platform humming
  • Collaborate on designing and implementing new systems that enable our clinical team to deliver personalized, data-driven care
  • Contribute to our new AI-powered tools, building agents that help our clinicians work more effectively and focus on the patient in front of them
  • Enhance our mobile experience, ensuring parents and families feel connected and supported throughout their journey
  • Work closely with product managers, designers and clinical staff to understand user needs and translate them into technical solutions
What we offer
What we offer
  • Competitive healthcare benefits
  • Generous equity compensation
  • Unlimited vacation
  • Membership in the First Round Network (a curated and confidential community with events, guides, thousands of Q&A questions, and opportunities for 1-1 mentorship)
  • Fulltime
Read More
Arrow Right

Software Engineer

Pomelo Care is a multi-disciplinary team of clinicians, engineers and problem so...
Location
Location
United States
Salary
Salary:
135000.00 - 190000.00 USD / Year
pomelocare.com Logo
Pomelo Care
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong foundation in computer science, typically from a bachelor's or master's degree, or equivalent professional experience
  • At least two years of experience building and shipping high-quality software using languages like Java, C#, Kotlin, Python, Go, etc.
  • Experience developing applications with modern front-end frameworks like React or Vue.js
  • Experience delivering and maintaining software systems in a production environment
  • Keeping up with trends in LLM coding agents
Job Responsibility
Job Responsibility
  • Write clean, elegant code to ship new features and keep our platform humming
  • Collaborate on designing and implementing new systems that enable our clinical team to deliver personalized, data-driven care
  • Contribute to our new AI-powered tools, building agents that help our clinicians work more effectively
  • Enhance our mobile experience, ensuring parents and families feel connected and supported throughout their journey
  • Work closely with product managers, designers and clinical staff to understand user needs and translate them into technical solutions
What we offer
What we offer
  • Competitive healthcare benefits
  • Generous equity compensation
  • Unlimited vacation
  • Membership in the First Round Network (a curated and confidential community with events, guides, thousands of Q&A questions, and opportunities for 1-1 mentorship)
  • Fulltime
Read More
Arrow Right
New

Principal Software Engineer, AI Developer Tools

At Docker, we make app development easier so developers can focus on what matter...
Location
Location
United States , Seattle
Salary
Salary:
232000.00 - 319000.00 USD / Year
docker.com Logo
Docker
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years software engineering experience with 3+ years in Staff or Principal Engineer roles
  • Deep expertise in AI/ML technologies with hands-on production experience building LLM-powered applications, AI agents, or AI-assisted developer tools
  • Strong understanding of LLM APIs (OpenAI, Anthropic, etc.), prompt engineering, agent orchestration frameworks, and practical applications of AI in software development workflows
  • Proven track record of architecting and building highly scalable distributed systems and developer-facing platforms
  • Production experience with modern cloud-native infrastructure including Kubernetes, GitOps deployment patterns, observability systems, and CI/CD pipelines
  • Proficiency in Go (preferred), Rust, Java, or Python with strong software engineering fundamentals
  • Experience designing developer tools, platform engineering systems, or internal tools that enable other teams
  • Exceptional product and platform mindset considering business outcomes, developer experience, and technical trade-offs
  • Strong communication skills with ability to influence technical and non-technical stakeholders across the organization
  • Track record of technical mentorship and elevating engineering teams' capabilities
Job Responsibility
Job Responsibility
  • Define the long-term technical vision and architecture for AI-powered developer tools and the self-service platform that enables teams to build their own AI agents
  • Establish architectural patterns, technical standards, and best practices for LLM integration, AI agent development, and production AI systems serving developers
  • Lead technical strategy for platform capabilities including deployment frameworks (ArgoCD/GitOps), observability integration (Grafana), security controls, and operational tooling for AI developer tools
  • Design highly available, scalable infrastructure for hosting AI agents and developer tools with predictable performance and intelligent resource management
  • Drive technical decisions on AI technology choices, LLM provider strategies, prompt engineering approaches, and agent orchestration frameworks
  • Partner with Senior Manager and product leadership to align technical architecture with business objectives and productization opportunities
  • Architect and build production-ready AI agents for developer productivity including code review assistants, test generators, deployment diagnostics, and incident response automation
  • Design and implement the self-service platform infrastructure that reduces time-to-production for new AI tools from weeks to days
  • Build systems that accelerate adoption of AI-native development tools (Claude Code, Cursor, Warp) across Docker's engineering organization
  • Establish reliability, security, and performance standards for AI systems including SLOs, monitoring, incident response, and cost management
What we offer
What we offer
  • Freedom & flexibility
  • fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup
  • we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
  • Fulltime
Read More
Arrow Right