CrawlJobs Logo

Lead AI Engineer (GenAI Platform Services)

United States, McLean 197300.00 - 245600.00 USD / Year · Job Posted February 06, 2026
Apply Position
Job Link Share

Job Description

Lead AI Engineer (GenAI Platform Services). At Capital One, we are creating responsible and reliable AI systems, changing banking for good. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Job Responsibility

  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
  • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems
  • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One

Requirements

  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies
  • At least 4 years of experience programming with Python, Go, Scala, or Java

Nice to have

  • 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)
  • Experience designing, developing, delivering, and supporting AI services
  • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang
  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost
  • Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production

What we offer

  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Lead AI Engineer (GenAI Platform Services)

8 matching positions

Senior Lead AI Engineer (GenAI Platform Services)

At Capital One, we are creating responsible and reliable AI systems, changing ba...
Location
Location
United States , San Jose, California; McLean, Virginia; New York, New York; Cambridge, Massachusetts
Salary
Salary:
229900.00 - 286200.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
  • At least 6 years of experience programming with Python, Go, Scala, or Java
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.
  • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems
  • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One
What we offer
What we offer
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits
  • Fulltime
Read More
Arrow Right

Lead AI Platform Engineer

Join us as a Lead AI Platform Engineer. Are you passionate about building cuttin...
Location
Location
United Kingdom , Glasgow
Salary
Salary:
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on expertise in designing, building, and maintaining AWS platforms supporting AI workloads (ML, GenAI, agentic systems)
  • Proven experience working across security, governance, architecture, and business stakeholders, navigating complex enterprise environments
  • A product-oriented approach, with a focus on outcomes, scalability, self-service capabilities, continuous evolution, and exceptional user experience
Job Responsibility
Job Responsibility
  • Lead and manage engineering teams, providing technical guidance, mentorship, and support to ensure the delivery of high-quality software solutions
  • Driving technical excellence, fostering a culture of innovation, and collaborating with cross-functional teams to align technical decisions with business objectives
  • Oversee timelines, team allocation, risk management and task prioritization to ensure the successful delivery of solutions within scope, time, and budget
  • Mentor and support team members' professional growth, conduct performance reviews, provide actionable feedback, and identify opportunities for improvement
  • Evaluation and enhancement of engineering processes, tools, and methodologies to increase efficiency, streamline workflows, and optimize team productivity
  • Collaboration with business partners, product managers, designers, and other stakeholders to translate business requirements into technical solutions and ensure a cohesive approach to product development
  • Enforcement of technology standards, facilitate peer reviews, and implement robust testing practices to ensure the delivery of high-quality solutions
What we offer
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Fulltime
Read More
Arrow Right

Sr. Distinguished AI Engineer (Agentic AI Platform)

At Capital One, we are creating responsible and reliable AI systems, changing ba...
Location
Location
United States , San Jose, California; San Francisco, California
Salary
Salary:
343400.00 - 392000.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or AI plus at least 10 years of experience developing AI and ML algorithms or technologies, or Master's degree plus at least 8 years of experience developing AI and ML algorithms or technologies
  • At least 10 years of experience programming with Python, Go, Scala, or Java
  • 9 years of experience deploying scalable and responsible AI solutions on cloud platforms
  • 2+ years of experience supporting Agentic Frameworks
  • 2+ years of experience with LLMOps
  • 8+ years of experience designing mission-critical machine learning platforms
  • 2+ years of experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems
  • Demonstrated ability to lead and mentor multiple engineering teams and influence cross-functional stakeholders up to the VP level
  • Experience developing AI and ML algorithms or technologies using Python, C++, C#, Java, or Golang
  • Master's degree in Computer Science, Computer Engineering, or relevant technical field
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
  • Contribute to the north star platform architecture, continuously publishing and refining living diagrams and canonical APIs
  • Standardizing and automating agentic workflows
  • Contribute to crafting an end to end GenAI SDK, CLI and starter kits
  • Help bring together a vision of central guardrail services
  • Collaborate with cross organization architects to drive end to end performance
  • Accelerate innovation by incubating proof of concepts and driving RFCs
  • Own central Helm charts, operators and CRDs that auto scale agents to hit tenant SLAs
  • Coach and evangelize - hosting architecture office hours, mentoring Staff, Principal and Senior engineers, authoring technical design documents and blogs and representing Capital One at Tier1 AI conferences
What we offer
What we offer
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • comprehensive, competitive, and inclusive set of health, financial and other benefits
  • Fulltime
Read More
Arrow Right

Ai Lead Engineer

Location
Location
United States , Raleigh
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proactive self-starter with excellent interpersonal, communication, and customer service skills
  • Expert-level AI/ML and full-stack development skills, with strong hands-on experience building and integrating backend services and frontend applications using modern frameworks such as Node.js and React. Strong emphasis on clean, maintainable, reproducible, well-tested, and well-documented code
  • Ability to manage multiple tasks and projects simultaneously
  • Collaborative team player with a focus on achieving common goals
  • Meticulous attention to detail
  • Quick learner with a passion for staying current with emerging technologies and industry trends
  • Deep expertise in RAG systems, LLMs, embeddings, vector databases, and AI infrastructure
  • Experience designing semantic retrieval and knowledge platforms, including curated corpora and grounding/citation patterns (e.g., “show your sources” for internal auditability)
  • Experience evaluating AI models for different tasks
  • Strong ability and experience to leverage cloud infrastructure
Read More
Arrow Right

GenAI Lead Engineer

Are you ready to shape the future of banking with cutting-edge AI? At Citi, we'r...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in at least two programming languages (strong preference for Python, with significant experience in Javascript/Typescript and Golang being highly valued)
  • Demonstrated deep hands-on experience in engineering and deploying enterprise-grade solutions that are highly scalable, resilient, and performant
  • Strong theoretical and practical understanding of Large Language Models (LLMs), transformers, agentic frameworks, vector stores, and advanced search algorithms
  • Experience with relevant GenAI/ML frameworks such as LangChain, LangGraph, MLFlow, Spring AI, Spring Boot, and Flask
  • Extensive experience with data analysis and manipulation using tools like SQL and Pandas
  • Proficiency in database technologies including Oracle, Postgres, or MongoDB
  • Proven experience in designing and implementing robust REST and WebSocket APIs
  • Experience with messaging and integration platforms like Kafka or JMS/MQ
  • UI development skills with technologies such as React JS or Streamlit
  • Demonstrated ability to design, develop, and deploy AI/ML/GenAI solutions into production environments (experience with MLOps principles and tools is a significant advantage)
Job Responsibility
Job Responsibility
  • Drive the identification, evaluation, and adoption of emerging GenAI, ML, and traditional AI technologies and tools to develop innovative solutions and enhance existing platforms
  • Lead the end-to-end design, prototyping, and implementation of cutting-edge AI/ML and Generative AI solutions, ensuring they address critical business needs and scale effectively across the enterprise
  • Partner closely with product management, engineering teams, and business stakeholders to deeply understand requirements and translate them into precise technical specifications and actionable roadmaps
  • Provide guidance and mentorship to junior engineers, fostering best practices in AI/ML/GenAI development, deployment, and operational excellence
  • Champion rapid delivery and iterative development, demonstrating adaptability and a willingness to pivot based on feedback and evolving needs, prioritizing value delivery over upfront perfection
  • Lead the development of compelling proof-of-concept projects to validate the feasibility and potential of novel AI/ML/GenAI solutions
  • Actively contribute to the design and development of internal AI/ML/GenAI platforms, frameworks, and shared services
  • Provide expert technical support, troubleshooting, and resolution for AI/ML/GenAI solutions in production environments
  • Fulltime
Read More
Arrow Right

Ai Ops Platform Engineer

Join us as an AI Ops Engineer, to build and run an enterprise AI Factory within ...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • LLMOps / MLOps at production scale, operating the full Generative AI lifecycle including models, prompts and agents, CI/CD pipelines, structured evaluation, drift and hallucination monitoring, and controlled, auditable release processes suitable for banking environments
  • Cloud‑native AI platform engineering on AWS, with hands‑on delivery using services such as Amazon Bedrock for foundation models, agent orchestration patterns, Lambda and Step Functions, alongside demonstrated Python engineering capability and secure microservices and API design
  • AI governance, observability and cost optimisation, embedding governance by design through policy as code, alignment to model risk framework expectations, lifecycle traceability and audit‑ready evidence, supported by SRE‑grade monitoring and ongoing optimisation of token usage and compute cost across AI workloads
Job Responsibility
Job Responsibility
  • Build and run an enterprise AI Factory within our Card Merchant Services organisation, enabling AI‑driven change across the merchant payments lifecycle
  • Accountable for the end‑to‑end operationalisation of AI, spanning model, prompt, and agent lifecycles
  • deployment and monitoring
  • guardrails
  • and cost optimisation, ensuring AI solutions are production‑ready, auditable, compliant, and scalable across merchant payment use cases
  • Accountable for the end‑to‑end engineering of GenAI and ML platforms, embedding governance, observability and operational resilience by design, while enabling teams to deploy and run AI solutions with clarity, assurance and accountability at scale
  • Lead and manage engineering teams, providing technical guidance, mentorship, and support to ensure the delivery of high-quality software solutions
  • Oversee timelines, team allocation, risk management and task prioritization
  • Mentor and support team members' professional growth, conduct performance reviews, provide actionable feedback, and identify opportunities for improvement
  • Evaluation and enhancement of engineering processes, tools, and methodologies
What we offer
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Fulltime
Read More
Arrow Right

GenAI Senior Platform Engineer - Python, VP

Citi's global Innovation Labs is seeking a versatile Senior GenAI Platform Engin...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in the software industry, with a strong emphasis on building enterprise software
  • 6+ years of relevant experience developing and implementing scalable and robust platforms, applications, and services using modern libraries and frameworks (e.g., Python: FastAPI, Flask, Pandas, Scikit-learn, Hugging Face
  • Node.js: Express, NestJS
  • TypeScript)
  • 5+ years of experience delivering complex backend solutions and services (e.g., APIs, microservices) into production
  • Demonstrated experience in managing and implementing successful projects of varying sizes and complexities
  • Proven understanding of Generative AI systems, AIOps, and application monitoring/evaluation
  • Experience with cloud architectures, with specific experience in public cloud offerings
  • Strong passion and proven hands-on experience integrating with AI/ML technologies
  • Experience with software development agents, agile development, CI/CD pipelines, software testing, and code reviews
Job Responsibility
Job Responsibility
  • Lead the design, development, and maintenance of highly complex GenAI platforms, applications, and services using Python, Node.js, and TypeScript
  • Ensure the seamless operation, scalability, and integration of AI capabilities across various Citi business units
  • Engage with data science, technical, and business stakeholders to define and design the overall architecture for key use-cases
  • Drive the deployment of new GenAI products and process improvements, working with internal and external partners to design, validate, and deliver solutions
  • Resolve high-impact technical and business problems, leading projects through in-depth evaluation of complex business processes, system architecture, and industry standards
  • Provide expert guidance and advanced knowledge in modern programming, ensuring platform design adheres to architectural blueprints and best practices for generative models
  • Develop and enforce robust coding standards, testing methodologies, debugging practices, and implementation strategies for enterprise-grade solutions across Python, Node.js, and TypeScript
  • Manage multiple concurrent initiatives and projects of varying sizes and complexity
  • Engage with external vendors and startups for joint initiatives and exploration of new technologies
  • Cultivate a comprehensive understanding of how business, architecture, and infrastructure integrate within the GenAI ecosystem at Citi
What we offer
What we offer
  • Discover the top benefits offered to our global workforce, designed to support your well-being, growth and work-life balance
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, AI Agent Platform

The Geico AI Agent Platform team is seeking an exceptional Staff Software Engine...
Location
Location
United States , Chevy Chase; New York City
Salary
Salary:
115000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, Mathematics, or a related field
  • an advanced degree (master’s or Ph.D.) is highly desirable
  • 6+ years of hands-on experience in designing, implementing, and maintaining multi-tenant AIML systems and platforms in production environments
  • 6+ years of experience working with cloud platforms such as Azure and AWS
  • Extensive expertise in designing and deploying large-scale data pipelines and real-time inference systems and managing the end-to-end AI Agent and/or AIML system development lifecycles, including configuration, evaluation, monitoring, observability and AuthN/AuthR considerations
  • 6+ years of experience working with common backend systems & tools (e.g, Kubernetes, Temporal, OpenSearch, PostgreSQL, Redis, Neo4J, etc.)
  • Deep understanding of Docker, container optimization, and multi-stage builds
  • Experience with Prometheus, Grafana, Open Telemetry and distributed tracing
  • 3+ years of experience building front-end web applications using frameworks such as React and/or Next.JS
  • Deep proficiency in programming languages such as Python, Java, Go, etc., with a strong emphasis on coding excellence
Job Responsibility
Job Responsibility
  • Architect and implement scalable multi-tenant backend systems for building AI agent workflows, including agent configuration, offline evaluation, synthetic data generation, workflow simulation, agent marketplace, etc. using Azure Kubernetes Service (AKS), FastAPI, etc., ensuring economy of scale and control cost of maintenance
  • Collaborate with Design team to architect and implement frontend experiences and workflows for onboarding both technical and non-technical stakeholders, maximizing user adoption and successful AI agent development
  • Develop observability frameworks to ensure 99.9%+ uptime for AI agent platforms through robust monitoring, alerting, and incident response procedures
  • Evaluate and (if desirable) integrate cutting-edge GenAI frameworks, libraries and vendors to maintain a state-of-the-art technology stack, including hybrid cloud solutions with AWS/GCP as backup or specialized use cases
  • Architect and implement scalable, high-performance machine learning platforms and systems capable of processing large data volumes and supporting real-time decision making and workflows
  • Oversee the end-to-end lifecycle of AI agent applications, ensuring robust testing, deployment, and ongoing monitoring
  • Ensure adherence to company production readiness standards, security protocols, and regulatory compliance throughout the development lifecycle
  • Continuously optimize platform performance, reducing latency and improving throughput for AI agent workloads
  • Design and implement backup, recovery, and business continuity plans for hosted platform applications & services
  • Design and maintain robust CI/CD pipelines for ML model deployment using Azure DevOps, GitHub Actions, and MLOps tools
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right