Lead AI Engineer (GenAI Platform Services) Job at Capital One (McLean)

Senior Lead AI Engineer (GenAI Platform Services)

At Capital One, we are creating responsible and reliable AI systems, changing ba...

Location

United States, San Jose, California; McLean, Virginia; New York, New York; Cambridge, Massachusetts

Salary:

229900.00 - 286200.00 USD / Year

Capital One

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 6 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies
At least 6 years of experience programming with Python, Go, Scala, or Java

Job Responsibility

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One
Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
Leverage a broad stack of open source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, vector databases, NeMo Guardrails, PyTorch, and more
Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems
Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One

What we offer

Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)
Comprehensive, competitive, and inclusive set of health, financial and other benefits

Fulltime

Lead AI Platform Engineer

Join us as a Lead AI Platform Engineer. Are you passionate about building cuttin...

Location

United Kingdom, Glasgow

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

Hands-on expertise in designing, building, and maintaining AWS platforms supporting AI workloads (ML, GenAI, agentic systems)
Proven experience working across security, governance, architecture, and business stakeholders, navigating complex enterprise environments
A product-oriented approach, with a focus on outcomes, scalability, self-service capabilities, continuous evolution, and exceptional user experience

Job Responsibility

Lead and manage engineering teams, providing technical guidance, mentorship, and support to ensure the delivery of high-quality software solutions
Drive technical excellence, foster a culture of innovation, and collaborate with cross-functional teams to align technical decisions with business objectives
Oversee timelines, team allocation, risk management and task prioritization to ensure the successful delivery of solutions within scope, time, and budget
Mentor and support team members' professional growth, conduct performance reviews, provide actionable feedback, and identify opportunities for improvement
Evaluate and enhance engineering processes, tools, and methodologies to increase efficiency, streamline workflows, and optimize team productivity
Collaborate with business partners, product managers, designers, and other stakeholders to translate business requirements into technical solutions and ensure a cohesive approach to product development
Enforce technology standards, facilitate peer reviews, and implement robust testing practices to ensure the delivery of high-quality solutions

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime

Sr. Distinguished AI Engineer (Agentic AI Platform)

At Capital One, we are creating responsible and reliable AI systems, changing ba...

Location

United States, San Jose, California; San Francisco, California

Salary:

343400.00 - 392000.00 USD / Year

Capital One

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or AI plus at least 10 years of experience developing AI and ML algorithms or technologies, or Master's degree plus at least 8 years of experience developing AI and ML algorithms or technologies
At least 10 years of experience programming with Python, Go, Scala, or Java
9 years of experience deploying scalable and responsible AI solutions on cloud platforms
2+ years of experience supporting Agentic Frameworks
2+ years of experience with LLMOps
8+ years of experience designing mission-critical machine learning platforms
2+ years of experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems
Demonstrated ability to lead and mentor multiple engineering teams and influence cross-functional stakeholders up to the VP level
Experience developing AI and ML algorithms or technologies using Python, C++, C#, Java, or Golang
Master's degree in Computer Science, Computer Engineering, or relevant technical field

Job Responsibility

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
Contribute to the north star platform architecture, continuously publishing and refining living diagrams and canonical APIs
Standardize and automate agentic workflows
Contribute to crafting an end-to-end GenAI SDK, CLI and starter kits
Help bring together a vision of central guardrail services
Collaborate with cross-organization architects to drive end-to-end performance
Accelerate innovation by incubating proof of concepts and driving RFCs
Own central Helm charts, operators and CRDs that autoscale agents to hit tenant SLAs
Coach and evangelize: host architecture office hours, mentor Staff, Principal and Senior engineers, author technical design documents and blogs, and represent Capital One at Tier 1 AI conferences

What we offer

Performance-based incentive compensation, which may include cash bonus(es) and/or long-term incentives (LTI)
Comprehensive, competitive, and inclusive set of health, financial and other benefits

Fulltime

AI Lead Engineer

Location

United States, Raleigh

Salary:

Not provided

Robert Half

Expiration Date

Until further notice

Requirements

Proactive self-starter with excellent interpersonal, communication, and customer service skills
Expert-level AI/ML and full-stack development skills, with strong hands-on experience building and integrating backend services and frontend applications using modern frameworks such as Node.js and React. Strong emphasis on clean, maintainable, reproducible, well-tested, and well-documented code
Ability to manage multiple tasks and projects simultaneously
Collaborative team player with a focus on achieving common goals
Meticulous attention to detail
Quick learner with a passion for staying current with emerging technologies and industry trends
Deep expertise in RAG systems, LLMs, embeddings, vector databases, and AI infrastructure
Experience designing semantic retrieval and knowledge platforms, including curated corpora and grounding/citation patterns (e.g., “show your sources” for internal auditability)
Experience evaluating AI models for different tasks
Strong experience leveraging cloud infrastructure

GenAI Lead Engineer

Are you ready to shape the future of banking with cutting-edge AI? At Citi, we'r...

Location

India, Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Proficiency in at least two programming languages (strong preference for Python, with significant experience in JavaScript/TypeScript and Golang being highly valued)
Demonstrated deep hands-on experience in engineering and deploying enterprise-grade solutions that are highly scalable, resilient, and performant
Strong theoretical and practical understanding of Large Language Models (LLMs), transformers, agentic frameworks, vector stores, and advanced search algorithms
Experience with relevant GenAI/ML frameworks such as LangChain, LangGraph, MLflow, Spring AI, Spring Boot, and Flask
Extensive experience with data analysis and manipulation using tools like SQL and Pandas
Proficiency in database technologies including Oracle, Postgres, or MongoDB
Proven experience in designing and implementing robust REST and WebSocket APIs
Experience with messaging and integration platforms like Kafka or JMS/MQ
UI development skills with technologies such as React JS or Streamlit
Demonstrated ability to design, develop, and deploy AI/ML/GenAI solutions into production environments (experience with MLOps principles and tools is a significant advantage)

Job Responsibility

Drive the identification, evaluation, and adoption of emerging GenAI, ML, and traditional AI technologies and tools to develop innovative solutions and enhance existing platforms
Lead the end-to-end design, prototyping, and implementation of cutting-edge AI/ML and Generative AI solutions, ensuring they address critical business needs and scale effectively across the enterprise
Partner closely with product management, engineering teams, and business stakeholders to deeply understand requirements and translate them into precise technical specifications and actionable roadmaps
Provide guidance and mentorship to junior engineers, fostering best practices in AI/ML/GenAI development, deployment, and operational excellence
Champion rapid delivery and iterative development, demonstrating adaptability and a willingness to pivot based on feedback and evolving needs, prioritizing value delivery over upfront perfection
Lead the development of compelling proof-of-concept projects to validate the feasibility and potential of novel AI/ML/GenAI solutions
Actively contribute to the design and development of internal AI/ML/GenAI platforms, frameworks, and shared services
Provide expert technical support, troubleshooting, and resolution for AI/ML/GenAI solutions in production environments

Fulltime

AI Ops Platform Engineer

Join us as an AI Ops Engineer, to build and run an enterprise AI Factory within ...

Location

United Kingdom, London

Salary:

Not provided

Barclays

Expiration Date

Until further notice

Requirements

LLMOps / MLOps at production scale, operating the full Generative AI lifecycle including models, prompts and agents, CI/CD pipelines, structured evaluation, drift and hallucination monitoring, and controlled, auditable release processes suitable for banking environments
Cloud‑native AI platform engineering on AWS, with hands‑on delivery using services such as Amazon Bedrock for foundation models, agent orchestration patterns, Lambda and Step Functions, alongside demonstrated Python engineering capability and secure microservices and API design
AI governance, observability and cost optimisation, embedding governance by design through policy as code, alignment to model risk framework expectations, lifecycle traceability and audit‑ready evidence, supported by SRE‑grade monitoring and ongoing optimisation of token usage and compute cost across AI workloads

Job Responsibility

Build and run an enterprise AI Factory within our Card Merchant Services organisation, enabling AI‑driven change across the merchant payments lifecycle
Accountable for the end‑to‑end operationalisation of AI, spanning model, prompt, and agent lifecycles; deployment and monitoring; guardrails; and cost optimisation, ensuring AI solutions are production‑ready, auditable, compliant, and scalable across merchant payment use cases
Accountable for the end‑to‑end engineering of GenAI and ML platforms, embedding governance, observability and operational resilience by design, while enabling teams to deploy and run AI solutions with clarity, assurance and accountability at scale
Lead and manage engineering teams, providing technical guidance, mentorship, and support to ensure the delivery of high-quality software solutions
Oversee timelines, team allocation, risk management and task prioritization
Mentor and support team members' professional growth, conduct performance reviews, provide actionable feedback, and identify opportunities for improvement
Evaluate and enhance engineering processes, tools, and methodologies

What we offer

Competitive holiday allowance
Life assurance
Private medical care
Pension contribution

Fulltime

GenAI Senior Platform Engineer - Python, VP

Citi's global Innovation Labs is seeking a versatile Senior GenAI Platform Engin...

Location

Canada, Mississauga

Salary:

120800.00 - 170800.00 USD / Year

Citi

Expiration Date

Until further notice

Requirements

7+ years of experience in the software industry, with a strong emphasis on building enterprise software
6+ years of relevant experience developing and implementing scalable and robust platforms, applications, and services using modern libraries and frameworks (e.g., Python: FastAPI, Flask, Pandas, Scikit-learn, Hugging Face; Node.js: Express, NestJS; TypeScript)
5+ years of experience delivering complex backend solutions and services (e.g., APIs, microservices) into production
Demonstrated experience in managing and implementing successful projects of varying sizes and complexities
Proven understanding of Generative AI systems, AIOps, and application monitoring/evaluation
Experience with cloud architectures, with specific experience in public cloud offerings
Strong passion and proven hands-on experience integrating with AI/ML technologies
Experience with software development agents, agile development, CI/CD pipelines, software testing, and code reviews

Job Responsibility

Lead the design, development, and maintenance of highly complex GenAI platforms, applications, and services using Python, Node.js, and TypeScript
Ensure the seamless operation, scalability, and integration of AI capabilities across various Citi business units
Engage with data science, technical, and business stakeholders to define and design the overall architecture for key use-cases
Drive the deployment of new GenAI products and process improvements, working with internal and external partners to design, validate, and deliver solutions
Resolve high-impact technical and business problems, leading projects through in-depth evaluation of complex business processes, system architecture, and industry standards
Provide expert guidance and advanced knowledge in modern programming, ensuring platform design adheres to architectural blueprints and best practices for generative models
Develop and enforce robust coding standards, testing methodologies, debugging practices, and implementation strategies for enterprise-grade solutions across Python, Node.js, and TypeScript
Manage multiple concurrent initiatives and projects of varying sizes and complexity
Engage with external vendors and startups for joint initiatives and exploration of new technologies
Cultivate a comprehensive understanding of how business, architecture, and infrastructure integrate within the GenAI ecosystem at Citi

What we offer

Discover the top benefits offered to our global workforce, designed to support your well-being, growth and work-life balance

Fulltime

Staff Software Engineer, AI Agent Platform

The Geico AI Agent Platform team is seeking an exceptional Staff Software Engine...

Location

United States, Chevy Chase; New York City

Salary:

115000.00 - 260000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field; an advanced degree (master’s or Ph.D.) is highly desirable
6+ years of hands-on experience in designing, implementing, and maintaining multi-tenant AIML systems and platforms in production environments
6+ years of experience working with cloud platforms such as Azure and AWS
Extensive expertise in designing and deploying large-scale data pipelines and real-time inference systems and managing the end-to-end AI Agent and/or AIML system development lifecycles, including configuration, evaluation, monitoring, observability and AuthN/AuthR considerations
6+ years of experience working with common backend systems and tools (e.g., Kubernetes, Temporal, OpenSearch, PostgreSQL, Redis, Neo4j)
Deep understanding of Docker, container optimization, and multi-stage builds
Experience with Prometheus, Grafana, OpenTelemetry and distributed tracing
3+ years of experience building front-end web applications using frameworks such as React and/or Next.js
Deep proficiency in programming languages such as Python, Java, Go, etc., with a strong emphasis on coding excellence

Job Responsibility

Architect and implement scalable multi-tenant backend systems for building AI agent workflows, including agent configuration, offline evaluation, synthetic data generation, workflow simulation, and an agent marketplace, using technologies such as Azure Kubernetes Service (AKS) and FastAPI, while ensuring economies of scale and controlling maintenance costs
Collaborate with Design team to architect and implement frontend experiences and workflows for onboarding both technical and non-technical stakeholders, maximizing user adoption and successful AI agent development
Develop observability frameworks to ensure 99.9%+ uptime for AI agent platforms through robust monitoring, alerting, and incident response procedures
Evaluate and (if desirable) integrate cutting-edge GenAI frameworks, libraries and vendors to maintain a state-of-the-art technology stack, including hybrid cloud solutions with AWS/GCP as backup or for specialized use cases
Architect and implement scalable, high-performance machine learning platforms and systems capable of processing large data volumes and supporting real-time decision making and workflows
Oversee the end-to-end lifecycle of AI agent applications, ensuring robust testing, deployment, and ongoing monitoring
Ensure adherence to company production readiness standards, security protocols, and regulatory compliance throughout the development lifecycle
Continuously optimize platform performance, reducing latency and improving throughput for AI agent workloads
Design and implement backup, recovery, and business continuity plans for hosted platform applications & services
Design and maintain robust CI/CD pipelines for ML model deployment using Azure DevOps, GitHub Actions, and MLOps tools

What we offer

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Supports flexibility: we provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Fulltime
