Principal Engineer – GenAI & Applied AI

India, Pune Employment contract · Job Posted June 03, 2026

Job Description

We are seeking a hands-on GenAI engineering leader to design and deliver agentic systems, RAG pipelines, and LLM-powered applications. You will be the go-to authority for applied AI architectures that move from prototype to enterprise production.

Job Responsibility

  • Architect agentic workflows, RAG systems, and conversational AI solutions
  • Build accelerators and reusable components on frameworks like LangChain, LangGraph, and Haystack
  • Integrate LLMs with vector databases, APIs, and enterprise data sources
  • Implement AI evaluation, prompt optimization, and safety guardrails
  • Mentor engineers, review code, and set AI development best practices

Requirements

  • 8+ years in software/AI engineering, with 3+ in LLMs/GenAI
  • Deep expertise in Python, orchestration frameworks, vector search
  • Strong exposure to cloud AI services (AWS Bedrock, Azure AI Foundry, GCP Vertex AI)
  • Track record of shipping production-grade AI systems

Similar Jobs for Principal Engineer – GenAI & Applied AI

Principal Applied Researcher AI/NLP

At PointClickCare our mission is simple: to help providers deliver exceptional c...
Location
United States
Salary:
195800.00 - 217500.00 USD / Year
PointClickCare
Expiration Date
Until further notice
Requirements
  • PhD or comparable level of experience in Computer Science, Math, Physics, Engineering or a related field
  • 4-10+ years of industry experience building solutions in commercial SaaS, including at least 4 years working in applications of NLP, Search or AI/ML technologies for healthcare
  • Strong interest in applying AI/ML/NLP to healthcare related problems and data
  • Expert-level practical, hands-on experience developing and applying a wide range of techniques in Natural Language Processing, including fine tuning of LLMs and other Transformer models, plus one or more additional AI/ML or Search related areas of expertise to solve real-world problems at scale
  • Demonstrated ability to lead and perform research and experimentation to select appropriate approaches, algorithms, evaluation methods, and frameworks, as well as tasks such as feature selection, language modeling, evaluation and fine tuning or training models, applying standard approaches or developing new tools or workflows as needed to meet project requirements
  • Significant experience building and deploying AI/machine learning and NLP models for large-scale SaaS products, including familiarity with industry standard software development concepts such as scaling issues, version control, CI/CD pipelines, and security
  • Solid understanding and experience with transformer models and multiple kinds of NLP and ML models and approaches including logistic regression, random forest, ensemble methods, SVM, KNN, reinforcement learning, and other ML techniques
  • Proficiency in Python and Java required. Proficiency in JavaScript or TypeScript and modern UI frameworks for building prototype or tool front ends desired
  • Proficiency doing data engineering for ML and NLP applications, including exposure to database systems and proficiency with SQL
  • Proficiency building models from big data using modern packages, models and data analysis stacks such as NumPy, SciPy, Pandas, Scikit-learn, PyTorch, Keras, LightGBM, fastText, NLTK, and spaCy. Proficiency fine tuning Hugging Face Transformers required
Job Responsibility
  • You will be applying NLP including GenAI and other AI/ML techniques to develop model systems and solutions, collaborating across functions to scale and integrate advanced solutions into successful end user experiences in large-scale cloud based SaaS production environments for healthcare
  • You will be working with product leaders, clinical informaticists, data scientists, UI/UX researchers and designers, other AI and machine learning and domain experts, engineering teams and others, including work with customers and users who are healthcare professionals
  • Design, build and evaluate solutions that may involve structured or unstructured data including speech or natural language for healthcare use cases, delivering capabilities such as summarization, predictive models, recommenders, semantic search, extraction, classification or other NLP, AI or machine learning based techniques
  • You will be performing research and experimentation to select appropriate approaches, algorithms, evaluation methods and frameworks and doing the R&D to deliver model systems
  • You will perform, oversee and assist in data collection, data cleaning, data analysis, algorithm selection or design, prompt tuning, parameter fine tuning, training, development and evaluation of systems that deliver responsible AI solutions at scale, using existing or developing new tools or workflows as needed
  • As a principal applied researcher, you will bring deep technical expertise and also provide mentorship on advanced AI, NLP, data science, statistical and machine learning methods and technologies, helping the organization develop new capabilities for innovative solutions
  • You will have substantial independence and responsibility from day one
What we offer
  • Benefits starting from Day 1
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more
  • Fulltime

Sr. Distinguished AI Engineer (Agentic AI Platform)

At Capital One, we are creating responsible and reliable AI systems, changing ba...
Location
United States , San Jose, California; San Francisco, California
Salary:
343400.00 - 392000.00 USD / Year
Capital One
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Engineering, or AI plus at least 10 years of experience developing AI and ML algorithms or technologies, or Master's degree plus at least 8 years of experience developing AI and ML algorithms or technologies
  • At least 10 years of experience programming with Python, Go, Scala, or Java
  • 9 years of experience deploying scalable and responsible AI solutions on cloud platforms
  • 2+ years of experience supporting Agentic Frameworks
  • 2+ years of experience with LLMOps
  • 8+ years of experience designing mission-critical machine learning platforms
  • 2+ years of experience architecting, designing, developing, integrating, delivering, and supporting complex AI systems
  • Demonstrated ability to lead and mentor multiple engineering teams and influence cross-functional stakeholders up to the VP level
  • Experience developing AI and ML algorithms or technologies using Python, C++, C#, Java, or Golang
  • Master's degree in Computer Science, Computer Engineering, or relevant technical field
Job Responsibility
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
  • Contribute to the north star platform architecture, continuously publishing and refining living diagrams and canonical APIs
  • Standardize and automate agentic workflows
  • Contribute to crafting an end-to-end GenAI SDK, CLI and starter kits
  • Help bring together a vision of central guardrail services
  • Collaborate with cross-organization architects to drive end-to-end performance
  • Accelerate innovation by incubating proofs of concept and driving RFCs
  • Own central Helm charts, operators and CRDs that auto-scale agents to hit tenant SLAs
  • Coach and evangelize: host architecture office hours, mentor Staff, Principal and Senior engineers, author technical design documents and blogs, and represent Capital One at Tier 1 AI conferences
What we offer
  • Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Comprehensive, competitive, and inclusive set of health, financial and other benefits
  • Fulltime

Principal Software Engineer

CoreAI is at the forefront of Microsoft’s mission to redefine how software is bu...
Location
India , Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Job Responsibility
  • Define, architect and develop Foundry agent platform services and developer experiences in TypeScript, Python, and C#, enabling customers to build, deploy, evaluate, and manage intelligent agents at scale on Microsoft Foundry
  • Champion AI-Native Development by leveraging AI tools across the SDLC, owning AI-generated assets, incorporating Responsible AI practices, and applying engineering health measures to drive continuous improvement
  • Lead API, SDK, CLI, and UI development that delivers intuitive, consistent, and well-documented experiences for building AI agents and integrating with GenAI models
  • Own architecture decisions for complex features including agent orchestration, knowledge integration, tool calling, and multi-turn conversations—ensuring scalability and extensibility
  • Serve as DRI for deployment, monitoring, incident response, and continuous improvement of live site services
  • Lead by example through code reviews, mentoring, and technical leadership—driving engineering excellence and growing talent across the team.
  • Fulltime

Principal Applied AI Engineer

Security represents the most critical priorities for our customers in a world aw...
Location
India , Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Data Science or related technical field
  • Minimum 9+ years of total experience
  • 5+ years of technical engineering experience coding in languages including C#, Java, and Python
  • 5+ years of Data Science experience
  • 3 years of experience with LLMs and open-source GenAI frameworks, such as LangChain, LlamaIndex, Haystack, or equivalents (e.g., Transformers, AutoGen, DSPy), including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and fine-tuning and evaluation
  • Proficiency in writing production-quality software code in one or more modern programming languages (Python, C#)
  • 3+ years of experience developing software systems end-to-end, from design to implementation
  • 2+ years of experience shipping at least 2 large-scale ML/AI-based services or applications on cloud platforms (Azure, AWS, GCP, etc.)
Job Responsibility
  • Design, develop, and deploy end-to-end AI/ML systems, including data ingestion, model training, evaluation, and integration into production environments
  • Build and optimize applications leveraging LLMs and open-source GenAI frameworks such as LangChain, LlamaIndex, Haystack, Transformers, AutoGen, and DSPy
  • Implement advanced GenAI techniques including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and model fine-tuning
  • Write production-grade software in Python and C# or Java, ensuring maintainability, scalability, and performance
  • Collaborate with cross-functional teams to translate business requirements into technical solutions
  • Ship and maintain large-scale AI applications, with a focus on performance monitoring and continuous improvement
  • Conduct rigorous evaluation of AI models using appropriate metrics and benchmarks
  • Optimize models for latency, throughput, and accuracy in real-world scenarios
  • Work closely with data scientists, product managers, and other engineers to drive AI initiatives
  • Stay current with the latest advancements in GenAI, LLMs, and AI frameworks
  • Fulltime

Principal Engineer Applied AI

Are you looking to redefine customer experience by bending the curve of innovati...
Location
India , Bengaluru; Pune
Salary:
Not provided
Barclays
Expiration Date
Until further notice
Requirements
  • A Bachelor’s degree (or above) in computer science, engineering, mathematics, or a related discipline
  • 12+ years of experience in Software Engineering, Data Science, or Analytics with 5+ years of experience in AI/ML engineering or related fields
  • Demonstrated experience designing and implementing highly scalable, production-grade AI/ML platforms and systems
  • Deep understanding of Large Language Models (LLMs), LLM training & fine tuning
  • Hands-on experience with GenAI and agentic AI frameworks such as LangGraph/LangChain, CrewAI, Strands SDK, Google ADK, etc. Familiarity with the MCP and A2A protocols
  • Expertise in various AI/ML techniques, such as deep learning, natural language processing, reinforcement learning, and large language models
  • Proficiency in Python, R, or other programming languages for data analysis and AI/ML development
  • Experience with DevOps and CI/CD tools and practices, such as Git, Jenkins, Docker, Kubernetes, etc.
Job Responsibility
  • Serve as the primary code-level reviewer and hands-on architect for the most complex AI/ML systems and frameworks
  • Define and enforce technology roadmaps, best practices, strategies, and standards for AI/ML adoption across the organization
  • Conduct and guide research on context/prompt engineering techniques to translate successful research into reusable engineering frameworks for the wider team
  • Architect and maintain robust data pipelines and data processing workflows for model training and model tuning, utilizing cloud services for scalability and efficiency
  • Implement and champion MLOps tools and practices, ensuring seamless integration of machine learning models and LLMs into production environments
  • Lead the effort to agentize existing systems (front-end & back-end apps) by incorporating agentic frameworks/stacks and integrating them via the A2A and MCP protocols
  • Partner with cross-functional teams to integrate AI/ML solutions into products and services
  • Mentor and coach junior and senior engineers on advanced AI/ML techniques, best practices, and architecture design.
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution

Senior Principal AI Infrastructure Architect

The Senior Principal AI Infrastructure Architect is a highly skilled and advance...
Location
Italy , Milano
Salary:
Not provided
NTT DATA
Expiration Date
Until further notice
Requirements
  • Significant experience in a consulting, presales or architecture role within a large-scale (preferably multi-national) technology services environment, with a track record of leading AI infrastructure pursuits
  • Demonstrable experience designing and delivering production AI platforms — from single multi-GPU servers through to multi-rack training clusters and inference factories
  • Strong working knowledge of the AI hardware vendor landscape (NVIDIA, AMD, Intel, Dell, HPE, Lenovo, Supermicro, Cisco, Pure, VAST, WEKA, DDN, NetApp) and how to position partner ecosystems competitively
  • Proven ability to translate AI workload requirements (model size, parameter count, sequence length, throughput SLOs, latency targets) into accurate hardware bills of materials and sizing justifications
  • Significant client engagement and consulting experience, including client needs assessment, change management and the ability to identify whitespace for follow-on AI infrastructure and managed-services work
  • Significant business development and presales experience on infrastructure-led deals, ideally including sovereign AI, AI Factory or regulated-industry GenAI programmes
  • Strong understanding of how AI infrastructure integrates with business processes, applications, data platforms and existing enterprise architecture
  • Bachelor's degree or equivalent in Information Technology, Engineering, Computer Science or a related field
  • Deep, hands-on knowledge of AI hardware: GPU and accelerator portfolios (NVIDIA Hopper / Blackwell, AMD MI300/MI325, Intel Gaudi 3, emerging custom silicon), host CPU platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), system topologies (HGX, DGX, MGX, OAM) and how each choice maps to specific AI workloads
  • Strong understanding of AI-class storage: parallel filesystems, all-flash NVMe platforms, S3-class object stores, checkpoint and dataset pipelines and the I/O patterns of large-scale training and inference (VAST, WEKA, DDN EXAScaler, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
Job Responsibility
  • Lead the end-to-end design of large, complex AI infrastructure solutions — covering accelerated compute (NVIDIA H100/H200/B200 and GB200 NVL72, AMD Instinct MI300X/MI325X, Intel Gaudi 3), CPU host platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), high-throughput storage tiers and lossless AI fabric — for enterprise, sovereign AI and AI Factory clients
  • Architect reference designs built on NVIDIA DGX/HGX SuperPOD, Dell AI Factory with NVIDIA, Cisco Nexus HyperFabric AI, HPE / Lenovo / Supermicro accelerated compute and equivalent platforms, balancing single-node performance with cluster-scale efficiency
  • Size and validate GPU clusters against real workloads — foundation-model pre-training, distributed fine-tuning, RAG, real-time and batch inference — using the right combination of NVLink/NVSwitch domains, InfiniBand NDR/XDR or Ultra Ethernet / NVIDIA Spectrum-X fabrics and tiered NVMe and parallel storage (VAST, WEKA, DDN, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
  • Define the supporting datacenter design: high-density power (50–140 kW/rack), direct-to-chip and rear-door liquid cooling, structured cabling for AI fabrics and modular deployment models across on-prem, colo and sovereign-cloud footprints
  • Work closely with the sales team to drive the presales process for AI infrastructure pursuits — client discovery, technical workshops, proposal writing, executive presentations and bid defence
  • Translate clients' AI ambitions and business outcomes into a hardware and platform roadmap, positioning NTT DATA's end-to-end portfolio — silicon, systems, storage, fabric, MLOps stack and managed services — to land service-led AI solutions
  • Lead integration of compute, storage, networking, the AI software stack (CUDA, ROCm, Triton, NIM, NVIDIA AI Enterprise, Run:ai, Slurm, Kubernetes / Kubeflow) and managed-service operating models across multiple domains, delivery units and geographies
  • Build business cases, TCO and unit-economics models (cost per token, cost per training run, GPU-hour economics) and end-to-end transition roadmaps for cloud-to-private AI migrations and sovereign AI deployments
  • Define architectural principles for AI infrastructure — accelerator utilisation, data gravity, multi-tenancy, model lifecycle, energy efficiency — and apply them to influence architectural outcomes and governance
  • Develop As-Is, Vision, FMO and To-Be AI platform architectures, identify gaps and develop transition roadmaps
  • Fulltime

Principal Software Engineering Manager - Data Science & Engineering

The MSRC Data Science team is responsible for building data pipelines, data minin...
Location
United States , Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Leads team on the disciplined use of, and improving artificial intelligence (AI) tools and practices across the software development lifecycle (SDLC)
  • Guides team on proactively taking responsibility for the content of their AI-generated requirements, design documents, code, and other assets, and assisting other members of the team to do the same
  • Leads team on incorporating Responsible AI practices into the SDLC to ensure appropriate controls over AI-generated assets
  • Coaches team on applying SDLC and engineering health measures (e.g., Accelerate, SPACE framework, Engineering System Success Playbook [ESSP]) to guide improvements to processes and practices, especially those involving AI
  • Leads team on experimenting with AI tools and practices to improve their own capabilities, and providing recommendations on how to adopt them to others
  • Reviews debugging tools, tests, logs, telemetry, and other methods, and acts as an expert for others to proactively verify assumptions while developing code before issues occur across products in production
  • Guides team to perform machine learning/data extraction, transformation, and loading (ETL) pipelines (e.g., data collection, cleaning) based on data prepared
  • Guides the architecture of scalable pipelines and datasets
  • Influences the direction of the team
  • Begins to anticipate potential data pipeline issues and provides solutions
  • Fulltime