CrawlJobs Logo

Lead AI/ML Engineer

India, Hyderabad & Pune · Job Posted March 04, 2026
Apply Position
Job Link Share

Job Responsibility

  • Design, implement, and optimize end-to-end ML training workflows including infrastructure setup, orchestration, fine-tuning, deployment, and monitoring
  • Evaluate and integrate multi-cloud and single-cloud training options across AWS and other major platforms
  • Lead cluster configuration, orchestration design, environment customization, and scaling strategies
  • Compare and recommend hardware options (GPUs, TPUs, accelerators) based on performance, cost, and availability

Requirements

  • Experience with cloud-based platforms (AWS, Azure), API integrations, and data models
  • Exposure to AI/ML-enabled platforms or decision-intelligence systems
  • Certifications: CBAP / PMI-PBA / Agile BA / SAFe Product Owner / Scrum Master
  • Experience in stakeholder training, change management, or workshop facilitation
  • At least 4-5 years in AI/ML infrastructure and large-scale training environments
  • Expert in AWS cloud services (EC2, S3, EKS, SageMaker, Batch, FSx, etc.) and familiar with Azure, GCP, and hybrid/multi-cloud setups
  • Strong knowledge of AI/ML training frameworks (PyTorch, TensorFlow, Hugging Face, DeepSpeed, Megatron, Ray, etc.)
  • Proven experience with cluster orchestration tools (Kubernetes, Slurm, Ray, SageMaker, Kubeflow)
  • Deep understanding of hardware architectures for AI workloads (NVIDIA, AMD, Intel Habana, TPU)
  • Expert knowledge of inference optimization techniques including speculative decoding, KV cache optimization (MQA/GQA/PagedAttention), and dynamic batching
  • Deep understanding of prefill vs decode phases, memory-bound vs compute-bound operations
  • Experience with quantization methods (INT4/INT8, GPTQ, AWQ) and model parallelism strategies
  • Hands-on experience with production inference engines: vLLM, TensorRT-LLM, DeepSpeed-Inference, or TGI
  • Proficiency with serving frameworks: Triton Inference Server, KServe, or Ray Serve
  • Familiarity with kernel optimization libraries (FlashAttention, xFormers)
  • Proven ability to optimize inference metrics: TTFT (first token latency), ITL (inter-token latency), and throughput
  • Experience profiling and resolving GPU memory bottlenecks and OOM issues
  • Knowledge of hardware-specific optimizations for modern GPU architectures (A100/H100)
  • Drive end-to-end fine-tuning of LLMs, including model selection, dataset preparation/cleaning, tokenization, and evaluation with baseline metrics
  • Configure and execute fine-tuning experiments (LoRA, QLoRA, etc.) on large-scale compute setups, ensuring optimal hyperparameter tuning, logging, and checkpointing
  • Document fine-tuning outcomes by capturing performance metrics (losses, BERT/ROUGE scores, training time, resource utilization) and benchmark against baseline models

What we offer

  • Opportunities for continuous learning and certification support
  • Collaborative and growth-oriented work culture
  • Competitive compensation and comprehensive benefits
  • Exposure to modern cloud and integration technologies

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Lead AI/ML Engineer

8 matching positions

Senior AI/ML Lead Engineer

As the Senior AI/ML Lead Engineer, you will spearhead the development and deploy...
Location
Location
United States , Dayton
Salary
Salary:
Not provided
altamiracorp.com Logo
Altamira Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or above in Computer Science, AI, or a related quantitative field
  • 7+ years of professional experience in machine learning
  • at least 3 years in a leadership or "tech lead" capacity
  • Proven track record in developing computer vision models for object tracking (e.g., CNNs, Transformers) and real-time video analytics
  • Expert-level proficiency in Python
  • familiarity with backend deployment tools (Docker, Kubernetes)
  • Experience overseeing the full data lifecycle, from acquisition and cleaning to high-fidelity labeling for specialized collections
  • Ability to articulate complex AI concepts to non-technical stakeholders and executive leadership
  • Must be a US citizen and hold a current Secret clearance or higher
Job Responsibility
Job Responsibility
  • Design and implement scalable, high-performance infrastructures for real-time object detection and tracking using frameworks like PyTorch or TensorFlow
  • Develop automated triage systems that prioritize and categorize incoming sensor or project data, ensuring critical events are escalated instantly
  • Architect multi-agent or agentic workflows to synchronize data collection efforts across various projects, optimizing resource allocation
  • Lead a high-performing team of engineers, conducting code reviews and setting engineering standards for MLOps and production pipelines
  • Collaborate with project managers to translate complex client requirements into actionable AI/ML roadmaps
  • Fulltime
Read More
Arrow Right

Lead AI/ML Engineer – Regulatory Reporting

We are establishing a specialized AI/ML team in Mumbai to modernize our Regulato...
Location
Location
India , Mumbai
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years in Data Science/Engineering
  • Deep experience with Scikit-learn, TensorFlow, or PyTorch
  • Hands-on experience with LLM orchestration frameworks (LangChain, LlamaIndex) and Vector Databases (Pinecone, Milvus, or pgvector)
  • Experience building tools like Streamlit or Gradio for rapid prototyping of human-review interfaces
  • Experience in Financial Services (specifically Fraud Detection, AML, or Risk Modeling)
  • Able to communicate and explain 'Hallucination Risk' to a non-technical Chief Risk Officer
Job Responsibility
Job Responsibility
  • Build unsupervised and semi-supervised ML models (e.g., Isolation Forests, Autoencoders) to scan millions of transactional records for outliers
  • Go beyond simple 'threshold checks' to detect complex patterns
  • Reduce false positives to ensure the Reporting Team trusts the model alerts
  • Design RAG (Retrieval-Augmented Generation) pipelines to 'chat' with unstructured data (Credit Agreements, Loan Docs) and extract key regulatory attributes (Maturity Dates, Collateral Clauses)
  • Build 'Agentic' workflows where GenAI proactively suggests mapping logic or identifies the root cause of a break, requiring only a 'thumbs up/down' from the human SME
  • Build 'Explainability' (XAI) into every model
  • Create Validation Interfaces: Build simple UIs (using Streamlit or React) where business users can see the Model's Prediction side-by-side with the Source Document to rapidly approve/reject the finding
  • Work with Model Risk Management (MRM) to establish a 'fast-track' validation framework for non-deterministic GenAI models
  • Hire and mentor a squad of 4-5 junior data scientists/engineers in Mumbai
  • Act as the 'AI Evangelist' to the Operations/Finance teams, demonstrating how AI assists them rather than replacing them
  • Fulltime
Read More
Arrow Right

Solution Engineer II (Ai/Ml Lead)

Job Description: Principal Accountabilities Collaborate with teams to translat...
Location
Location
India , Noida
Salary
Salary:
Not provided
arrow.com Logo
Arrow Electronics
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.Tech/M.Tech or Ph.D. in Computer Science, Electronics, or related engineering domain
  • Typically requires 8–12 years of equivalent work experience
  • 3–5 years of experience in machine learning, deep learning, and computer vision
  • Proven track record of designing and deploying ML-based systems from concept to production
  • Academic publications in computer vision research at top conferences and journals
  • Excellent communication, problem-solving, and presentation skills.
Job Responsibility
Job Responsibility
  • Collaborate with teams to translate business requirements into technical specifications, system architecture, and ML pipelines
  • Drive end-to-end solution delivery — including data preparation, model development, optimization, validation, deployment, and continuous improvement
  • Provide technical guidance and mentorship to junior engineers and data scientists
  • review and refine their designs and code implementations
  • Develop reusable ML frameworks, model training workflows, and inference pipelines for rapid prototyping and deployment
  • Evaluate and integrate state-of-the-art AI/ML technologies to continuously improve model efficiency and system design
  • Respond to client RFQs and provide robust technical proposals and solution architectures
  • Partner cross-functionally with system engineers, embedded developers, and application teams for integrated AI system delivery
  • Mentor 2–5 member AI engineering team for full-cycle ML product development
  • Architect, implement, and optimize AI models for edge computing platforms ensuring high throughput, accuracy and low latency
  • Fulltime
Read More
Arrow Right

Lead Data Engineer - AI/ML

The Lead Data Engineer will be part of a team building Stanford Health Care's (S...
Location
Location
United States of America , Palo Alto
Salary
Salary:
94.35 - 125.03 USD / Hour
stanfordhealthcare.org Logo
Stanford Health Care
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related, or equivalent working experience
  • 5+ years experience in building data infrastructure for analytics teams, including ability to write code in SQL, R, or Python for processing large datasets in distributed cloud environments
  • Experience with cloud deployment strategies and CI/CD
  • Experience building and working with data infrastructure in a SaaS environment
  • Experience overseeing, developing or implementing machine learning operations (MLOps) processes
  • Experience mentoring junior engineers and enforcing best practices around code quality
  • Knowledge of multiple programming languages, commitment to choosing languages based on project-specific requirements, and willingness to learn new programming languages as necessary
  • Knowledge of resource management and automation approaches such as workflow runners
  • Collaborative mentality and excitement for iterative design working closely with the Data Science team.
Job Responsibility
Job Responsibility
  • Build end-to-end data pipelines and infrastructure for ML models used by the Data Science team and others at SHC
  • Understand the requirements of data processing and analysis pipelines and make appropriate technical design and interface decisions
  • Understand data flows among the SHC applications and use this knowledge to make recommendations and design decisions for languages, tools, and platforms used in software and data projects
  • Troubleshoot and debug environment and infrastructure problems found in production and non-production environments for projects by the Data Science Team
  • Work with other groups at SHC and the Technology and Digital Solutions (TDS) group to ensure servers and system maintenance based on updates, system requirements, data usage, and security requirements.
  • Fulltime
Read More
Arrow Right

Senior Data & AI/ML Engineer - GCP Specialization Lead

We are on a bold mission to create the best software services offering in the wo...
Location
Location
United States , Menlo Park
Salary
Salary:
Not provided
techjays.com Logo
techjays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • GCP Services: BigQuery, Dataflow, Pub/Sub, Vertex AI
  • ML Engineering: End-to-end ML pipelines using Vertex AI / Kubeflow
  • Programming: Python & SQL
  • MLOps: CI/CD for ML, Model deployment & monitoring
  • Infrastructure-as-Code: Terraform
  • Data Engineering: ETL/ELT, real-time & batch pipelines
  • AI/ML Tools: TensorFlow, scikit-learn, XGBoost
  • Min Experience: 10+ Years
Job Responsibility
Job Responsibility
  • Design and implement data architectures for real-time and batch pipelines, leveraging GCP services such as BigQuery, Dataflow, Dataproc, Pub/Sub, Vertex AI, and Cloud Storage
  • Lead the development of ML pipelines, from feature engineering to model training and deployment using Vertex AI, AI Platform, and Kubeflow Pipelines
  • Collaborate with data scientists to operationalize ML models and support MLOps practices using Cloud Functions, CI/CD, and Model Registry
  • Define and implement data governance, lineage, monitoring, and quality frameworks
  • Build and document GCP-native solutions and architectures that can be used for case studies and specialization submissions
  • Lead client-facing PoCs or MVPs to showcase AI/ML capabilities using GCP
  • Contribute to building repeatable solution accelerators in Data & AI/ML
  • Work with the leadership team to align with Google Cloud Partner Program metrics
  • Mentor engineers and data scientists toward achieving GCP certifications, especially in Data Engineering and Machine Learning
  • Organize and lead internal GCP AI/ML enablement sessions
What we offer
What we offer
  • Best in class packages
  • Paid holidays and flexible paid time away
  • Casual dress code & flexible working environment
  • Medical Insurance covering self & family up to 4 lakhs per person
Read More
Arrow Right

Senior Engineer / Lead Engineer - Virtual Engineering - AI CAE

Senior Engineer / Lead Engineer – AI CAE will Drive AI innovation in CAE analysi...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Masters Degree Mechanical/Automobile/Production /Mechatronics Engineering discipline or similar.
  • 5+ years experience in CAE at Automotive Product Development / Manufacturing Engineering.
  • 3+ years' experience in implementing AI solutions in CAE
  • Should have executed at least 5+ years of Core CAE domain (from problem definition to deployment) experience.
  • Strong programming skills in Python, MATLAB, CAE tool-specific APIs (Altair suite, NASTRAN, ANSYS APDL, Abaqus etc.), workflow automation.
  • Experience with ML frameworks like Pytorch, TensorFlow.
  • Understanding of data annotation tools and MLOps workflows.
  • Experience in data handling and feature engineering.
  • Strong problem-solving and analytical mindset.
  • Experience in domain-specific AI use cases (manufacturing, automotive, etc.).
Job Responsibility
Job Responsibility
  • Collaborate with stakeholders to understand business problems in the CAE domain and translate them into AI solutions.
  • Design, develop, and fine-tune AI/ML models for Simulation result prediction and design optimization, Automating repetitive CAE tasks (meshing, boundary conditions, post-processing).
  • Evaluate, validate, and benchmark model performance using appropriate metrics.
  • Deploy AI models into production environments in collaboration with IT/AI teams.
  • Establish monitoring and maintenance processes to ensure model accuracy over time.
  • Ensure that all AI solutions comply with organizational data security, confidentiality, and regulatory requirements.
  • Document workflows, results, and lessons learned for organisational knowledge sharing.
  • Stay updated on advancements in neural networks, multi-physics simulations, surrogate modelling and physics-informed learning techniques.
  • Fulltime
Read More
Arrow Right

Senior Engineer / Lead Engineer - Virtual Engineering - AI CFD

Senior Engineer / Lead Engineer – AI CFD will Drive AI innovation in CFD domain....
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Masters Degree Mechanical/Automobile/Production /Mechatronics Engineering discipline or similar
  • 5+ years experience in CFD at Automotive Product Development / Manufacturing Engineering
  • 2+ years experience in implementing AI solutions in CFD
  • Should have executed at least 5+ years of Core CFD domain (from problem definition to deployment) experience
  • Strong programming skills in Python and C++ for automation and solver integration
  • Experience with ML frameworks like Pytorch, TensorFlow
  • Knowledge of surrogate modeling, reduced-order modeling (ROMs), and regression techniques
  • Experience in data handling (large-scale CFD datasets) and feature engineering(feature extraction from flow fields like velocity, pressure, turbulence quantities)
  • Strong problem-solving and analytical mindset
  • Understanding of data annotation tools and MLOps workflows
Job Responsibility
Job Responsibility
  • Collaborate with stakeholders to understand business challenges in the CFD space and solve them using API based customization and AI methodologies
  • Collect, clean, annotate, and prepare datasets for text analysis and image comparison tasks
  • Design, develop, and fine-tune AI/ML models for: Automating mesh generation, solver setup, and post-processing of CFD results
  • Building optimization pipelines for thermal and fluid systems using AI-assisted approaches
  • Evaluate, validate, and benchmark model performance using appropriate metrics
  • Deploy AI models into production environments in collaboration with IT/AI teams
  • Establish monitoring and maintenance processes to ensure model accuracy over time
  • Ensure that all AI solutions comply with organizational data security, confidentiality, and regulatory requirements
  • Document workflows, results, and lessons learned for organizational knowledge sharing
  • Stay updated on advancements in neural networks, multi-physics simulations, surrogate modelling and physics-informed learning techniques
  • Fulltime
Read More
Arrow Right

Lead GenAI Lead Engineer, Innovation Labs – SVP

We're on the hunt for a highly skilled and experienced senior engineer to lead t...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant experience in the software industry
  • Proven experience in senior positions as principal engineer or architect
  • Experience with cloud architectures, and specific experience with public cloud offerings
  • Proficiency in programming languages such as Python
  • Experience as a people manager
  • Highly experienced in delivering complex solutions to production, preferably in Python and AI/ML ecosystem
  • Great passion and proven hands-on experience integrating with AI/ML technologies
  • Strong and diverse technical background
  • Ability to quickly learn and understand new technologies, influence highly skilled engineering teams and overall have a personal impact on technology decisions and vision
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Lead development of highly complex AI solutions, infrastructure, and architecture topics
  • Lead a team of engineers and data-scientists
  • Actively contribute to software development, both as a coder and as a reviewer
  • Manage executive stakeholder audience in technology by acting as a partner, trusted advisor and operating through influence
  • Work with internal and external partners to design, validate, and deliver solutions with a commercial benefit for Citi
  • Manage multiple concurrent initiatives and projects of varying sizes & complexity
  • Engage with data-science, technical and business stakeholders to define and design overall architecture for key use-cases across our lines of business
  • Work with external vendors and start-ups around joint initiatives and around exploration of new directions
  • Partner with Citi’s Risk and Governance partners to ensure best practice is followed from the perspectives of good governance, risk management, standardization, and tooling
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Fulltime
Read More
Arrow Right