This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a highly skilled AI Architect with deep expertise in Generative AI, LLMs, Video Models (Digital Humans / Avatars), and end-to-end AI product architecture. This role requires a hands-on technologist and strategic thinker who can design scalable AI systems, guide development teams, interact with global clients, and drive high-impact AI initiatives across cloud, on-prem GPU servers, and edge devices (AI PCs). You will architect solutions that span the spectrum—from H200-class GPU compute for very large LLM workloads to lightweight, optimized models that run efficiently on edge devices. This is a senior, high-ownership role for someone passionate about building real-world AI products at scale.
Job Responsibility:
Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models
Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines
Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment
Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies
Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices)
Oversee experimentation frameworks, evaluation methodologies, and MLOps integration
Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling
Work with Nvidia GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs)
Optimize AI workloads across heterogeneous compute environments
Lead AI architecture across POC → MVP → GA → production-scale phases
Contribute to roadmap planning, feasibility analysis, and technical risk assessment
Ensure performance, scalability, cost efficiency, and robustness of AI products
Embed data privacy, security controls, Responsible AI, and governance frameworks into product design
Ensure adherence to enterprise AI policies, guardrails, and regulatory requirements
Interact with global clients (North America & Europe) to understand requirements, present architectures, and provide expert guidance
Create clear architecture diagrams, documentation, and high-quality technical specifications for developers and stakeholders
Serve as the technical face of the project in client discussions
Collaborate with AI Engineers, Data Scientists, Product Owners, Cloud Architects, and MLOps teams
Mentor teams in AI design patterns, best practices, and solution development
Conduct architecture reviews, code/design audits, and knowledge-sharing sessions
Requirements:
Minimum 10 years of experience in ML/AI solution architecture
Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures
Strong experience in ML engineering, data pipelines, and scalable model APIs
Hands-on experience with Nvidia GPU systems, CUDA stack, TensorRT, vLLM/Ollama, and model optimization
Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs)
Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments
Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools
Proven track record of architecting POC, MVP, and production-grade AI products
Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate)
Excellent communication skills for client presentations and internal leadership discussions
Ability to work in a fast-paced, multi-project environment across global teams
Nice to have:
Graduation from a Tier-1 institute (IIT, NIT, IIIT, or equivalent)
Certifications in AI/ML Architecture, Solution Architecture, or Cloud Architecture (AWS, Azure)
Experience with enterprise AI governance and generative AI compliance frameworks
Welcome to CrawlJobs.com – Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.
We use cookies to enhance your experience, analyze traffic, and serve personalized content. By clicking “Accept”, you agree to the use of cookies.