CrawlJobs Logo

Multimodal AI Engineer, Document Understanding

llamaindex.ai Logo

LlamaIndex

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Join us and help shape the future of AI by redefining document workflows with AI agents. We are seeking exceptional AI engineers to join our core document understanding team. You will work at the intersection of computer vision, natural language processing, and production ML systems to push the boundaries of what's possible in document parsing and understanding. Our document understanding team builds the intelligence behind LlamaParse, LlamaExtract, and our other processing products. These systems are processing millions of complex documents including PDFs, PowerPoints, Word documents, and spreadsheets. Your work will directly impact thousands of developers building RAG applications and document agents, while also contributing to our open-source frameworks that shape how the industry approaches document processing. Depending on your background and interests, you might focus more on data curation and evaluation, model fine-tuning and experimentation, or ML infrastructure and production systems. We're hiring multiple people and will work with you to find the best fit.

Job Responsibility:

  • Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
  • Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
  • Design and implement production ML systems that handle complex, real-world documents at scale
  • Stay current with latest advances in vision-language models, document AI, and multimodal learning
  • Collaborate with engineering teams to integrate ML innovations into production APIs
  • Contribute to both our open-source frameworks and enterprise offerings
  • Drive technical decisions while balancing research exploration with product delivery

Requirements:

  • 3-7 years of experience in machine learning engineering or applied research
  • Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
  • Hands-on experience training, fine-tuning, or deploying ML models in production
  • Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
  • Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
  • Ability to read and implement from research papers and technical specifications
  • Track record of executing with high intensity in fast-paced environments
  • Strong technical communication skills and comfort with open-source collaboration

Nice to have:

  • Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)
  • Experience building evaluation frameworks, benchmarks, or data quality pipelines
  • Experience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps tools
  • Experience specifically with document understanding, OCR, or layout analysis
  • Contributions to open-source ML projects or frameworks
  • Experience with LLM applications and RAG systems
  • Strong understanding of model optimization techniques (quantization, distillation, pruning)
  • Experience with Docker/Kubernetes and distributed systems
  • Active participation in ML research community
What we offer:
  • Competitive base salary and equity compensation
  • Comprehensive medical/dental/vision coverage for you and your family
  • Unlimited paid time off policy
  • Daily catered lunch and snacks in the San Francisco office
  • Budget for conferences, research materials, and professional development
  • Access to cutting-edge compute resources and research tools

Additional Information:

Job Posted:
December 10, 2025

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Multimodal AI Engineer, Document Understanding

Senior AI Engineer

This role will be tasked with applying machine learning/deep learning to the aut...
Location
Location
United States , Belmont
Salary
Salary:
170000.00 - 210000.00 USD / Year
https://www.volkswagen-group.com Logo
Volkswagen AG
Expiration Date
Until further notice
Requirements
Requirements
  • 6-8 years of professional experience post graduate degree preferred
  • 4+ years' Deep Learning experience post graduate degree preferred
  • Master's Degree in Computer Science or equivalent
  • PhD Strongly Preferred
  • Strong knowledge of different machine learning algorithms
  • Proficiency in deep learning techniques and frameworks
  • Strong understanding of traditional machine learning algorithms and their applications
  • Expertise in computer vision, including object detection, image segmentation, and image recognition
  • Proficiency in NLP techniques, including sentiment analysis, text generation, and language understanding models
  • Experience with multimodal language modeling and applications
Job Responsibility
Job Responsibility
  • Applying machine learning/deep learning to the automotive industry
  • Maintaining and enhancing existing machine learning modules for autonomous vehicles
  • Designing and implementing new machine learning based approaches based on existing frameworks
  • Keeping up to speed with the state of the art of academic research and technology in the industry
  • Coordinating with engineers at the ICC and in Germany on the development of autonomous driving software
  • Transferring technologies and solutions to Volkswagen Group development divisions
  • Developing technical specifications and documentation
  • Representing Volkswagen Group in the technical community, such as at conferences
  • Fulltime
Read More
Arrow Right
New

AI Content Engineer

Join us and help shape the future of AI by architecting next-generation knowledg...
Location
Location
United States , San Francisco
Salary
Salary:
Not provided
llamaindex.ai Logo
LlamaIndex
Expiration Date
Until further notice
Requirements
Requirements
  • Experience in software engineering (ML engineering + research a bonus)
  • Strong software engineering fundamentals with production Python experience
  • Understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
  • Demonstrated ability to write clearly, quickly, and authentically about technical topics
  • Bias toward shipping - comfortable publishing at blog pace, not paper pace
  • Ability to read, understand, and synthesize research papers rapidly
  • Scrappy and self-directed - can identify what's worth writing about and execute end-to-end
  • Track record of high-velocity output in fast-paced environments
Job Responsibility
Job Responsibility
  • Design, build, and maintain comprehensive benchmarks for document parsing and understanding
  • Publish high-quality technical content at a weekly cadence (blog posts, benchmark reports, technical comparisons, tutorials)
  • Stay deeply current with the document AI landscape - new models, papers, competitors, techniques
  • Run experiments and translate findings into publishable artifacts quickly
  • Produce technical analyses that demonstrate our capabilities against alternatives
  • Contribute to open-source examples, notebooks, and documentation
  • Collaborate with the core ML team to surface improvements and capabilities worth highlighting
  • Engage authentically with the developer community through technical content (not conferences/events)
What we offer
What we offer
  • Shape the Narrative: Your content will define how developers think about document understanding. You'll have direct influence on market perception
  • Technical Credibility: Work with cutting-edge document AI systems processing millions of documents. Your benchmarks and analyses will be grounded in real capabilities
  • High Autonomy: Significant freedom to identify what matters and publish quickly. No lengthy approval chains
  • Growth Opportunity: Help build this function from the ground up as we scale
  • Fulltime
Read More
Arrow Right

AI Solutions Architect

We are looking for a highly skilled AI Architect with deep expertise in Generati...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
nstarxinc.com Logo
NStarX
Expiration Date
Until further notice
Requirements
Requirements
  • Minimum 10 years of experience in ML/AI solution architecture
  • Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures
  • Strong experience in ML engineering, data pipelines, and scalable model APIs
  • Hands-on experience with Nvidia GPU systems, CUDA stack, TensorRT, vLLM/Ollama, and model optimization
  • Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs)
  • Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments
  • Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools
  • Proven track record of architecting POC, MVP, and production-grade AI products
  • Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate)
  • Excellent communication skills for client presentations and internal leadership discussions
Job Responsibility
Job Responsibility
  • Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models
  • Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines
  • Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment
  • Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies
  • Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices)
  • Oversee experimentation frameworks, evaluation methodologies, and MLOps integration
  • Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling
  • Work with Nvidia GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs)
  • Optimize AI workloads across heterogeneous compute environments
  • Lead AI architecture across POC → MVP → GA → production-scale phases
What we offer
What we offer
  • Competitive base + commission
  • Fast growth into leadership roles
  • Fulltime
Read More
Arrow Right

Data Engineer (AI / ML)

We are investing massively in developing next-generation AI tools for multimodal...
Location
Location
United Kingdom; Greece , London
Salary
Salary:
Not provided
satalia.com Logo
Satalia
Expiration Date
Until further notice
Requirements
Requirements
  • High proficiency in Python and SQL
  • Strong knowledge of data structures, data modelling, and database operation
  • Proven hands-on experience building and deploying data solutions on a major cloud platform (AWS, GCP, or Azure)
  • Familiarity with containerization technologies such as Docker and Kubernetes
  • Familiarity with Retrieval-Augmented Generation (RAG) applications and modern AI/LLM frameworks (e.g., LangChain, Haystack, Google GenAI, etc.)
  • Demonstrable experience designing, implementing, and optimizing robust data pipelines for performance, reliability, and cost-effectiveness in a cloud-native environment
  • Experience in supporting data science workloads and working with both structured and unstructured data
  • Experience working with both relational (e.g., PostgreSQL, MySQL) and NoSQL databases
  • Experience with a big data processing framework (e.g., Spark)
Job Responsibility
Job Responsibility
  • Collaborate closely with data scientists, architects, and other stakeholders to understand and break down business requirements
  • Collaborate on schema design, data contracts, and architecture decisions, ensuring alignment with AI/ML needs
  • Provide data engineering support for AI model development and deployment, ensuring data scientists have access to the data they need in the format they need it
  • Leverage cloud-native tools (GCP/AWS/Azure) for orchestrating data pipelines, AI inference workloads, and scalable data services
  • Develop and maintain APIs for data services and serving model predictions
  • Support the development, evaluation and productionisation of agentic systems with: LLM-powered features and prompt engineering
  • Retrieval-Augmented Generation (RAG) pipelines
  • Multimodal vector embeddings and vector stores
  • Agent development frameworks: ADK, LangGraph, Autogen
  • Model Context Protocol (MCP) for integrating agents with tools, data and AI services
What we offer
What we offer
  • enhanced pension
  • life assurance
  • income protection
  • private healthcare
  • Remote working
  • Truly flexible working hours
  • Generous Leave - 27 days holiday plus bank holidays and enhanced family leave
  • Annual bonus
  • Impactful projects
  • People oriented culture
  • Fulltime
Read More
Arrow Right

Senior Data Engineer - AI Focused

At Doctolib, we're on a mission to transform healthcare through the power of AI....
Location
Location
France , Paris
Salary
Salary:
Not provided
doctolib.fr Logo
Doctolib
Expiration Date
Until further notice
Requirements
Requirements
  • Master’s or Ph.D. degree in Computer Science, Data Engineering, or a related field
  • 5+ years of experience in Data Engineering, ideally supporting AI or ML workloads
  • Strong experience with the GCP data ecosystem
  • Proficiency in Python and SQL, with experience in data pipeline orchestration (e.g., Airflow, Dagster, Cloud Composer)
  • Deep understanding of NoSQL systems (e.g., MongoDB) and vector databases (e.g., FAISS, Vector Search)
  • Experience designing data architectures for RAG, embeddings, or model training pipelines
  • Knowledge of data governance, security, and compliance for sensitive or regulated data
  • Familiarity with W&B / MLflow / Braintrust / DVC for experiment tracking and dataset versioning (extract snapshots, change tracking, reproducibility)
  • Familiarity with containerized environments (Docker, Kubernetes) and CI/CD for data workflows
  • A collaborative mindset and passion for building the data foundations of next-generation AI systems
Job Responsibility
Job Responsibility
  • Ensure high standards of data quality for AI model inputs
  • Design, build, and maintain scalable data pipelines on Google Cloud Platform (GCP) for AI and machine learning use cases
  • Implement data ingestion and transformation frameworks that power Retrieval systems and training datasets for LLMs and multimodal models
  • Architect and manage NoSQL and Vector Databases to store and retrieve embeddings, documents, and model inputs efficiently
  • Collaborate with ML and platform teams to define data schemas, partitioning strategies, and governance rules that ensure privacy, scalability, and reliability
  • Integrate unstructured and structured data sources (text, speech, image, documents, metadata) into unified data models ready for AI consumption
  • Optimize performance and cost of data pipelines using GCP native services (BigQuery, Dataflow, Pub/Sub, Cloud Storage, Vertex AI)
  • Contribute to data quality and lineage frameworks, ensuring AI models are trained on validated, auditable, and compliant datasets
  • Continuously evaluate and improve our data stack to accelerate AI experimentation and deployment
What we offer
What we offer
  • Free comprehensive health insurance for you and your children
  • Parent Care Program: additional leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy
  • Work Council subsidy to refund part of a sport club membership or a creative class
  • Up to 14 days of RTT
  • Lunch voucher with Swile card
  • Fulltime
Read More
Arrow Right

Senior AI Engineer (ML/DL)

This role will be tasked with applying machine learning/deep learning to the aut...
Location
Location
United States , Belmont
Salary
Salary:
170000.00 - 210000.00 USD / Year
https://www.volkswagen-group.com Logo
Volkswagen AG
Expiration Date
Until further notice
Requirements
Requirements
  • 6-8 years of professional experience post graduate degree PREFERRED
  • 4+ years’ Deep Learning experience post graduate degree PREFERRED
  • Master’s Degree in Computer Science or equivalent
  • Proficiency in deep learning techniques and frameworks
  • Strong understanding of traditional machine learning algorithms and their applications
  • Expertise in computer vision, including object detection, image segmentation, and image recognition
  • Proficiency in NLP techniques, including sentiment analysis, text generation, and language understanding models
  • Experience with multimodal language modeling and applications
  • Deep understanding of various neural network architectures such as CNNs, RNNs, and Transformers
  • Familiarity with reinforcement learning algorithms and their applications in AI
Job Responsibility
Job Responsibility
  • Applying machine learning/deep learning to the automotive industry
  • Applications could include autonomous driving, manufacturing, design etc
  • Integrating the modules you build into real cars
  • Thinking about questions around testability and proving safety
  • Maintaining, as well as furthers, enhances existing machine learning modules for autonomous vehicles
  • Designing and implementing new machine learning based approaches based on existing frameworks
  • Keeping up to speed with the state of the art of academic research and technology in the industry
  • Coordinating with engineers at the ICC and in Germany on the development of autonomous driving software
  • Transferring technologies and solutions to Volkswagen Group development divisions
  • Developing technical specifications and documentation
What we offer
What we offer
  • Eligibility for annual performance bonus
  • Healthcare benefits
  • 401(k), with company match
  • Defined contribution retirement program
  • Tuition reimbursement
  • Company lease car program
  • Paid time off
  • Fulltime
Read More
Arrow Right

Associate Director, SEO Strategy

The Associate Director, SEO Strategy leads the charge in shaping and executing i...
Location
Location
United States
Salary
Salary:
94700.00 - 130000.00 USD / Year
basicagency.com Logo
BASIC/DEPT®
Expiration Date
Until further notice
Requirements
Requirements
  • 7+ years of professional SEO experience (agency experience strongly preferred)
  • Proven track record of leading successful enterprise-level SEO strategies
  • Deep expertise in advanced technical SEO, content, and authority
  • Curiosity and working knowledge of AI Search / AIO / GEO principles
  • Experience mentoring and managing small teams or project pods
  • Excellent communication and presentation skills for executive audiences
  • Strong proficiency in tools such as Google Search Console, GA4, Screaming Frog, Semrush, Ahrefs, Looker Studio, BigQuery, Scrunch, Profound, Otterly, and AirOps
  • Strong understanding of data storytelling and performance forecasting
  • Demonstrated ability to collaborate cross-functionally and build consensus
  • Excellent writing, communication, and organizational skills
Job Responsibility
Job Responsibility
  • Prepare and confidently present advanced Technical SEO, Content, AIO / GEO, and other SEO-adjacent capabilities to internal and external clients
  • Own the full strategic direction for multiple mid-market and enterprise SEO clients, ensuring all initiatives ladder up to business outcomes and integrated marketing goals
  • Proactively identify growth opportunities within content, technical, and authority strategies, ensuring ongoing innovation and measurable improvement across all areas of organic performance
  • Demonstrate deep technical proficiency across crawling, indexing, rendering, structured data, Core Web Vitals, and JavaScript frameworks
  • Drive innovation through AI-driven initiatives such as AIO / GEO audits, AI Overview visibility analyses, semantic similarity assessments, passage-level optimization, and synthetic query testing
  • Translate emerging AI search opportunities and trends into actionable deliverables, training, and measurable KPIs
  • Collaborate cross-functionally with Paid Media, Conversion Rate Optimization, Social, and Content teams to translate opportunities into integrated holistic search strategies that drive unified channel growth
  • Serve as the senior strategic lead for organic search, driving growth and long-term partnership success
  • Lead executive-level presentations and quarterly business reviews (QBRs), sharing clear performance stories backed by data and actionable insights
  • Oversee client communication, expectation management, and escalation resolution with clarity and confidence
What we offer
What we offer
  • Healthcare, Dental, and Vision coverage
  • 401k plan, plus matching
  • PTO
  • Paid Company Holidays
  • Parental Leave
  • Fulltime
Read More
Arrow Right

Research Intern - GenAI

Appen is seeking Research Interns to support innovative research in Generative A...
Location
Location
Australia , Chatswood, Sydney
Salary
Salary:
Not provided
appen.com Logo
Appen
Expiration Date
Until further notice
Requirements
Requirements
  • Postgraduate students in Linguistics, Computer Science, AI, Data Science, or similar disciplines preferred
  • strong final-year and recent undergraduate candidates in these fields will also be considered
  • Familiarity with programming languages such as Python, R, or similar tools used in data analysis and machine learning
  • Experience with data annotation, model evaluation, or prompt engineering
  • Understanding of multilingual NLP, speech technologies, or agentic AI systems
  • Strong written communication skills, especially for summarizing research and drafting technical content
  • Ability to work independently and collaboratively in a remote research environment
Job Responsibility
Job Responsibility
  • Conduct literature reviews on topics such as adversarial prompting, multilingual evaluation, and agentic AI
  • Assist in dataset curation, annotation, and quality assurance for speech, text, and multimodal data
  • Support model evaluation experiments, including prompt engineering and red teaming
  • Develop scripts and tools for data analysis, visualization, and automation
  • Contribute to internal documentation, research reports, and thought leadership content
  • Participate in team meetings and cross-functional collaborations
  • Help prepare materials for conferences, publications, and workshops
What we offer
What we offer
  • Hands-on experience in applied AI research with real-world impact
  • Mentorship from experienced researchers and exposure to industry workflows
  • Opportunities to contribute to publications, datasets, and thought leadership
  • A collaborative and inclusive research environment
Read More
Arrow Right