Multimodal Algorithm Engineer (Model Optimization)

China, Shanghai · Employment contract · Job Posted May 28, 2026

Job Description

We are a core algorithm team at AMD, dedicated to end-to-end AI workload optimization on AMD platforms. We are seeking talented engineers specializing in multimodal foundation models, with a focus on Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), and World Action Models (WAMs). In this role, you will drive model training, compression, quantization, inference optimization, and efficient deployment—enabling next-generation embodied AI and multimodal agents to achieve peak performance on AMD hardware platforms.

Job Responsibility

  • Optimize training strategies, fine-tuning, and alignment for multimodal models (VLM / VLA / WAM) on AMD platforms
  • Enhance action prediction, world state modeling, and long-horizon planning capabilities of WAM/VLA models for embodied intelligence scenarios (e.g., robotics, simulation-based interaction)
  • Design and implement model optimization techniques including quantization (PTQ/QAT), pruning, knowledge distillation, operator fusion, and KV cache optimization to improve inference latency, throughput, and energy efficiency
  • Collaborate closely with compiler, driver, and system software teams to deeply integrate models into AMD’s software stack
  • Stay at the forefront of research in World Models, action generation, and multimodal agents—and explore novel architectures for AMD’s heterogeneous compute platforms

Requirements

  • Master’s or PhD in Computer Science, Artificial Intelligence, Robotics, Electrical Engineering, or a related field
  • Hands-on experience with VLMs, VLAs, or WAMs (World Action Models)—especially in robotics decision-making, simulated environment training, or action sequence generation—is highly preferred
  • Proficiency in PyTorch and familiarity with multimodal and embodied AI frameworks
  • Familiarity with simulation platforms such as Isaac Gym, LIBERO, MuJoCo, or RoboTwin
  • Strong software engineering skills and ability to deliver full-cycle solutions—from research prototyping to production deployment

Nice to have

  • Contributions to open-source projects in multimodal agents, world models, or robotics (e.g., OpenVLA, DROID, ACT)
  • Publication record in top-tier conferences (e.g., CVPR, ICRA, CoRL, NeurIPS, ICLR) in multimodal learning or embodied AI is a strong advantage
  • Strong background in model optimization: quantization, sparsity, kernel fusion, dynamic batching, etc.
  • Experience with AMD ROCm ecosystem or heterogeneous computing performance tuning
  • Understanding of GPU/accelerator architecture; experience with CUDA or HIP is a plus

What we offer

  • Access to cutting-edge AMD compute resources
  • Unique opportunity to shape full-stack co-design across algorithms, compilers, and hardware
  • A collaborative, globally distributed team of world-class AI systems and robotics researchers

Similar Jobs for Multimodal Algorithm Engineer (Model Optimization)

LLM Algorithm Tech Lead – Applied Large Language Model Systems

Plaud is building the next generation intelligence infrastructure and interfaces...
Location: United States, San Francisco
Salary: 230000.00 - 300000.00 USD / Year
Company: Plaud
Expiration Date: Until further notice
Requirements
  • 5–10 years of experience in LLM/NLP/AI
  • Strong prompt engineering and reasoning design skills
  • Proven ability to deliver LLM-powered features into production
  • Experience with RAG and knowledge-enhanced reasoning
  • Strong architectural thinking and system design skills
  • Strong communication and leadership capabilities
  • Knowledge of memory systems or personalization engines
  • Experience building eval frameworks or safety systems
  • Experience leading technical teams
  • Experience in foundational model algorithm design or efficiency optimization
Job Responsibility
  • Intelligence Architecture Development: Design structured reasoning pipelines, planning flows, chain-of-thought workflows
  • Build capability primitives such as memory, personalization, proactive insights
  • Develop modular and reusable intelligence components
  • Applied LLM Features & Production Integration: Lead the design and deployment of LLM-based product functionality
  • Ensure output reliability, consistency, safety, and user-centric alignment
  • Apply prompting, constraints, and reasoning structures to reduce hallucination
  • Retrieval-Augmented Generation (RAG): Build and optimize multi-hop, multi-source retrieval pipelines
  • Implement chunking, indexing, reranking, and retrieval evaluation
  • Ensure RAG improves factuality and reduces error rates
  • Model Strategy & Inference Optimization: Select appropriate model families based on capability and cost constraints
What we offer
  • Competitive Compensation: $230K-$300K base salary + performance bonus + equity
  • Comprehensive Benefits: Top-tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy
  • Retirement Planning: 401(k) plan for full-time employees with company matching
  • Paid Time Off: Unlimited PTO, plus 13 paid holidays
  • New Parent Leave: 12 weeks of paid time off to spend time with your new family, regardless of gender
  • Hybrid Office: Minimum of 3x in office per week
  • Gear: New hires are equipped with their choice of new top-of-the-line laptops and workstation setups
  • Perks: Best office equipment. Annual offsites. Free office drinks and snacks
Employment type: Full-time

GenAI ML Engineer

We are seeking a talented GenAI ML Engineer to develop and deploy cutting-edge g...
Location: Canada, Toronto
Salary: 130000.00 USD / Year
Company: Realign
Expiration Date: Until further notice
Requirements
  • Total Experience: 6-8 years
  • Generative AI Development: Design and implement generative AI models for text, code, and multimodal applications
  • LLM Engineering: Fine-tune, optimize, and deploy large language models (GPT, Claude, Llama, etc.)
  • Model Training: Develop training pipelines for custom generative models and foundation model adaptation
  • Python Development: Build robust ML applications, APIs, and services using Python and ML frameworks
  • Prompt Engineering: Create and optimize prompts for various LLM applications and use cases
  • Model Evaluation: Implement evaluation frameworks for generative AI model performance and safety
  • Production Deployment: Deploy and monitor ML models in production environments with proper scaling
  • Research Innovation: Stay current with the latest GenAI research and implement state-of-the-art techniques
  • Data Pipeline Management: Build data preprocessing and feature engineering pipelines for ML workflows
Employment type: Full-time

Senior AI Engineer

This role will be tasked with applying machine learning/deep learning to the aut...
Location: United States, Belmont
Salary: 170000.00 - 210000.00 USD / Year
Company: Volkswagen AG
Expiration Date: Until further notice
Requirements
  • 6-8 years of professional experience post graduate degree preferred
  • 4+ years' Deep Learning experience post graduate degree preferred
  • Master's Degree in Computer Science or equivalent
  • PhD Strongly Preferred
  • Strong knowledge of different machine learning algorithms
  • Proficiency in deep learning techniques and frameworks
  • Strong understanding of traditional machine learning algorithms and their applications
  • Expertise in computer vision, including object detection, image segmentation, and image recognition
  • Proficiency in NLP techniques, including sentiment analysis, text generation, and language understanding models
  • Experience with multimodal language modeling and applications
Job Responsibility
  • Applying machine learning/deep learning to the automotive industry
  • Maintaining and enhancing existing machine learning modules for autonomous vehicles
  • Designing and implementing new machine learning based approaches based on existing frameworks
  • Keeping up to speed with the state of the art of academic research and technology in the industry
  • Coordinating with engineers at the ICC and in Germany on the development of autonomous driving software
  • Transferring technologies and solutions to Volkswagen Group development divisions
  • Developing technical specifications and documentation
  • Representing Volkswagen Group in the technical community, such as at conferences
Employment type: Full-time

Research Scientist Intern, Multimodal and Multitasking Machine Learning

Meta Reality Labs Research is looking for upcoming scientists and researchers wi...
Location: United States, Redmond
Salary: 7313.00 - 12134.00 USD / Month
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a PhD in the fields of Computer Science, Electrical Engineering, or related field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • 2+ years research experience in one or more of the following: developing machine learning and computer vision models, optimization of edge computing algorithms, or distributed compute architectures
  • 2+ years experience programming in Python/C++
  • Experience with deep learning frameworks (PyTorch, TensorFlow, etc.)
Job Responsibility
  • Research on design / model / execution of efficient ML algorithms
  • Research on novel ML or computational imaging algorithms for applications and optimize existing algorithms
  • Research on development and optimization of edge computing algorithms (ML and non-ML)
  • Collaboration with and support of other researchers across various disciplines
  • Communication of research agenda, progress and results
  • Prototyping, building and characterizing experimental systems and custom hardware

Search Machine Learning Research Engineer

Perplexity is seeking an experienced Senior Machine Learning Engineer to help bu...
Location: Germany, Berlin
Salary: Not provided
Company: Perplexity
Expiration Date: Until further notice
Requirements
  • Deep understanding of search and retrieval systems, including quality evaluation principles and metrics
  • Proven track record with large-scale search or recommender systems
  • Strong proficiency with PyTorch, including experience in distributed training techniques and performance optimization for large models
  • Expertise in representation learning, including contrastive learning and embedding space alignment for multilingual and multimodal applications
  • Strong publication record in AI/ML conferences or workshops (e.g., NeurIPS, ICML, ICLR, ACL, CVPR, SIGIR)
  • Self-driven, with a strong sense of ownership and execution
  • Minimum of 3 years (preferably 5+) working on search, recommender systems, or closely related research areas
Job Responsibility
  • Relentlessly push search quality forward — through models, data, tools, or any other leverage available
  • Architect and build core components of the search platform and model stack
  • Design, train, and optimize large-scale deep learning models using frameworks like PyTorch, leveraging distributed training (e.g., PyTorch Distributed, DeepSpeed, FSDP) and hardware acceleration, with a focus on retrieval and ranking models
  • Conduct advanced research in representation learning, including contrastive learning, multilingual, and multimodal modeling for search and retrieval
  • Deploy models — from boosting algorithms to LLMs — in a scalable and performant way
  • Build and optimize RAG pipelines for grounding and answer generation
  • Collaborate with Data, AI, Infrastructure, and Product teams to ensure fast and high-quality delivery
Employment type: Full-time

Senior Machine Learning Scientist

We are seeking highly skilled and innovative Machine Learning Scientists to join...
Location: United States, Boston
Salary: 290250.00 - 500400.00 USD / Year
Company: Axon
Expiration Date: Until further notice
Requirements
  • PhD in Computer Science or a related field with a focus on LLMs, MLLMs, Computer Vision, or GenAI, plus 5+ years of experience for ML Scientist, 8+ years for Sr. ML Scientist, or 10+ years for Principal ML Scientist
  • Proven track record of research excellence in LLM, MLLM, Computer Vision, Robotics Perception, GenAI, demonstrated through publications in top-tier conferences or journals
  • Strong proficiency in programming languages such as Python and C/C++, experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and experience with ROS (Robot Operating System)
  • Drive one or more phases of the ML development lifecycle: shape datasets, investigate modeling approaches and architectures, train/evaluate/tune models and implement the end-to-end training pipeline
  • Leverage state-of-the-art research to deliver high quality models enabling multiple AI projects at scale
  • Contribute back to the research community via academic publications, tech blogs, open-source code and contributing to internal/external AI challenges
  • Experience in developing computer vision algorithms for resource-constrained devices such as mobile phones, IoT devices, or embedded systems is highly desirable
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibility
  • Own one or more key technical areas across LLM, MLLM, CV product portfolio
  • Provide technical leadership to junior scientists, guiding the transition of R&D concepts into impactful Axon product features
  • Research and develop cutting-edge techniques in LLM, MLLMs, GenAI, and Computer Vision across cloud, devices and sensors based data sources
  • Design and implement efficient and scalable MLLM models for inference and analysis of multimodal data
  • Explore novel approaches to address challenges in NLP, NLU, Object Detection, Object Recognition, Object Tracking, Segmentation, and Scene Understanding
  • Optimize AI models, algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Join forces with MLEs, firmware engineers, or hardware engineers to leverage hardware accelerators and optimize algorithms for specific hardware architectures
  • Evaluate the performance of LLM, MLLM, CV models using real-world datasets and design experiments to validate their effectiveness
  • Stay up-to-date with the latest research trends and advancements in computer vision, machine learning, and deep learning, MLLMs, GenAI and integrate relevant findings into our projects
  • Contribute to patent disclosures, academic publications, and technical documentation to share insights and findings with the broader community
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
Employment type: Full-time

Senior Machine Learning Scientist

Join Axon and be a Force for Good. At Axon, we’re on a mission to Protect Life. ...
Location: United States, Scottsdale
Salary: Not provided
Company: Axon
Expiration Date: Until further notice
Requirements
  • PhD in Computer Science or a related field with a focus on LLM, MLLMs, Computer Vision, GenAI
  • 5+ years of experience for ML Scientist, 8+ years for Sr. ML Scientist, or 10+ years for Principal ML Scientist
  • Proven track record of research excellence in LLM, MLLM, Computer Vision, Robotics Perception, GenAI, demonstrated through publications in top-tier conferences or journals
  • Strong proficiency in programming languages such as Python, C/C++
  • Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras
  • Experience with ROS (Robot Operating System)
  • Drive one or more phases of the ML development lifecycle: shape datasets, investigate modeling approaches and architectures, train/evaluate/tune models and implement the end-to-end training pipeline
  • Leverage state-of-the-art research to deliver high quality models enabling multiple AI projects at scale
  • Contribute back to the research community via academic publications, tech blogs, open-source code and contributing to internal/external AI challenges
  • Experience in developing computer vision algorithms for resource-constrained devices such as mobile phones, IoT devices, or embedded systems is highly desirable
Job Responsibility
  • Own one or more key technical areas across LLM, MLLM, CV product portfolio
  • Provide technical leadership to junior scientists, guiding the transition of R&D concepts into impactful Axon product features
  • Research and develop cutting-edge techniques in LLM, MLLMs, GenAI, and Computer Vision across cloud, devices and sensors based data sources
  • Design and implement efficient and scalable MLLM models for inference and analysis of multimodal data
  • Explore novel approaches to address challenges in NLP, NLU, Object Detection, Object Recognition, Object Tracking, Segmentation, and Scene Understanding
  • Optimize AI models, algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Join forces with MLEs, firmware engineers, or hardware engineers to leverage hardware accelerators and optimize algorithms for specific hardware architectures
  • Evaluate the performance of LLM, MLLM, CV models using real-world datasets and design experiments to validate their effectiveness
  • Stay up-to-date with the latest research trends and advancements in computer vision, machine learning, and deep learning, MLLMs, GenAI and integrate relevant findings into our projects
  • Contribute to patent disclosures, academic publications, and technical documentation to share insights and findings with the broader community
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
Employment type: Full-time