This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are a core algorithm team at AMD, dedicated to end-to-end AI workload optimization on AMD platforms. We are seeking talented engineers specializing in multimodal foundation models, with a focus on Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), and World Action Models (WAMs). In this role, you will drive model training, compression, quantization, inference optimization, and efficient deployment—enabling next-generation embodied AI and multimodal agents to achieve peak performance on AMD hardware platforms.
Job Responsibility
Optimize training strategies, fine-tuning, and alignment for multimodal models (VLM / VLA / WAM) on AMD platforms
Enhance action prediction, world state modeling, and long-horizon planning capabilities of WAM/VLA models for embodied intelligence scenarios (e.g., robotics, simulation-based interaction)
Design and implement model optimization techniques including quantization (PTQ/QAT), pruning, knowledge distillation, operator fusion, and KV cache optimization to improve inference latency, throughput, and energy efficiency
Collaborate closely with compiler, driver, and system software teams to deeply integrate models into AMD’s software stack
Stay at the forefront of research in World Models, action generation, and multimodal agents—and explore novel architectures for AMD’s heterogeneous compute platforms
Requirements
Master’s or PhD in Computer Science, Artificial Intelligence, Robotics, Electrical Engineering, or a related field
Hands-on experience with VLMs, VLAs, or WAMs (World Action Models)—especially in robotics decision-making, simulated environment training, or action sequence generation—is highly preferred
Proficiency in PyTorch
familiarity with multimodal and embodied AI frameworks
Familiarity with simulation platforms such as Isaac Gym, LIBERO, MuJoCo, or RoboTwin
Strong software engineering skills and ability to deliver full-cycle solutions—from research prototyping to production deployment
Nice to have
Contributions to open-source projects in multimodal agents, world models, or robotics (e.g., OpenVLA, DROID, ACT)
Publication record in top-tier conferences (e.g., CVPR, ICRA, CoRL, NeurIPS, ICLR) in multimodal learning or embodied AI is a strong advantage
Strong background in model optimization: quantization, sparsity, kernel fusion, dynamic batching, etc.
Experience with AMD ROCm ecosystem or heterogeneous computing performance tuning
Understanding of GPU/accelerator architecture
experience with CUDA or HIP is a plus
What we offer
Access to cutting-edge AMD compute resources
Unique opportunity to shape full-stack co-design across algorithms, compilers, and hardware
A collaborative, globally distributed team of world-class AI systems and robotics researchers