AI Model, Framework, and GPU Engineer

AMD

Location:
Germany, Munich

Contract Type:
Not provided

Salary:
Not provided

Job Description:

We are looking for an experienced Machine Learning Software Engineer who will be part of the AMD GPU Technology and Engineering Software Team developing our latest AI software technologies. You will engage with cross-functional teams to optimize various parts of the AI software stack and deliver AI solutions across AMD Radeon and Ryzen product families.

Job Responsibility:

  • Develop and deliver innovative AI software solutions to AMD customers and users
  • Enable and optimize the software stack for standard frameworks such as ONNX and PyTorch, as well as new, popular open-source AI software
  • Bring up new SOTA AI models, analyze and improve their performance
  • Participate in and drive end-to-end AI software development, from feature scoping, implementation, integration, and verification to customer enablement

Requirements:

  • Strong technical and analytical skills in C/C++/Python AI development in Windows and Linux environments
  • Some knowledge of GPU programming and compilers
  • Capable problem solver
  • Technical leader who can define goals and scope and drive the development effort
  • Good communication skills
  • Enthusiastic about AI technologies
  • Strongly motivated to enable customers with the best feature-rich, efficient solutions
  • Strong cross-platform software development experience and deep programming skills in C/C++ and Python
  • Excellent problem-solving and effective communication skills
  • Development experience on CONV, GEMM, and/or non-linear operators
  • GPU acceleration experience with compilers and low-level GPU programming is a plus
  • Experience with common AI frameworks and inference stacks
  • Solid knowledge of AI and ML concepts and techniques
  • Understanding of the performance implications of different compute, memory, and communication configurations for AI acceleration
  • Open-source software development experience is a plus
  • Bachelor’s, Master’s, or PhD in Computer Science, Electrical Engineering, or a relevant field

Nice to have:

  • GPU acceleration experience with compilers and low-level GPU programming
  • Open-source software development experience

Additional Information:

Job Posted:
March 20, 2026

Work Type:
Hybrid work

Similar Jobs for AI Model, Framework, and GPU Engineer

Senior AI Engineer

This is an exciting new role at Efficy, central to our next-generation CRM innov...
Location:
France, Toulouse; Paris
Salary:
Not provided
efficy
Expiration Date:
Until further notice
Requirements:
  • Strong knowledge of the technical stack with excellent command of AI-related languages and frameworks
  • Experience architecting distributed, orchestrated AI systems (protocols, pipelines, integration into complex workflows)
  • Skills in cost-performance optimization for models (latency, GPU usage, scalability)
  • Strong analytical mindset, technical rigor, and ability to prioritize
  • Autonomy, proactivity, and ability to drive complex initiatives in a rapidly evolving environment
  • Excellent written and verbal communication skills in French and English (fluency required)
Job Responsibility:
  • Define and execute the AI agents strategy within the CRM platform, ensuring strong business relevance and smooth integration into existing workflows
  • Design agent architectures, including MCP protocol, orchestration, tool calling, memory management, evaluation, contextualization, RAG, and prompt engineering
  • Oversee deployment of AI solutions in both multi-tenant SaaS and on-premise environments
  • Select, adapt, and industrialize European open-source models, optimizing cost-performance trade-offs
  • Ensure security, GDPR compliance, and explainability of all developed agents
  • Implement observability and evaluation tools, including tracing, A/B testing, and cost monitoring
  • Collaborate closely with Data Science, Product, and Engineering teams to guarantee technical and functional alignment
What we offer:
  • A stable and growing company with an entrepreneurial mindset, where your ideas are valued, and we support you in making them happen
  • High flexibility—hybrid work is part of our DNA
  • State-of-the-art offices where teamwork is the norm
  • International growth opportunities and internal mobility
  • A competitive salary package and a referral program
  • Engaging events: team lunches, after-work gatherings, sports activities, and trips
  • Learning opportunities: languages, technology, product knowledge, sales techniques, and leadership development

AI Engineer

Location:
Vietnam, Da Nang
Salary:
Not provided
Saigon Technology
Expiration Date:
Until further notice
Requirements:
  • Have programming skills in one of these languages: Python, Java...
  • At least 1 year of experience working with AI/ML projects as a Data Engineer, Research Engineer, or Software Engineer.
  • Have experience with one of ML/DL Frameworks: TensorFlow, PyTorch, Keras, scikit-learn, Pandas, LangChain, LlamaIndex…
  • Have experience working with OpenAI, Gemini, any LLM... to build agents, workflows, or RAG systems.
  • Have experience with one of chatbot building frameworks or services like Rasa, Dialogflow, Transformer, BERT, LLM/Prompt…
  • Familiar with OCR algorithms or services: OpenCV, Tesseract, Textract (AWS), Google Cloud Vision, PaddleOCR.
  • Experience with common development tools: Linux, GPU server, Google Colab, Jupyter, Git, Docker.
  • Good English proficiency and communication skills.
Job Responsibility:
  • Help develop and apply ML/DL techniques to solve our clients’ business problems, such as building chatbot systems, LLM/Prompt, OCR systems, fraud detection systems, facial recognition systems…
  • Help develop internal products that apply AI models
What we offer:
  • Competitive Salary and Brilliant Health Benefits
  • Attractive salary (13th-month salary, salary review twice/year) and project bonus
  • Bonus programs for candidate referral, technical article writing
  • Interest-free loan support for personal plan
  • Allowance for sickness, maternity, paternity and periodic health examination
  • PVI health care program
  • The staff of the quarter and year reward
  • Progressive and Fun Working Environment
  • A professional English-speaking working environment with Agile – Scrum model
  • Hybrid Working Model: Flexible working time and WFH support.
  • Full-time

AI Solution Engineer

As an AI Solution Engineer your role will be to architect, build, and deliver AI...
Location:
United States
Salary:
130500.00 - 300000.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor's, Master's or other Advanced degree in Engineering, Computer Science, or similar quantitative focus
  • 4+ years of experience working with Machine Learning or Deep Learning
  • Experience working with Kubernetes
  • Competency working with the latest LLM frameworks, both Open Source (e.g. LangChain, LlamaIndex) and proprietary (e.g. NVIDIA NeMo/NIM)
  • Competency writing ML code (for example, using PyTorch)
  • Experience with Python, Unix-like systems
  • Ability to quickly prototype functionality into scripts for demos, integrations, troubleshooting, etc.
  • Understanding of hardware requirements associated with deep learning model training or inference, and how model attributes and performance factors affect it
  • Knowledge of current AI landscape, including popular models, frameworks, applications, and capabilities
  • Experience working with on-premise hardware / GPU clusters
Job Responsibility:
  • Lead technical discussions with prospects and partners to propose HPE and partner Integrated solutions that address business challenges and opportunities using AI
  • Demo AI solutions (either existing or built by you) to prospects that address their use cases or desired AI outcomes
  • Lead Proof-of-Concepts / Proof-of-Value engagements for HPE prospects that demonstrate clear value from HPE's AI offerings, likely in combination with 3rd Party and Open Source components
  • Assist with any product or technical issue on the way to an initial sale or a customer renewal
  • Help enable prospects, partners, and internal HPE teams on HPE's value in the AI landscape and how HPE and partner solutions can help solve real world business problems
What we offer:
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Full-time

AI Solutions Architect

We are looking for a highly skilled AI Architect with deep expertise in Generati...
Location:
India, Hyderabad
Salary:
Not provided
NStarX
Expiration Date:
Until further notice
Requirements:
  • Minimum 10 years of experience in ML/AI solution architecture
  • Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures
  • Strong experience in ML engineering, data pipelines, and scalable model APIs
  • Hands-on experience with Nvidia GPU systems, CUDA stack, TensorRT, vLLM/Ollama, and model optimization
  • Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs)
  • Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments
  • Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools
  • Proven track record of architecting POC, MVP, and production-grade AI products
  • Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate)
  • Excellent communication skills for client presentations and internal leadership discussions
Job Responsibility:
  • Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models
  • Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines
  • Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment
  • Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies
  • Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices)
  • Oversee experimentation frameworks, evaluation methodologies, and MLOps integration
  • Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling
  • Work with Nvidia GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs)
  • Optimize AI workloads across heterogeneous compute environments
  • Lead AI architecture across POC → MVP → GA → production-scale phases
What we offer:
  • Competitive base + commission
  • Fast growth into leadership roles
  • Full-time

Director Software Development

At AMD, we are enabling the next generation of AI innovation by leveraging the p...
Location:
China, Shanghai
Salary:
Not provided
AMD
Expiration Date:
Until further notice
Requirements:
  • 10+ years in AI/ML software development
  • 5+ years in leadership roles managing AI model enablement or optimization teams
  • Expertise in optimizing real-time AI models for deep learning applications (computer vision, NLP, etc.)
  • Proficiency with AI frameworks (TensorFlow, PyTorch, ONNX Runtime, JAX, Triton) and their optimization for GPU architectures
  • Strong background in optimizing software for AMD GPUs or similar high-performance platforms
  • Familiarity with ROCm is a plus
  • Proven experience with performance optimization, benchmarking, and scaling AI models on GPUs
  • Exceptional ability to collaborate cross-functionally and define long-term strategies for AI/ML innovation
  • Strong verbal and written communication skills, with experience presenting to senior leadership and working with customers and partners
  • Advanced degree (Master’s or PhD) in Computer Science, Electrical Engineering, AI/ML, or related field
Job Responsibility:
  • Lead and develop teams responsible for AI inference model enablement and optimization
  • Direct efforts to optimize AI frameworks for seamless compatibility and performance on AMD GPUs (Instinct, Navi)
  • Oversee benchmarking, performance tuning, and optimization of AI inference models to improve latency, throughput, and efficiency on AMD hardware
  • Partner with hardware, software, and QA teams to ensure tight integration of AI frameworks with ROCm for maximum performance
  • Drive AI model optimization innovations, enhancing the speed, efficiency, and scalability of AI workloads
  • Lead the vision and strategy for optimizing AI inference on AMD GPUs
  • Collaborate with customers and open-source communities to ensure that AMD’s AI solutions meet industry needs, fostering contributions to MIGraphX, vLLM, and other AMD AI Framework Inference teams
  • Oversee automation frameworks to streamline model integration and performance testing, ensuring scalability across diverse AI workloads

AI Models MAD - Model Automation and Dashboarding Engineer

AMD is looking for a skilled and motivated software engineer to join the Model A...
Location:
India, Hyderabad
Salary:
Not provided
AMD
Expiration Date:
Until further notice
Requirements:
  • Undergraduate and/or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
  • Strong C/C++/Python programming and software design skills, including debugging, performance analysis, and test design
  • Experience in test automation, CI/CD, and Linux scripting
  • Knowledge of GPU computing (HIP, CUDA, OpenCL)
  • AI model experience or knowledge in Natural Language Processing, Vision, Audio, Recommendation systems
  • Knowledge of Docker, Kubernetes, or Ansible for testing and deploying AI models and services at scale
  • Experience with profiling tools, system monitoring, or regression tracking systems for deep learning models
  • Proficiency with version control (GitHub), testing strategies, code reviews, and collaborative software development
  • Strong written and verbal communication skills with a proactive approach to defining and driving development efforts
Job Responsibility:
  • AI Model Enablement & Optimization: Enable and optimize key AI models (LLM, Vision, MultiModal, etc.) on AMD GPUs. Optimize AI frameworks like PyTorch, TensorFlow, etc., on AMD GPUs in upstream open-source repositories
  • Collaboration & Integration: Collaborate with internal GPU library teams and open-source framework maintainers to analyze, optimize, and integrate code changes upstream
  • Model Testing & Validation: Build and maintain automated functional and performance testing pipelines for AI models across ROCm-supported hardware using scalable tools
  • Benchmarking & Metrics: Develop tools and automation for continuous benchmarking and regression tracking across hardware generations and ROCm releases. Build and maintain real-time dashboards that report relevant performance, accuracy, and reliability metrics
  • Ecosystem & Open-Source Contributions: Support public-facing MAD GitHub repositories and Docker releases, enabling the community to run and validate models on ROCm. Contribute to the design of portable, easy-to-use Python interfaces that support multi-node profiling, distributed workloads, and containerized deployments

Senior Software Engineer, AI Platform and Enablement

We're building a next-generation AI-powered platform and web application for cre...
Location:
United States, San Francisco
Salary:
180000.00 - 286000.00 USD / Year
Descript
Expiration Date:
Until further notice
Requirements:
  • Experience in deploying and managing AI models in production
  • Experience with tools for large-volume data pipelines, such as Spark, Flume, Dask, etc.
  • Familiarity with cloud platforms (AWS, Google Cloud, Azure) and container technologies (Docker, Kubernetes)
  • Knowledge of DevOps and MLOps best practices
  • Strong problem-solving abilities and excellent communication skills
Job Responsibility:
  • Build, maintain, and standardize third-party model integrations, including consulting for other engineering teams with AI model integration needs
  • Design, implement, and maintain our AI infrastructure supporting our machine learning life cycle, including data ingestion pipelines, training developer experience and infrastructure, evaluation frameworks, and deployments / GPU infrastructure
  • Collaborate with Product Managers, Research Engineers, and AI Researchers to understand their infrastructure needs and ensure our AI systems are robust, scalable, and efficient
  • Optimize and scale our models and algorithms for efficient inference
  • Deploy, monitor, and manage AI models in production
What we offer:
  • Generous healthcare package
  • 401k matching program
  • Catered lunches
  • Flexible vacation time
  • Full-time

AI Research Lab Research Associate

We are currently seeking highly qualified interns to accelerate research towards...
Location:
United States, Milpitas
Salary:
43.27 - 93.15 USD / Hour
Hewlett Packard Enterprise
Expiration Date:
May 26, 2026
Requirements:
  • Pursuing PhD degree (or other degree with significant research and innovation experience) in a relevant discipline (e.g. machine learning, computer science, electrical engineering, math, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in machine learning
  • Experience with innovative solution development, such as developing proofs-of-concept, first-of-a-kind solutions, and/or technology transfer
  • Experience in deep learning research
  • Experience in developing deep learning software with high proficiency in data structures and algorithms
  • Strong programming skills and experience with Python, C/C++, and preferably Java
  • Software development experience in Deep Learning, GPU acceleration, and Model Optimization
  • Experience in Deep Learning and Machine Learning frameworks and models like TensorFlow, PyTorch
  • Experience in Transformer Neural Network architectures for Generative AI and natural language processing
  • Experience with Agentic AI and Generative AI workflows - desired
Job Responsibility:
  • Conduct research and come up with solutions with a fast turnaround time
  • Build the software and applications for Neural Networks and Machine Learning
  • Work with system programming, Deep Learning frameworks and models, GPU acceleration, Model optimization, real-time streaming data, distributed computing, and deployment
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Collaborate with HPE Labs research teams as well as external partners
  • Work in alignment with HPE's broader innovation community.
What we offer:
  • Health & Wellbeing benefits including physical, financial and emotional wellbeing support
  • Personal and professional development programs
  • Unconditional inclusion and flexibility to manage work and personal needs.
  • Full-time