CrawlJobs Logo

AI Software Product Engineer (GPU)

amd.com Logo

AMD

Location Icon

Location:
China , Beijing

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

AI Product Applications Engineer (GPU AI SW Solution Architect) – China position is in the AMD AI group, located in China. At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.

Job Responsibility:

  • Lead and contribute to AI open‑source software projects that support the developer community and the broader ecosystem
  • Drive developer enablement through technical content (blogs, tutorials, user guides) and AI Academy initiatives
  • Support the success of AI developers, communities, and customer PoCs through hands‑on technical contributions
  • Capture and prioritize developer and customer requirements to influence AMD’s AI software and solutions roadmap
  • Analyze competitive AI software and solutions to identify strengths/weaknesses and clearly communicate AMD’s value propositions
  • Provide feedback and requirements for AI software across cloud, client, and edge deployments

Requirements:

  • Deep knowledge of Data Center, Client, Endpoint AI workloads such as LLM, Generative AI, Recommendation, and/or transformer
  • Hands-on experiences with various AI models, end-to-end pipeline, industry framework (pytorch, vLLM, SGLang, llm-d,Triton) / SDKs and solutions
  • Hands‑on experience with AI frameworks, including PyTorch, vLLM, SGLang, Unsloth, TensorRT‑LLM, Megatron‑LM, and DeepSpeed
  • Proven experience in LLMs, Generative AI models, transformer architectures, and end‑to‑end AI pipelines
  • Familiarity with AMD MI‑series GPU architecture, GPU kernel programming, and the ROCm AI software stack is strongly preferred
  • Strong communication and presentation skills, with the ability to articulate architectural proposals and value propositions clearly
  • BS required
  • MS preferred, with 6+ years of relevant industry experience

Additional Information:

Job Posted:
April 02, 2026

Job Link Share:
PREMIUM
More languages and countries
+ Unlock 29494 hidden job offers
Languages
English Čeština Deutsch Ελληνικά Español Français +15
Countries
United States United Kingdom India Canada Australia +
See plans
Plans from $2.99 / month

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI Software Product Engineer (GPU)

Software Engineer

Software Engineer – Stealth AI Start-Up – Remote (US / Canada). I’m supporting a...
Location
Location
United States
Salary
Salary:
Not provided
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience, ideally with exposure to developer tools, distributed systems, or high-performance applications
  • A strong academic foundation – typically demonstrated through excellent performance in a rigorous CS or engineering program
  • Great fit for someone coming from Big Tech who wants more ownership, autonomy, and direct product impact
  • Experience with compiled or systems-level languages (e.g., Rust, Go, C++, or similar) is a plus, but not required
  • Additional bonus experience with: Linux / shell development
  • Backend systems
  • GPU, graphics, or performance-oriented programming
  • Love building tools that improve developer workflows and productivity
  • Have strong engineering fundamentals and a user-focused mindset
  • Communicate clearly and collaborate well across teams
Job Responsibility
Job Responsibility
  • Build features across the full stack: design, development, testing, and deployment
  • Improve performance, reliability, and the end-to-end developer experience
  • Partner closely with engineering, product, growth, and marketing teams
  • Prototype new ideas and iterate quickly based on real user feedback
  • Mentor teammates and help shape a strong, high-quality engineering culture
  • Fulltime
Read More
Arrow Right

AI Engineer

Location
Location
Vietnam , Da Nang
Salary
Salary:
Not provided
saigontechnology.com Logo
Saigon Technology
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have programming skills in one of these languages: Python, Java...
  • At least 1 years of experience working with AI/ML projects as a Data Engineer, Research Engineer, or Software Engineer.
  • Have experience with one of ML/DL Frameworks: Tensorflow, Pytorch, Keras, scikit-learn, Pandas, Langchain, LlamaIndex…
  • Have experience working with OpenAI, Gemini, any LLM... to build agents, workflows, or RAG systems.
  • Have experience with one of chatbot building frameworks or services like Rasa, Dialogflow, Transformer, BERT, LLM/Prompt…
  • Familiar with OCR algorithms or services: OpenCV, Tesseract, Textract (AWS), Google Cloud Vision, PaddleOCR.
  • Experience with common development tools: Linux, GPU server, Google Colab, Jupyter, Git, Docker.
  • Good English proficiency and communication skills.
Job Responsibility
Job Responsibility
  • Join in developing and applying the ML/DL techniques to resolve our client’s business problems such as building chatbot systems, LLM/Prompt, OCR systems, fraud detection systems, facial recognition systems…
  • Join in developing some internal products applying AI models
What we offer
What we offer
  • Competitive Salary and Brilliant Health Benefits
  • Attractive salary (13th-month salary, salary review twice/year) and project bonus
  • Bonus programs for candidate referral, technical article writing
  • Interest-free loan support for personal plan
  • Allowance for sickness, maternity, paternity and periodic health examination
  • PVI health care program
  • The staff of the quarter and year reward
  • Progressive and Fun Working Environment
  • A professional English-speaking working environment with Agile – Scrum model
  • Hybrid Working Model: Flexible working time and WFH support.
  • Fulltime
Read More
Arrow Right

AI Software Product Engineer (GPU Kernel)

AI Product Applications Engineer (Solution Architect) – China position is in the...
Location
Location
China , Shanghai
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands‑on experience with AI frameworks, including PyTorch, vLLM, SGLang, Unsloth, TensorRT‑LLM, Megatron‑LM, and DeepSpeed
  • Proven experience in LLMs, Generative AI models, transformer architectures, and end‑to‑end AI pipelines
  • Familiarity with AMD MI‑series GPU architecture, GPU kernel programming, and the ROCm AI software stack is strongly preferred
  • Strong communication and presentation skills, with the ability to articulate architectural proposals and value propositions clearly
  • BS required
  • MS preferred, with 6+ years of relevant industry experience
Job Responsibility
Job Responsibility
  • Lead and contribute to AI open‑source software projects that support the developer community and the broader ecosystem
  • Drive developer enablement through technical content (blogs, tutorials, user guides) and AI Academy initiatives
  • Support the success of AI developers, communities, and customer PoCs through hands‑on technical contributions
  • Capture and prioritize developer and customer requirements to influence AMD’s AI software and solutions roadmap
  • Analyze competitive AI software and solutions to identify strengths/weaknesses and clearly communicate AMD’s value propositions
  • Provide feedback and requirements for AI software across cloud, client, and edge deployments
Read More
Arrow Right

Senior Software Engineer- AI

Are you looking for an opportunity to work with the latest Azure offerings and p...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in Software Development
  • Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs
  • Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions
  • Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks
  • Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems
  • Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops
  • Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing
  • Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure
  • Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems
  • Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures
Job Responsibility
Job Responsibility
  • Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows
  • Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution
  • Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios
  • Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources
  • Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety
  • Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications
  • Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes
  • Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices
  • Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

Are you looking for an opportunity to work with the latest Azure offerings and p...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10–12+ years of experience in software engineering, with significant experience building scalable backend or distributed systems
  • Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs
  • Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions
  • Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks
  • Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems
  • Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops
  • Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing
  • Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure
  • Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems
  • Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures
Job Responsibility
Job Responsibility
  • Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows
  • Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution
  • Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios
  • Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources
  • Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety
  • Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications
  • Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes
  • Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices
  • Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications
  • Fulltime
Read More
Arrow Right

Senior Manager, Performance AI/ML Network Deployment Engineering

The Senior Manager, DC GPU Advanced Forward Deployment and Systems Engineering i...
Location
Location
United States , Santa Clara
Salary
Salary:
210400.00 - 315600.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in networking and performance optimization for large-scale AI/ML networks, including network, compute, storage cluster design, modelling, analytics, performance tuning, convergence, scalability improvements
  • Prefer candidates with solid, hands-on expertise in at least one or more of 3 domains, namely compute, network, storage
  • Experience in working with large customers such as Cloud Service Providers and global enterprise customers
  • Proven leadership in engaging customers with diverse technical disciplines in avenues such as Proof of Concept, Competitive evaluations, Early Field Trials etc
  • Direct experience in working with large customers and can operate with sense of urgency, own the problems and resolve it
  • Demonstrated leadership in network architecture, hands on experience in RoCEv2 Design, VXLAN-EVPN, BGP, and Lossless Fabrics
  • Proven ability to influence design and technology roadmaps, leveraging a deep understanding of datacenter products and market trends
  • Extensive hands-on Network deployment expertise and proven track record of delivering large projects on time. Cisco, Juniper or Arista experience is preferred
  • Direct, co-development/deployment experience in working with strategic customers/partners in bringing solutions to market
  • Excellent communication level from engineer to mid-management to C-level of audience
Job Responsibility
Job Responsibility
  • Collaborate with strategic customers on scalable designs involving compute, networking, storage environment, work with industry partners, Internal teams to accelerate the deployment, adoption of various AI/ML models
  • Engage system-level triage and at-scale debug of complex issues across hardware, firmware, and software, ensuring rapid resolution and system reliability
  • Drive the ramp of Instinct-based large scale AI datacenter infrastructure based on NPI base platform hardware with ROCm, scaling up to pod and cluster level, leveraging the best in network architecture for AI/ML workloads
  • Enhance tools and methodologies for large-scale deployments to meet customer uptime goals and exceed performance expectations
  • Engage with clients to deeply understand their technical needs, ensuring their satisfaction with tailored solutions that leverage your past experience in strategic customer engagements and architectural wins
  • Provide domain specific knowledge to other groups at AMD, share the lessons learnt to drive continuous improvement
  • Engage with AMD product groups to drive resolution of application and customer issues
  • Develop and present training materials to internal audiences, at customer venues, and at industry conferences
Read More
Arrow Right

Principal Software Engineering Manager - AI Frameworks

As a Principal Software Engineering Manager - AI Frameworks on the team, you wil...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 304200.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master’s Degree in Computer Science or related technical field AND 10+ years of software engineering experience, including 6+ years in engineering management, OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years of software engineering experience, including 6+ years in engineering management, or equivalent experience
  • Strong technical foundation in software engineering principles, computer architecture, GPU architecture, and hardware acceleration for neural networks, with the ability to guide teams working in these areas
  • Experience leading teams responsible for end-to-end performance analysis and optimization of LLMs, AI systems, or HPC workloads, including use of GPU profiling and performance analysis tools
  • Demonstrated ability to lead cross-team initiatives, align stakeholders, and translate research or platform capabilities into scalable, production-ready solutions
  • Proven people leadership skills, including hiring, coaching, performance management, and career development, with a track record of building high-performing, inclusive teams
  • Exposure to AI / ML infrastructure, including DNN or LLM training and/or inference systems, and experience with at least one modern deep learning framework (e.g., PyTorch, TensorFlow, ONNX Runtime)
  • Familiarity with GPU software stacks and acceleration technologies such as CUDA, ROCm, Triton, or equivalent, sufficient to guide technical direction and evaluate tradeoffs
Job Responsibility
Job Responsibility
  • Lead and develop a team of engineers working across multiple layers of the AI software stack to enable large-scale training and inference
  • Set technical vision and execution strategy for model performance benchmarking, optimization, and deployment across GPUs and Microsoft hardware
  • Drive performance outcomes by prioritizing and overseeing efforts to benchmark, profile, debug, and optimize training and inference workloads
  • Own performance health by establishing mechanisms to monitor regressions, measure impact, and continuously improve time-to-deploy and hardware efficiency
  • Partner cross-functionally with research, product, infrastructure, and hardware teams to deliver scalable, production-ready AI performance improvements
  • Balance short-term delivery and long-term investments, ensuring the team’s work aligns with organizational goals, platform roadmaps, and Azure capex objectives
  • Build a strong engineering culture through coaching, feedback, hiring, and career development, enabling the team to operate with increasing autonomy and impact
  • Fulltime
Read More
Arrow Right

Ai Gpu Product Planning Lead - Embedded Software

AMD's AECG group is looking for a AI GPU Product Planning Lead - Embedded Softwa...
Location
Location
Canada , Markham
Salary
Salary:
147680.00 - 221520.00 CAD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong understanding of GPU architectures and GPU software stacks, including graphics, compute, and AI/ML workloads, across Linux and Windows environments
  • Experience with virtualization and embedded systems is highly desirable
  • Strong cross-functional communication and dependency management skills
  • Experience in software integration, program planning, or technical project coordination
  • BS/MS in Computer Science, Computer Engineering, or Electrical Engineering
Job Responsibility
Job Responsibility
  • Work with market segment leads, product line managers, and solution planning team to translate key embedded customer needs into actionable engineering requirements
  • Work with engineering teams to establish and maintain deliverable roadmaps with a focus on early enablement for key embedded customers
  • Define and align requirements for all solution components: Virtualization, Yocto Project Linux for embedded ROCm support, ROCm for AI workloads and specialized stacks (ROS, multimedia analytics, VLM/LLM/CNN's)
  • Work with AECG technical marketing and field support teams to deliver timely demonstrations and examples to promote customer adoption
  • Act as single-point coordinator between ROCm dev, OS enablement, virtualization, QA, software engineering and customer enablement teams
  • Fulltime
Read More
Arrow Right