AI Software Product Engineer (GPU Kernel) Job at AMD (Shanghai)

AI Software Product Engineer (GPU)

AI Product Applications Engineer (GPU AI SW Solution Architect) – China position...

Location

China, Beijing

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Deep knowledge of data center, client, and endpoint AI workloads such as LLMs, generative AI, recommendation, and/or transformers
Hands-on experience with various AI models, end-to-end pipelines, industry frameworks (PyTorch, vLLM, SGLang, llm-d, Triton), SDKs, and solutions
Hands‑on experience with AI frameworks, including PyTorch, vLLM, SGLang, Unsloth, TensorRT‑LLM, Megatron‑LM, and DeepSpeed
Proven experience in LLMs, Generative AI models, transformer architectures, and end‑to‑end AI pipelines
Familiarity with AMD MI‑series GPU architecture, GPU kernel programming, and the ROCm AI software stack is strongly preferred
Strong communication and presentation skills, with the ability to articulate architectural proposals and value propositions clearly
BS required
MS preferred, with 6+ years of relevant industry experience

Job Responsibility

Lead and contribute to AI open‑source software projects that support the developer community and the broader ecosystem
Drive developer enablement through technical content (blogs, tutorials, user guides) and AI Academy initiatives
Support the success of AI developers, communities, and customer PoCs through hands‑on technical contributions
Capture and prioritize developer and customer requirements to influence AMD’s AI software and solutions roadmap
Analyze competitive AI software and solutions to identify strengths/weaknesses and clearly communicate AMD’s value propositions
Provide feedback and requirements for AI software across cloud, client, and edge deployments

Senior Software Engineer- AI

Are you looking for an opportunity to work with the latest Azure offerings and p...

Location

India, Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

7+ years of experience in Software Development
Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs
Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions
Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks
Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems
Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops
Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing
Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure
Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems
Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures

Job Responsibility

Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows
Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution
Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios
Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources
Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety
Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications
Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes
Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices
Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications

Fulltime

Principal Software Engineer

Are you looking for an opportunity to work with the latest Azure offerings and p...

Location

India, Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

10–12+ years of experience in software engineering, with significant experience building scalable backend or distributed systems
Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs
Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions
Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks
Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems
Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops
Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing
Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure
Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems
Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures

Job Responsibility

Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows
Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution
Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios
Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources
Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety
Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications
Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes
Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices
Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications

Fulltime

Principal AI Software Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great prod...

Location

United States, San Jose

Salary:

240000.00 - 360000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

Knowledge of GPU architectures and basic knowledge of CPU architecture
Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
Understanding of GPU programming models such as ROCm, CUDA, and OpenCL
Experience in hardware/software co-design, building high-performance products across the full product lifecycle
Experience with operating systems (OS) and device driver development is a plus
Undergraduate degree required. Bachelor of Science, Master's, or PhD degree with an emphasis in Electrical Engineering, Computer Architecture, or Computer Science, with relevant experience, preferred

Job Responsibility

Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
Strong Execution: Deliver innovations and the roadmap for the AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
Community & Open Source: Mentor and inspire other engineers and contribute to ROCm open source

What we offer

AMD benefits at a glance

Fulltime

Principal Open Source AI/ML Solutions Engineer

The Senior Member in the GPU domain is a technical role responsible for owning t...

Location

India, Bangalore

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Strong C++ and Python programming skills
Performance analysis skills for both CPU and GPU
Good knowledge of AI/ML Frameworks and Architecture
Basic GPU kernel programming knowledge
Experience with software engineering methodologies such as Agile, Scrum, Kanban
Experience in all phases of software development, from requirements gathering, analysis, design, development, and testing to final release
Experience developing software in an end customer product delivery environment
Experience with open-source software development including collaboration with community maintainers and submitting contributions
Excellent analytical and problem-solving skills
Strong communication skills to effectively convey complex technical concepts to both technical and non-technical stakeholders

Job Responsibility

Architectural Design: Own architectural design and development of GPU software components, ensuring alignment with industry standards and best practices
Technical Leadership: Act as one of the subject matter experts in GPU technologies, providing guidance and mentorship to junior engineers in the team on complex technical challenges
Software Development: Design, write, and deliver high-quality open software solutions that enhance GPU performance and capabilities. This includes developing drivers, APIs, and other critical software components
Research and Innovation: Conduct research to explore new technologies and methodologies that can improve GPU performance and efficiency. Propose innovative solutions to meet evolving market demands
Collaboration: Work collaboratively with cross-functional teams, including hardware engineers, system architects, and product managers, to ensure successful integration of GPU technologies into broader systems
Documentation and Standards: Develop comprehensive technical documentation and establish coding standards to ensure maintainability and scalability of software products

Principal AI Software Engineer

AMD AI Group is seeking a highly influential technical leader for OneROCm — driv...

Location

United States, San Jose

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Knowledge of GPU architectures and basic knowledge of CPU architecture
Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
Understanding of GPU programming models such as ROCm, CUDA, and OpenCL
Experience in hardware/software co-design, building high-performance products across the full product lifecycle
Experience with operating systems (OS) and device driver development is a plus
Undergraduate degree required. Bachelor of Science, Master's, or PhD degree with an emphasis in Electrical Engineering, Computer Architecture, or Computer Science, with relevant experience, preferred

Job Responsibility

Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
Strong Execution: Deliver innovations and the roadmap for the AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
Community & Open Source: Mentor and inspire other engineers and contribute to ROCm open source

What we offer

Benefits offered are described: AMD benefits at a glance

Fulltime

Fellow, AI Software Architecture

AMD AI Group is seeking a highly influential technical leader for the role of AM...

Location

United States, San Jose

Salary:

268000.00 - 402000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

Knowledge of GPU architectures and basic knowledge of CPU architecture
Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
Understanding of GPU programming models such as ROCm, CUDA, and OpenCL
Experience in hardware/software co-design, building high-performance products across the full product lifecycle
Experience with operating systems (OS) and device driver development is a plus
Undergraduate degree required. Bachelor of Science, Master's, or PhD degree with an emphasis in Electrical Engineering, Computer Architecture, or Computer Science, with relevant experience, preferred

Job Responsibility

Strategic Leadership: Set the technical vision and roadmap for the AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure 'out-of-the-box' performance excellence on AMD hardware
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
Community & Mentorship: Act as a technical ambassador in industry forums and open-source communities. Mentor and inspire the next generation of AMD's technical leaders and engineers.

Fulltime

AI Inference/GPU Kernel Engineer

AMD is looking for a specialized software engineer who is passionate about impro...

Location

China, Beijing

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Strong object-oriented programming background, C/C++ preferred
Ability to write high quality code with a keen attention to detail
Experience with modern concurrent programming and threading APIs
Experience with Windows, Linux and/or Android operating system development
Experience with software development processes and tools such as debuggers, source code control systems (GitHub) and profilers is a plus
Effective communication and problem-solving skills
Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Job Responsibility

Work with AMD’s architecture specialists to improve future products
Apply a data-minded approach to target optimization efforts
Stay informed of software and hardware trends and innovations, especially pertaining to algorithms and architecture
Design and develop new groundbreaking AMD technologies
Participate in new ASIC and hardware bring-ups
Debug and fix existing issues, and research alternative, more efficient ways to accomplish the same work
Develop technical relationships with peers and partners

AI Software Product Engineer (GPU Kernel)

AMD

Location:
China, Shanghai

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:
March 21, 2026


Similar Jobs for AI Software Product Engineer (GPU Kernel)

AI Software Product Engineer (GPU)

Senior Software Engineer- AI

Principal Software Engineer

Principal AI Software Engineer

Principal Open Source AI/ML Solutions Engineer

Principal AI Software Engineer

Fellow, AI Software Architecture

AI Inference/GPU Kernel Engineer
