AI Software Product Engineer (GPU) Job at AMD (Beijing)

AI Software Product Engineer (GPU Kernel)

AI Product Applications Engineer (Solution Architect) – China position is in the...

Location

China , Shanghai

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Hands‑on experience with AI frameworks, including PyTorch, vLLM, SGLang, Unsloth, TensorRT‑LLM, Megatron‑LM, and DeepSpeed
Proven experience in LLMs, Generative AI models, transformer architectures, and end‑to‑end AI pipelines
Familiarity with AMD MI‑series GPU architecture, GPU kernel programming, and the ROCm AI software stack is strongly preferred
Strong communication and presentation skills, with the ability to articulate architectural proposals and value propositions clearly
BS required
MS preferred, with 6+ years of relevant industry experience

Job Responsibility

Lead and contribute to AI open‑source software projects that support the developer community and the broader ecosystem
Drive developer enablement through technical content (blogs, tutorials, user guides) and AI Academy initiatives
Support the success of AI developers, communities, and customer PoCs through hands‑on technical contributions
Capture and prioritize developer and customer requirements to influence AMD’s AI software and solutions roadmap
Analyze competitive AI software and solutions to identify strengths/weaknesses and clearly communicate AMD’s value propositions
Provide feedback and requirements for AI software across cloud, client, and edge deployments

Ai Gpu Product Planning Lead - Embedded Software

AMD's AECG group is looking for a AI GPU Product Planning Lead - Embedded Softwa...

Location

Canada , Markham

Salary:

147680.00 - 221520.00 CAD / Year

AMD

Expiration Date

Until further notice

Requirements

Strong understanding of GPU architectures and GPU software stacks, including graphics, compute, and AI/ML workloads, across Linux and Windows environments
Experience with virtualization and embedded systems is highly desirable
Strong cross-functional communication and dependency management skills
Experience in software integration, program planning, or technical project coordination
BS/MS in Computer Science, Computer Engineering, or Electrical Engineering

Job Responsibility

Work with market segment leads, product line managers, and solution planning team to translate key embedded customer needs into actionable engineering requirements
Work with engineering teams to establish and maintain deliverable roadmaps with a focus on early enablement for key embedded customers
Define and align requirements for all solution components: Virtualization, Yocto Project Linux for embedded ROCm support, ROCm for AI workloads and specialized stacks (ROS, multimedia analytics, VLM/LLM/CNN's)
Work with AECG technical marketing and field support teams to deliver timely demonstrations and examples to promote customer adoption
Act as single-point coordinator between ROCm dev, OS enablement, virtualization, QA, software engineering and customer enablement teams

Fulltime

Senior Software Development Engineer, AI Open-Source Software

AMD is looking for a principal software developer to join our growing team. As a...

Location

United States , Santa Clara

Salary:

204000.00 - 306000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

10+ years professional software development experience
Demonstrated capacity to technically lead and people manage junior to mid-level developers
Proficient in C/C++ & Python programming employing best software design practices
GPU software development or validation involving HIP, CUDA, or OpenCL
Experience with software libraries and API design
Exposure to Matrix/Tensor operations and numerical work
Software emulation to support FP numerical formats is a plus
Experience in software performance estimations, optimizations and debugging
Ability to closely interact with technical leads, developers, and test teams to maintain and release production software

Job Responsibility

Develop software in C++, Python, HIP, assembly, and SOTA programming technologies to enable key mathematical operations on GPU
Design GPU computational software libraries for AI, HPC applications
Aid management in planning, and delivering industry-leading software for current and future processors
Supervise small development team
Carry-out performance optimizations and projections for important use-cases to maximize hardware utilization
Support development of programs to sustain seamless performance analysis, and performance/functional test coverage
Identify and help resolve quality issues working closely with libraries development teams and other internal engineering teams

What we offer

Benefits offered are described: AMD benefits at a glance

Fulltime

Principal Ai Software Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great prod...

Location

United States , San Jose

Salary:

240000.00 - 360000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

Knowledge in GPU architectures, basic knowledge of CPU architecture
Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
Understanding of GPU programming such as ROCm, CUDA, OpenCL, etc
Experience in hardware/software co-design, building high-performance products across the full product lifecycle
Experience with operating systems (OS) and device driver development is a plus
Undergrad degree required. Bachelor of Science, Masters, or PhD degree with emphasis in Electrical Engineering, Computer architecture, or Computer Science with relevant experience preferred

Job Responsibility

Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
Strong Execution: Deliver innovations and roadmap for AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
Community & Open Source: Mentor and inspire other engineers and contribute to ROCm Opensource

What we offer

AMD benefits at a glance

Fulltime

Principal AI Software Engineer

AMD AI Group is seeking a highly influential technical leader for OneROCm — driv...

Location

United States , San Jose

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Knowledge in GPU architectures, basic knowledge of CPU architecture
Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
Understanding of GPU programming such as ROCm, CUDA, OpenCL, etc
Experience in hardware/software co-design, building high-performance products across the full product lifecycle
Experience with operating systems (OS) and device driver development is a plus
Undergrad degree required. Bachelor of Science, Masters, or PhD degree with emphasis in Electrical Engineering, Computer architecture, or Computer Science with relevant experience preferred

Job Responsibility

Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
Strong Execution: Deliver innovations and roadmap for AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
Community & Open Source: Mentor and inspire other engineers and contribute to ROCm Opensource

What we offer

Benefits offered are described: AMD benefits at a glance

Fulltime

Senior Software Engineer- AI

Are you looking for an opportunity to work with the latest Azure offerings and p...

Location

India , Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

7+ years of experience in Software Development
Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs
Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions
Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks
Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems
Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops
Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing
Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure
Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems
Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures

Job Responsibility

Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows
Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution
Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios
Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources
Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety
Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications
Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes
Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices
Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications

Fulltime

Member of Technical Staff - Software Engineer (AI infra)

Microsoft AI is looking for a Member of Technical Staff - Software Engineer to h...

Location

Switzerland , Zürich

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Experience with generative AI
Experience with distributed computing

Job Responsibility

Develop and tune the pretraining scalable software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
Benchmark GB200 and AMD MIxxx GPU clusters
Gather data and insights to develop the pretraining compute roadmap
Care deeply about conversational AI and its deployment
Actively contribute to the development of AI models that are powering our innovative products
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values

Fulltime

Ai Software Engineer

Meta is seeking a Software Engineer to join our team. The candidate is someone w...

Location

United Kingdom , London

Salary:

Not provided

Select Country

AI Software Product Engineer (GPU)

Job Description

Job Responsibility

Requirements

Looking for more opportunities?