CrawlJobs Logo

AI Product Performance Engineer

China, Shenzhen · Job Posted May 17, 2026
Apply Position
Job Link Share

Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING. At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

Job Responsibility

  • High-Performance Kernel Development: Design, implement, and optimize high-performance GPU kernels for AI/ML workloads to maximize hardware utilization
  • Performance Optimization: Analyze and optimize kernel execution for latency and throughput, addressing bottlenecks in memory bandwidth, instruction latency, and thread divergence
  • Workload Analysis: Evaluate the end-to-end performance impact of individual kernels on full-stack AI models, ensuring that micro-optimizations translate to application-level speedups
  • Profiling & Tuning: Utilize advanced GPU profiling tools (e.g., ROCm Profiler, Pytorch Profiler) to identify performance cliffs, stall pipelines, and memory hierarchy inefficiencies
  • Architecture Adaptation: Tailor implementation strategies to leverage specific features of modern GPU architectures (e.g., Matrix Cores, HBM characteristics)
  • Framework Integration: Collaborate with software stack teams to expose optimized kernels within high-level frameworks and inference engines

Requirements

  • deep knowledge of Data Center AI workloads such as LLM, Generative AI, Recommendation, NLP, Video Analytics, and/or transformer
  • hands-on experiences with various AI models, end-to-end pipeline, industry framework / SDKs and solutions
  • GPU Architecture Mastery
  • Kernel Programming Expertise: Strong proficiency in C++ and parallel computing, with extensive hands-on experience in NVIDIA CUDA or AMD HIP kernel programming
  • Performance Engineering: Demonstrated ability to debug and profile complex GPU workloads
  • Systems Knowledge: Familiarity with asynchronous execution, stream management, and host-device memory transfers
  • Python DSLs & Triton: Experience implementing kernels using OpenAI Triton or other Python-based DSLs
  • Inference Engine Experience: Hands-on experience integrating custom kernels into large-scale inference frameworks such as vLLM, SGLang, or TensorRT-LLM
  • Deep Learning Frameworks: Familiarity with writing custom extensions or operators for PyTorch (C++/CUDA extensions)
  • Hardware Agnosticism: Experience porting kernels between NVIDIA and AMD architectures or working with cross-platform HPC libraries
  • BS required
  • MS preferred with several years of relevant industry experience

What we offer

AMD benefits at a glance

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

AI Product Performance Engineer

8 matching positions

Senior Golang Engineer - AI Product & Platforms

We are Citi's Application, Platform and Engineering team, a start-up with the ex...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Golang expertise essential
  • Deep-dive Golang engineering expertise from building high-performance, large-scale production systems
  • Production system builder – proven track record of architecting and building large-scale, high-availability production applications and business-facing platforms from the ground up using Go
  • Advanced Go expertise – deep proficiency in Go's concurrency model (goroutines, channels, select), memory management, profiling, and performance tuning for latency-sensitive systems
  • Microservices and API design – extensive experience designing, building, and maintaining RESTful and gRPC APIs with a focus on clean contracts, versioning, and backward compatibility in high-traffic production systems
  • HashiCorp Vault and secrets management – experience integrating with Vault for dynamic credentials, secrets engines, and enterprise-scale secrets management within Go services
  • Enterprise authentication & authorization – designing and implementing OAuth, JWT, RBAC, and complex identity systems with fine-grained access controls in business-critical applications
  • Cloud-native and Kubernetes expertise – building, deploying, and operating containerized Go applications in Kubernetes, leveraging service meshes, Helm charts, and cloud-native patterns at enterprise scale
  • AI/ML platform engineering – experience building backend infrastructure and APIs that serve AI/ML models, manage inference pipelines, and support LLM-powered applications at scale
  • Observability and reliability engineering – implementing comprehensive logging, metrics, distributed tracing, and alerting to ensure system health and rapid incident resolution
Job Responsibility
Job Responsibility
  • Build AI-powered products from 0-1 – Engineer production-grade, business-facing AI platforms with clean, performant, and maintainable Go code from day one
  • Design and build high-performance backend services – Architect and implement low-latency, high-throughput microservices in Go that operate reliably at planetary scale
  • Design and build developer tools and frameworks – Create reusable libraries, SDKs, and internal tooling in Go that accelerate development across fast-paced engineering teams
  • Tackle complex distributed systems challenges – Design solutions for concurrency, fault tolerance, data consistency, and service orchestration across large-scale distributed environments
  • Champion engineering excellence – Drive best practices in code quality, testing strategies, CI/CD pipelines, and observability to maintain velocity without sacrificing reliability
  • Mentor and elevate the team – Guide other engineers on Go idioms, system design patterns, performance optimization, and building software that scales
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Fulltime
Read More
Arrow Right

AI Performance Engineer

As a member of the Computing Product Line, Heterogeneous Memory Software Lab, yo...
Location
Location
Poland , Warszawa
Salary
Salary:
40000.00 - 50000.00 PLN / Month
devire.pl Logo
Devire
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of GPU or NPU architecture, including execution units, memory hierarchy, interconnects, and thread scheduling, as well as performance bottleneck analysis methodologies
  • Familiarity with mainstream deep learning frameworks such as PyTorch, TensorFlow, or JAX
  • Hands-on experience in deep learning operator/kernel development and performance tuning, with the ability to implement and optimize complex operators
  • Proficiency with performance analysis and profiling tools (e.g., Nsight Compute, nvprof, torch.profiler), and ability to conduct quantitative analysis and performance modeling
  • Strong system design and software engineering skills, with the ability to balance performance, maintainability, and generality in complex systems
  • Master’s or Ph.D. degree in Computer Architecture, Compiler Design, High Performance Computing, or a related field
Job Responsibility
Job Responsibility
  • Lead performance optimization of AI models on Ascend NPUs, including performance analysis, bottleneck identification, and optimization implementation for both training and inference workloads
  • Analyze performance bottlenecks of multimodal models and large language models (LLMs) on the Ascend platform, covering operators, kernels, memory access patterns, and scheduling
  • design and implement optimization strategies
  • Develop and optimize critical operators/kernels, continuously improving execution efficiency, memory access patterns, parallelization strategies, and hardware resource utilization
  • Research and apply advanced techniques such as auto-tuning, operator fusion, graph optimization, and scheduling optimization in real-world production scenarios
  • Build and lead an NPU performance optimization team
  • communicate findings to cross-functional teams and leadership, and contribute to the evolution of next-generation Ascend NPU architecture
What we offer
What we offer
  • Private healthcare package
  • Sport Cards
  • Benefit Platform
  • Special discounts for employees
  • Office massages
  • annual bonus
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - AI Product Engineer - Web

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, or related technical discipline AND 4+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR equivalent experience.
  • Bachelor’s degree in computer science, or related technical discipline AND 10+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR Master's Degree in Computer Science, or related technical discipline AND 8+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR equivalent experience.
  • Have 0 to 1 experience with a bias towards shipping and learning, while balancing a high-quality bar.
  • Thrive in a fast-paced, collaborative environment and are comfortable making progress in ambiguity.
  • Enjoy working closely with cross-functional partners and teammates in an inclusive, curious culture.
  • Take a user-centric approach to product development, prioritizing solutions that result in the best user experience and have the technical expertise to pull it off.
Job Responsibility
Job Responsibility
  • Ship delightful, AI powered experiences that will shape how millions of people will interact with AI in the future
  • Collaborate with AI researchers, product managers, and designers to bring a world-class AI companion to the world
  • Design and build efficient and reusable front-end systems that drive complex web applications
  • Plan and deploy front end infrastructure necessary to build, test, and deploy our products
  • Join a small team of world class product engineers with deep frontend expertise who are obsessed with building beautiful and performant products
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Senior Product AI Engineer

You’ll join us to bring this framework to the next level, leading projects (AI, ...
Location
Location
Salary
Salary:
Not provided
wetravel.com Logo
WeTravel
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of software engineering experience (ideally full-stack)
  • Strong engineering skills and proven experience in GenAI / LLM applications (i.e. built and launched LLM-enabled products to customers)
  • Proficiency with Ruby on Rails, or in at least two other languages Python/Go/Java/Kotlin/Node.js or .NET with desire to learn Ruby
  • Have experience and the desire to build user experiences (e.g. web front-ends)
  • Have experience building and working with distributed systems, microservices and event-driven architecture and demonstrate strong systems thinking and can design for scalability
  • Have experience with production systems, monitoring, and on-call responsibilities
  • Have excellent communication skills and experience working in multicultural, distributed teams
  • Take a pragmatic approach to using AI tools to improve productivity
  • Have experience leading projects
  • Proficiency with LLM providers and SDKs (e.g., OpenAI, Anthropic, Google, Meta)
Job Responsibility
Job Responsibility
  • Lead and build features end-to-end: from reviewing user interviews and product design, through architection and building systems to deployment and monitoring in production
  • Partner closely with our product team to discover user problems and shape their solutions - creating a world class experience for our organizers and travelers
  • Write high-quality, maintainable code across both backend (Ruby on Rails), and frontend (TypeScript/React)
  • Ensure our services are always on by building resilient applications, ensuring they are well monitored and mitigating incidents as an on-call/incident responder
  • Mentor teammates, grow the team's AI capacity, and contribute to WeTravel’s engineering practices and excellence
What we offer
What we offer
  • Generous "Time to Recharge" policy — enjoy unlimited paid time off to rest, recharge, and show up as your best self
  • Amsterdam Program – visit us in Amsterdam (HQ) for 2-4 weeks every year, staying in one of our WeTravel apartments
  • Work remotely for a maximum of 4 weeks per calendar year
  • Extensive paid family leave
  • Three paid volunteer days per year — take time to give back to causes you care about, on us
  • 2-week cross-functional onboarding program
  • Cutting-edge equipment and tools to set you up for success. Coverage for certain work-from-home (WFH) equipment
  • Cambly for colleagues for whom English is not their first language
  • Join an international, travel-loving team with a passion for adventure and innovation
  • Fulltime
Read More
Arrow Right

AI Product Engineer

As an AI Product Engineer at Speak, you'll play a pivotal role in developing the...
Location
Location
United States , San Francisco
Salary
Salary:
170000.00 - 280000.00 USD / Year
speak.com Logo
Speak
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3-5+ years of experience in full stack/backend, product-focused software engineering
  • Proficiency in React/Node/Typescript and Python
  • Real-world experience developing and deploying LLM apps and a strong understanding, gained through experience, of what works and what doesn't
  • A keen intuition for improving performance and output quality of LLM systems
  • Experience with LLM Ops and tools (e.g., vector databases, RAG, prompt ops)
  • Strong product intuition — the ability to think broadly and cross-functionally about innovative LLM-powered capabilities and product experiences
  • Ability to work independently and build at a high velocity
Job Responsibility
Job Responsibility
  • Developing and deploying LLM-powered language learning products across the full stack, as well as enhancing the quality and performance of existing AI-powered features within Speak
  • Collaborating cross-functionally with other Engineering teams, Applied ML, Product, Design, and Content
  • Refining our process for building LLM apps, including best practices for prompting, experimentation/evaluation, LLM Ops, measuring quality and performance, etc.
  • Scaling existing product features to many more users and languages
What we offer
What we offer
  • Offers Equity
  • Join a fantastic, tight-knit team at the right time
  • Do your life's work with people you’ll love working with
  • Global in nature
  • Impact people's lives in a major way
  • Fulltime
Read More
Arrow Right

AI Product Engineer

As an AI Product Engineer at Brain Co., you will play a pivotal role in building...
Location
Location
United States , San Francisco Bay Area
Salary
Salary:
Not provided
brain.co Logo
Brain Co.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 2+ years of experience and an appetite for working directly with the customer to develop the software spec and build from zero
  • Experience with front-end and back-end technologies, microservices, and cloud platforms
  • Experience with modern web tooling such as React, Typescript, RESTful APIs, and database management systems
  • Possess a strong foundation in software design principles, data structures, and algorithms
  • Exhibit excellent problem-solving and analytical skills, with a proactive approach to challenges
  • Enjoys working collaboratively with cross-functional teams
  • Thrive in fast-paced environments where priorities or deadlines may compete
  • Eager to own problems end-to-end and willing to acquire any necessary knowledge to get the job done
  • Hold a Bachelor’s/Master’s degree in Computer Science, Software Engineering, or a related field
Job Responsibility
Job Responsibility
  • Innovate and Deploy: Design, develop, and deploy advanced software solutions that integrate AI to tackle real-world problems, particularly in automating complex, manual processes in government and industrial sectors. Utilize modern web frameworks, microservices architecture, and cloud computing to build applications that apply AI to intricate optimization challenges
  • Make a Big Impact: Interact directly with key customer stakeholders to apply pioneering AI solutions while working alongside experienced ex-founders, government officials/ministers, AI researchers, and engineers. Understand complex business challenges and deliver software solutions powered by AI. Join a dynamic team where ideas are exchanged freely, and creativity flourishes. You will wear many hats: software development, product management, sales, and interpersonal skills
  • Optimize and Scale: Build scalable data pipelines, integrate industrial sensor networks, optimize application performance and reliability, and prepare systems for production. Engage in projects including but not limited to optimizing the world's most advanced energy production systems, modernizing core government workflows, or improving patient outcomes in advanced public healthcare systems
  • Learn and Lead: Stay abreast of the latest developments in software engineering and AI. Participate in code reviews, share knowledge, and set an example with high-quality engineering practices. Mentor junior engineers and lead by example
  • Make a Difference: Monitor and maintain deployed applications to ensure they continue delivering value across various governments worldwide. Work directly with customer engineers and SMEs to develop and tune applications that optimize their workflows and deliver tangible upside. Your work will directly impact how AI benefits individuals, businesses, and society at large
What we offer
What we offer
  • Competitive salary
  • Medical, Dental, and Vision (100% Coverage)
  • Paid Maternity and Paternity Leave
  • 401(k)
  • Daily Lunches
  • Commuter Benefits
  • Unlimited PTO
  • Fulltime
Read More
Arrow Right

AI Product Engineer

As an AI Product Engineer at Brain Co., you will play a pivotal role in building...
Location
Location
United Arab Emirates; Qatar , Abu Dhabi; Doha
Salary
Salary:
Not provided
brain.co Logo
Brain Co.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 2+ years of experience and an appetite for working directly with the customer to develop the software spec and build from zero
  • Experience with front-end and back-end technologies, microservices, and cloud platforms
  • Experience with modern web tooling such as React, Typescript, RESTful APIs, and database management systems
  • Exhibit excellent problem-solving and analytical skills, with a proactive approach to challenges
  • Enjoys working collaboratively with cross-functional teams
  • Thrive in fast-paced environments where priorities or deadlines may compete
  • Eager to own problems end-to-end and willing to acquire any necessary knowledge to get the job done
  • Are eager to travel to customer locations globally for 6+ months and work directly with SMEs and end-users
Job Responsibility
Job Responsibility
  • Innovate and Deploy: Design, develop, and deploy advanced software solutions that integrate AI to tackle real-world problems, particularly in automating complex, manual processes in government and industrial sectors. Utilize modern web frameworks, microservices architecture, and cloud computing to build applications that apply AI to intricate optimization challenges
  • Make a Big Impact: Interact directly with key customer stakeholders to apply pioneering AI solutions while working alongside experienced ex-founders, government officials/ministers, AI researchers, and engineers. Understand complex business challenges and deliver software solutions powered by AI. Join a dynamic team where ideas are exchanged freely, and creativity flourishes. You will wear many hats: software development, product management, sales, and interpersonal skills
  • Optimize and Scale: Build scalable data pipelines, integrate industrial sensor networks, optimize application performance and reliability, and prepare systems for production. Engage in projects including but not limited to optimizing the world's most advanced energy production systems, modernizing core government workflows, or improving patient outcomes in advanced public healthcare systems
  • Learn and Lead: Stay abreast of the latest developments in software engineering and AI. Participate in code reviews, share knowledge, and set an example with high-quality engineering practices. Mentor junior engineers and lead by example
  • Make a Difference: Monitor and maintain deployed applications to ensure they continue delivering value across various governments worldwide. Work directly with customer engineers and SMEs to develop and tune applications that optimize their workflows and deliver tangible upside. Your work will directly impact how AI benefits individuals, businesses, and society at large
What we offer
What we offer
  • Competitive salary plus equity
  • Commuter benefits
  • 401(k)
  • Medical, Dental and Vision
  • Unlimited PTO
  • Fulltime
Read More
Arrow Right

Experienced AI Product Engineer

Build AI-powered features that thousands of product teams rely on - from cluster...
Location
Location
Czechia , Prague
Salary
Salary:
Not provided
productboard.com Logo
ProductBoard
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Professional expertise in building Python applications
  • Proficiency in designing, executing, and maintaining ML systems and solutions in a production environment
  • Familiarity with the management of performance and testing of ML systems
  • Practical experience with message queue systems and a grasp of event-driven architecture
  • A background in data science and LLMs would be highly advantageous
Job Responsibility
Job Responsibility
  • Building AI-powered product features
  • Enhancing and sustaining our internal tech stack, while identifying and incorporating new state-of-the-art technologies
  • Discovering and experimenting across different domains, creating MVPs and POCs, engaging in discussions about findings with fellow engineers and the product team, and planning the execution
  • Collaborating with other engineers, introducing fresh concepts and methodologies to the team
What we offer
What we offer
  • Stock options
  • MacBook + 34″ monitor
  • Budget for online courses, books, and conferences
  • 5 weeks of vacation + 9 sick days
  • Volunteer Days
  • Carrot Fertility Benefits
  • Free snacks, drinks, and yummy catered lunches
  • MultiSport card to access sports facilities
  • Flexible working hours and home office
  • Parental benefits
  • Fulltime
Read More
Arrow Right