CrawlJobs Logo

GPU Application Engineer

amd.com Logo

AMD

Location Icon

Location:
Poland , Warsaw

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are looking for a Software Engineer to join an ISV GPU enablement team focused on accelerating real‑world applications on AMD GPUs. In this role, you will support ongoing GPU enablement efforts by contributing to HIP backend integration, validating application workflows, and assisting with debugging and performance analysis of GPU‑accelerated software. You will work closely with experienced GPU engineers and contribute to production‑level projects used by professional workstation applications.

Job Responsibility:

  • Contribute to the design and implementation of HIP backends for applications without AMD GPU support
  • Assist with integrating HIP ray tracing (HIPRT) where relevant
  • Optimize GPU kernels and application pipelines for AMD GPU architectures
  • Debug correctness, performance, and stability issues in application-level GPU code
  • Use profiling tools to identify bottlenecks and propose improvements
  • Work with internal compiler/runtime teams to surface lower-level issues when necessary
  • Help maintain clean, portable GPU code that works across multiple vendors

Requirements:

  • 1–3 years of professional software development experience, or strong academic / project‑based experience
  • Solid programming fundamentals in C/C++
  • Practical experience with CUDA or ROCm
  • Experience developing or contributing to GPU-accelerated applications
  • Solid understanding of GPU programming fundamentals
  • Debugging experience with GPU kernels or performance critical code
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related discipline

Nice to have:

  • Development experience on Linux
  • Experience porting CUDA code to HIP (manually or using hipify)
  • Exposure to GPU ray tracing concepts or APIs (HIPRT, OptiX, etc.)
  • Experience working in larger, multi-backend codebases is a plus

Additional Information:

Job Posted:
March 25, 2026

Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for GPU Application Engineer

Software System Design Engineer

Bring AMD’s leading-edge SoCs to life in a virtual environment! As an early cons...
Location
Location
United Kingdom , Cambridge
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • University degree in Engineering or higher or equivalent experience is preferred
  • Proven experience in designing and implementing maintainable, performant, professional C++11/14/17 code
  • Familiarity with x86, ARM, or GPU architecture, including drivers and applications
  • Proficiency in Linux and Windows kernel debugging
  • Prior involvement in functional modelling, architecture simulation, or hypervisor development
  • Familiarity with modelling technologies such as QEMU, VirtualBox, SystemC, Gem5, SIMICS, dynamic binary translation, or JIT
  • Experience of cross-platform development for Linux and Windows environments
Job Responsibility
Job Responsibility
  • Develop high-performance FFMs (Fast-Functional models) for AMD SoCs and platforms using C++
  • Collaborate closely with internal and external customers to understand and assist in running their workloads on the models
  • Create test plans and validate model functionality using Python
  • Improve functionality, stability, and performance of existing models
  • Develop innovative modelling features to streamline workload debugging
  • Close collaboration with architecture teams to foster in-depth understanding of new designs
Read More
Arrow Right

Senior Manager, Performance AI/ML Network Deployment Engineering

The Senior Manager, DC GPU Advanced Forward Deployment and Systems Engineering i...
Location
Location
United States , Santa Clara
Salary
Salary:
210400.00 - 315600.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in networking and performance optimization for large-scale AI/ML networks, including network, compute, storage cluster design, modelling, analytics, performance tuning, convergence, scalability improvements
  • Prefer candidates with solid, hands-on expertise in at least one or more of 3 domains, namely compute, network, storage
  • Experience in working with large customers such as Cloud Service Providers and global enterprise customers
  • Proven leadership in engaging customers with diverse technical disciplines in avenues such as Proof of Concept, Competitive evaluations, Early Field Trials etc
  • Direct experience in working with large customers and can operate with sense of urgency, own the problems and resolve it
  • Demonstrated leadership in network architecture, hands on experience in RoCEv2 Design, VXLAN-EVPN, BGP, and Lossless Fabrics
  • Proven ability to influence design and technology roadmaps, leveraging a deep understanding of datacenter products and market trends
  • Extensive hands-on Network deployment expertise and proven track record of delivering large projects on time. Cisco, Juniper or Arista experience is preferred
  • Direct, co-development/deployment experience in working with strategic customers/partners in bringing solutions to market
  • Excellent communication level from engineer to mid-management to C-level of audience
Job Responsibility
Job Responsibility
  • Collaborate with strategic customers on scalable designs involving compute, networking, storage environment, work with industry partners, Internal teams to accelerate the deployment, adoption of various AI/ML models
  • Engage system-level triage and at-scale debug of complex issues across hardware, firmware, and software, ensuring rapid resolution and system reliability
  • Drive the ramp of Instinct-based large scale AI datacenter infrastructure based on NPI base platform hardware with ROCm, scaling up to pod and cluster level, leveraging the best in network architecture for AI/ML workloads
  • Enhance tools and methodologies for large-scale deployments to meet customer uptime goals and exceed performance expectations
  • Engage with clients to deeply understand their technical needs, ensuring their satisfaction with tailored solutions that leverage your past experience in strategic customer engagements and architectural wins
  • Provide domain specific knowledge to other groups at AMD, share the lessons learnt to drive continuous improvement
  • Engage with AMD product groups to drive resolution of application and customer issues
  • Develop and present training materials to internal audiences, at customer venues, and at industry conferences
Read More
Arrow Right

Senior Systems Software Engineer

As Senior Systems Software Engineer for Vision, you will design, develop, and op...
Location
Location
United States , Palo Alto
Salary
Salary:
137861.00 - 240000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience with Linux and C++ programming
  • Proven experience building large software applications in relevant domains
  • Proficiency in managing memory and hardware resources in C++ and/or Rust
  • Strong skills using C‑APIs to access low‑level system resources (e.g. V4L, POSIX, OpenGL, CUDA)
  • Deep understanding of OS primitives and experience applying them in real‑time systems
  • Demonstrated ability to optimize for low latency and high throughput on modern CPUs
  • Experience working with hardware accelerators through low‑level C‑APIs
  • Background in systems where latency is critical (such as robotics, real‑time control, gaming engines, or similar)
Job Responsibility
Job Responsibility
  • Create and implement novel techniques to minimize system latency and computational load
  • Utilize GPU acceleration for image processing and inference tasks, ensuring high performance and responsiveness in real‑time applications
  • Develop low‑latency vision systems by integrating cameras, sensors, and image‑processing algorithms to enhance robotic perception
  • Build and optimize real‑time streaming solutions, with efficient compression, decompression, and data transmission under limited compute resources
  • Collaborate closely with cross‑functional teams to integrate vision systems into the overall robot architecture, contributing to teleoperation and autonomy stacks
  • Fulltime
Read More
Arrow Right
New

Ai Solutions Architect / Field Application Engineer

We are looking for an AI enthusiast with strong technical fundamentals and custo...
Location
Location
United States , Austin
Salary
Salary:
102320.00 - 153480.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent practical experience)
  • Strong interest in AI/ML technologies and a desire to work across hardware and software layers
  • Hands-on experience with Linux-based systems
  • Programming experience in one or more of the following: Python, C/C++, Bash
  • Familiarity with AI frameworks or tools (e.g., PyTorch, TensorFlow, ONNX, Hugging Face, or similar)
  • Strong communication skills with the ability to explain technical concepts clearly
  • Ability to work effectively in a team-oriented, cross-functional environment
Job Responsibility
Job Responsibility
  • Serve as a technical point of contact for customers, supporting AI and HPC workloads on AMD CPU and GPU platforms
  • Work directly with customers to understand their use cases, requirements, and constraints, and guide them through solution design and deployment
  • Deliver technical presentations, demos, and architecture walkthroughs to both technical and non-technical audiences
  • Program-manage customer opportunities as they grow in complexity, coordinating activities across internal and external stakeholders
  • Perform hands-on system bring-up including hardware installation, firmware configuration, OS installation, and driver setup
  • Deploy and validate open-source AI and HPC software stacks (e.g., Linux, ROCm, AI frameworks, containers)
  • Run functionality, performance, and scalability benchmarks on CPU and GPU workloads
  • Perform first-level profiling and analysis of applications to identify performance bottlenecks and optimization opportunities
  • Support AI workloads such as training, inference, and data preprocessing across CPU and GPU platforms
  • Develop working knowledge of AMD CPU and GPU architectures and how they impact real-world workloads
  • Fulltime
Read More
Arrow Right

Ai Solutions Architect / Field Application Engineer

We are looking for an AI enthusiast with strong technical fundamentals and custo...
Location
Location
United States , Austin
Salary
Salary:
128400.00 - 192600.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent practical experience)
  • Strong interest in AI/ML technologies and a desire to work across hardware and software layers
  • Hands-on experience with Linux-based systems
  • Programming experience in one or more of the following: Python, C/C++, Bash
  • Familiarity with AI frameworks or tools (e.g., PyTorch, TensorFlow, ONNX, Hugging Face, or similar)
  • Strong communication skills with the ability to explain technical concepts clearly
  • Ability to work effectively in a team-oriented, cross-functional environment
Job Responsibility
Job Responsibility
  • Serve as a technical point of contact for customers, supporting AI and HPC workloads on AMD CPU and GPU platforms
  • Work directly with customers to understand their use cases, requirements, and constraints, and guide them through solution design and deployment
  • Deliver technical presentations, demos, and architecture walkthroughs to both technical and non-technical audiences
  • Program-manage customer opportunities as they grow in complexity, coordinating activities across internal and external stakeholders
  • Perform hands-on system bring-up including hardware installation, firmware configuration, OS installation, and driver setup
  • Deploy and validate open-source AI and HPC software stacks (e.g., Linux, ROCm, AI frameworks, containers)
  • Run functionality, performance, and scalability benchmarks on CPU and GPU workloads
  • Perform first-level profiling and analysis of applications to identify performance bottlenecks and optimization opportunities
  • Support AI workloads such as training, inference, and data preprocessing across CPU and GPU platforms
  • Develop working knowledge of AMD CPU and GPU architectures and how they impact real-world workloads
  • Fulltime
Read More
Arrow Right
New

Senior Software Development Engineer

We are seeking an experienced and highly technical SMTS Software Development Eng...
Location
Location
United Kingdom
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or related technical field
  • 8+ years of software engineering experience in systems software, runtime libraries, GPU programming, or compiler/runtime interfaces
  • Strong proficiency in modern C++ (C++14/C++17 or newer), templates, memory models, and low‑level systems programming
  • Deep understanding of at least one GPU computing model (HIP, CUDA, SYCL, OpenCL, OpenMP offload)
  • Hands‑on experience with runtime systems, driver interfaces, or high‑performance compute libraries
  • Strong debugging skills using tools such as gdb, sanitizers, profilers, and GPU debugging tools
  • Solid understanding of parallel programming concepts—memory hierarchy, synchronization, concurrency, thread scheduling
Job Responsibility
Job Responsibility
  • Architect, implement, and optimize features in the HIP runtime, including memory management, kernel dispatch, device abstraction, multi‑GPU coordination, and synchronization primitives
  • Contribute to the evolution of the HIP programming model and interoperability with ROCr, HSA runtime, and compiler toolchains
  • Ensure functional correctness, performance, and scalability of runtime APIs across different GPU generations
  • Conduct root‑cause analysis and systems‑level debugging across the runtime, driver, compiler, and hardware layers
  • Profile GPU applications and internal runtime components to identify bottlenecks and design performance improvements
  • Optimize HIP runtime behavior for large-scale AI, HPC, and cloud workloads
  • Work closely with compiler teams (LLVM/Clang), driver teams, GPU architecture, and systems engineers to deliver end‑to‑end GPU software solutions
  • Contribute to API specifications and collaborate with upstream open-source communities where appropriate
  • Define and drive technical strategy for correctness, reliability, and conformance of the HIP runtime
  • Support enhancements in automated testing, CI, and stress/failure scenarios in the HIP test suite
Read More
Arrow Right

Senior Hardware Quality Engineer

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the...
Location
Location
Taiwan , Taipei
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in Electrical Engineering, or related field AND 3+ years technical engineering experience
  • Bachelor's Degree in Electrical Engineering, or related field AND 5+ years technical engineering experience
  • equivalent experience
  • 5+ years of work experience in managing product quality in the electronic industry
  • 5+ years of direct engineering experience in hardware system issue resolution for GPU Servers
  • Versed in filtering through applicable debug data, like telemetry and logs to identify and investigate HW failure signatures
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Develop and implement a robust supplier quality management strategy to ensure the data center hardware is manufactured at the highest level of quality standards
  • Lead quality issues and improvement task force to contain, mitigate, and resolve the top-quality issues impacting global data centers
  • Conduct debug and failure analysis for GPU subsystems in the Azure fleet and drive resolution with partners and suppliers
  • Drive the continuous improvement process based on Root Cause Analysis (RCA) and identified opportunities
  • Responsible for quality readouts based on your telemetry data analysis, to bring clarity on status, actions across the organization and next steps for issue resolution
  • Establish Critical-to-Quality performance metrics to measure and improve product quality
  • Fulltime
Read More
Arrow Right

Software Engineer - Performance Tools

Join our team as a Software Engineer - Performance Tools and take the lead in il...
Location
Location
United States , San Jose
Salary
Salary:
150000.00 - 275000.00 USD / Year
etched.com Logo
Etched
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong proficiency in C++ or Rust
  • Proficiency in Python is a plus
  • Deep understanding of computer architecture (CPU, GPU, accelerators), memory hierarchies (caches, DRAM), and interconnects (especially PCIe)
  • Proven experience in low-level performance analysis, profiling, and bottleneck identification on complex hardware systems (GPUs, CPUs, FPGAs, or custom ASICs)
  • Experience with performance analysis tools (e.g., NVIDIA Nsight, AMD uProf, Intel VTune, perf, Tracy, ETW)
  • Experience working close to hardware, potentially reading performance counters or interacting directly with device drivers
Job Responsibility
Job Responsibility
  • Tool Architecture & Design: Lead the design and architecture of a comprehensive performance analysis suite, including data collection mechanisms, data processing pipelines, analysis engines, and user interfaces (CLI and/or GUI)
  • Low-Level Data Collection: Develop robust methods to capture performance data directly from our custom ML accelerator hardware (e.g., hardware performance counters, execution unit status, memory access patterns) via driver interfaces or other mechanisms
  • Host & System Tracing: Implement tracing for host-side API calls (runtime libraries, driver interactions) and system-level events (CPU activity, PCIe traffic, memory usage, network contention) related to Sohu workloads
  • Data Correlation & Synchronization: Design and implement techniques to accurately correlate performance events across the host CPU, device driver, PCIe bus, multiple accelerators, and multiple hosts, ensuring precise time synchronization
  • Performance Analysis Engine: Build analysis modules to automatically interpret collected trace and counter data, identifying key performance limiters (e.g., compute-bound, memory bandwidth-bound, latency-bound, PCIe-bound, specific hardware bottlenecks)
  • Visualization & Reporting: Develop intuitive visualizations (timelines, dependency graphs, resource utilization charts, statistical summaries) to clearly communicate performance characteristics and bottlenecks to users
  • Collaboration & Support: Work closely with hardware architects, firmware engineers, driver developers, compiler engineers, and ML application engineers to understand their needs, define tool requirements, and provide expert guidance on performance analysis and optimization using the tool
What we offer
What we offer
  • Medical, dental, and vision packages with generous premium coverage
  • $500 per month credit for waiving medical benefits
  • Housing subsidy of $2k per month for those living within walking distance of the office
  • Relocation support for those moving to San Jose (Santana Row)
  • Various wellness benefits covering fitness, mental health, and more
  • Daily lunch + dinner in our office
  • Fulltime
Read More
Arrow Right