Applied AI & GPU Software Engineer

Poland, Warsaw · Job Posted May 31, 2026

Job Description

AMD is seeking a Software Engineer to join the Software Ecosystem Enablement team, focused on deploying and optimizing machine learning inference in commercial applications in the content creation and computer-aided engineering (CAE) spaces. This role targets performance, scalability, and memory efficiency across heterogeneous platforms (GPU, NPU, CPU), enabling next-generation ML-driven features in real-world applications.

Job Responsibility

  • Investigate and prototype hybrid ML systems for graphics, simulation, and media-generation pipelines
  • Integrate existing ML models and inference pipelines into commercial software systems
  • Design efficient workload scheduling and distribution across heterogeneous resources
  • Profile workloads across GPU, NPU, and CPU to identify bottlenecks and optimize performance
  • Evaluate runtimes, execution providers, and deployment strategies for modern hardware architectures
  • Collaborate with domain experts and existing GPU engineering teams

Requirements

  • 10+ years of professional software development experience
  • Solid programming fundamentals in C/C++
  • Experience developing or contributing to GPU-accelerated applications
  • Solid understanding of GPU programming fundamentals
  • Debugging experience with GPU kernels or performance-critical code
  • Familiarity with modern ML frameworks and inference systems

Nice to have

  • Experience with denoising, neural rendering, or ML simulation is an asset
  • Experience with content creation apps, CAD/CAE tools, or HPC pipelines is an asset

What we offer

Benefits offered are described in AMD benefits at a glance.

Similar Jobs for Applied AI & GPU Software Engineer

8 matching positions

AI Engineer

Location: Vietnam, Da Nang
Salary: Not provided
Saigon Technology
Expiration Date: Until further notice
Requirements
  • Have programming skills in one of these languages: Python, Java...
  • At least 1 year of experience working on AI/ML projects as a Data Engineer, Research Engineer, or Software Engineer.
  • Have experience with one of the ML/DL frameworks: TensorFlow, PyTorch, Keras, scikit-learn, Pandas, LangChain, LlamaIndex…
  • Have experience working with OpenAI, Gemini, any LLM... to build agents, workflows, or RAG systems.
  • Have experience with one of the chatbot-building frameworks or services like Rasa, Dialogflow, Transformer, BERT, LLM/Prompt…
  • Familiar with OCR algorithms or services: OpenCV, Tesseract, Textract (AWS), Google Cloud Vision, PaddleOCR.
  • Experience with common development tools: Linux, GPU server, Google Colab, Jupyter, Git, Docker.
  • Good English proficiency and communication skills.
Job Responsibility
  • Help develop and apply ML/DL techniques to solve our clients’ business problems, such as building chatbot systems, LLM/Prompt, OCR systems, fraud detection systems, facial recognition systems…
  • Help develop internal products that apply AI models
What we offer
  • Competitive Salary and Brilliant Health Benefits
  • Attractive salary (13th-month salary, salary review twice/year) and project bonus
  • Bonus programs for candidate referral, technical article writing
  • Interest-free loan support for personal plan
  • Allowance for sickness, maternity, paternity and periodic health examination
  • PVI health care program
  • Staff of the quarter and staff of the year awards
  • Progressive and Fun Working Environment
  • A professional English-speaking working environment with Agile – Scrum model
  • Hybrid Working Model: Flexible working time and WFH support.
Employment type: Full-time

Software Engineering Intern

As an AMD intern, you’ll be placed at the epicenter of the AI ecosystem, working...
Location: Serbia, Nis
Salary: Not provided
AMD
Expiration Date: Until further notice
Requirements
  • Currently enrolled in a Serbia-based University in a BSc (3rd or 4th year) or MSc program majoring in Computer Science, Computer Engineering, Electrical Engineering, or related discipline
  • Knowledge/experience with C/C++ programming and Object-Oriented Design
  • Knowledge/experience with operating systems concepts (including memory management, multithreading, assembly, x86) and computer architecture
  • Knowledge/experience with algorithms and data structures
  • Knowledge/experience with GPU and parallel programming (CUDA/HIP) basics
  • Knowledge of AI fundamentals and familiarity with ML frameworks
  • Solid foundations in Python, Linux shell, and Git, with interest in automation and building tools for scalable testing
  • Curiosity about applying AI in software testing, including prompt design, AI‑assisted test generation, and log analysis with LLMs
  • Motivation to contribute to the validation of AI‑assisted systems such as the ROCm software stack for GPU‑accelerated workloads
Job Responsibility
  • Work closely with our software engineers to build and maintain our software libraries, tools and applications
  • Learn debugging methodologies to investigate the root cause of software issues
  • Help craft technical/functional documents and reports as needed
  • Apply a data-minded approach to targeted optimization efforts
What we offer
  • AMD benefits at a glance
Employment type: Full-time

Software Development Engineer

As a core member of the team, you will play a pivotal role in optimizing and dev...
Location: China, Shanghai
Salary: Not provided
AMD
Expiration Date: Until further notice
Requirements
  • Bachelor’s and/or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
  • 5+ years of professional experience in technical software development, with a focus on GPU optimization, performance engineering, and framework development
  • Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments
  • Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential
  • GPU Kernel Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM)
  • Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming
  • Experience leveraging tools like Composable Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform performance
  • Deep Learning Integration: Experienced in integrating optimized GPU performance into machine learning frameworks (e.g., TensorFlow, PyTorch) to accelerate model training and inference
  • Software Engineering: Skilled in Python and C++
  • Experience in debugging, performance tuning, and test design
Job Responsibility
  • Optimize Deep Learning Frameworks: Enhance and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories
  • Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations
  • Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance
  • Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs
  • Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
  • Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
  • Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance
  • Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers
  • Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions
What we offer
  • AMD benefits at a glance

Software Engineering Intern

As an AMD intern, you’ll be placed at the epicenter of the AI ecosystem, working...
Location: Serbia, Belgrade
Salary: Not provided
AMD
Expiration Date: Until further notice
Requirements
  • Currently enrolled in a Serbia-based University in a BSc (3rd or 4th year) or MSc program majoring in Computer Science, Computer Engineering, Electrical Engineering, or related discipline
  • Knowledge/experience with C/C++ programming and Object-Oriented Design
  • Knowledge/experience with operating systems concepts (including memory management, multithreading, assembly, x86) and computer architecture
  • Knowledge/experience with algorithms and data structures
  • Knowledge/experience with GPU and parallel programming (CUDA/HIP) basics
  • Knowledge of AI fundamentals and familiarity with ML frameworks
  • Solid foundations in Python, Linux shell, and Git, with interest in automation and building tools for scalable testing
  • Curiosity about applying AI in software testing, including prompt design, AI‑assisted test generation, and log analysis with LLMs
  • Motivation to contribute to the validation of AI‑assisted systems such as the ROCm software stack for GPU‑accelerated workloads
  • Knowledge/experience with full-stack development (preferably .NET/C# and Angular or React)
Job Responsibility
  • Work closely with our software engineers to build and maintain our software libraries, tools and applications
  • Learn debugging methodologies to investigate the root cause of software issues
  • Help craft technical/functional documents and reports as needed
  • Apply a data-minded approach to targeted optimization efforts
Employment type: Full-time

AI Software Engineer

Meta is seeking a Software Engineer to join our team. The candidate is someone w...
Location: United Kingdom, London
Salary: Not provided
Meta
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in computer science or a related STEM field
  • Specialized experience in one or more of the following AI/deep learning domains: AI infrastructure, hardware accelerators, high performance computing, AI compilers, performance optimizations, GPU architecture, on-device optimization, AI frameworks (PyTorch), HW/SW co-design and numerics
  • Experience developing AI algorithms in C/C++ or Python for large-scale AI applications
Job Responsibility
  • Apply in-depth knowledge of AI infrastructure and hardware acceleration techniques to build and optimize intelligent AI systems that improve Meta’s products and experiences
  • Set goals related to project impact, AI system design, and infrastructure efficiency
  • Deliver impact directly, or by influencing partners, through thorough data analysis
  • Drive large efforts across multiple teams
  • Define use cases, and develop methodology & benchmarks to evaluate different approaches
  • Apply in-depth knowledge of how the AI infra interacts with the other systems around it

Software Development Engineer

As a core member of the team, you will play a pivotal role in optimizing and dev...
Location: China, Shanghai
Salary: Not provided
AMD
Expiration Date: Until further notice
Requirements
  • Bachelor’s and/or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field
  • 5+ years of professional experience in technical software development, with a focus on GPU optimization, performance engineering, and framework development
  • Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments
  • Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices
  • Experience in GPU Kernel Development & Optimization for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM)
  • Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming
  • Experience leveraging tools like Composable Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform performance
  • Experience in Deep Learning Integration into machine learning frameworks (e.g., TensorFlow, PyTorch) to accelerate model training and inference
  • Skilled in Python and C++, with experience in debugging, performance tuning, and test design
  • Solid experience in running large-scale workloads on heterogeneous compute clusters
Job Responsibility
  • Optimize Deep Learning Frameworks: Enhance and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories
  • Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations
  • Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance
  • Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs
  • Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
  • Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
  • Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance
  • Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers
  • Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions
What we offer
  • Benefits offered are described in AMD benefits at a glance

Sr. Software Development Engineer

As a core member of the team, you will play a pivotal role in optimizing and dev...
Location: China, Shanghai
Salary: Not provided
AMD
Expiration Date: Until further notice
Requirements
  • Skilled engineer with strong technical and analytical expertise in C++ development within Linux environments
  • Ability to define goals, manage development efforts, and deliver high-quality solutions
  • Strong problem-solving skills
  • Proactive approach
  • Keen understanding of software engineering best practices
  • Experience in GPU kernel development & optimization for AMD GPUs using HIP, CUDA, and assembly (ASM)
  • Strong knowledge of AMD architectures (GCN, RDNA) and low-level programming
  • Experience leveraging tools like Composable Kernel (CK), CUTLASS, and Triton for multi-GPU and multi-platform performance
  • Experience in integrating optimized GPU performance into machine learning frameworks (e.g., TensorFlow, PyTorch)
  • Skilled in Python and C++
Job Responsibility
  • Optimize Deep Learning Frameworks: Enhance and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories
  • Develop GPU Kernels: Create and optimize GPU kernels to maximize performance for specific AI operations
  • Develop & Optimize Models: Design and optimize deep learning models specifically for AMD GPU performance
  • Collaborate with GPU Library Teams: Work closely with internal teams to analyze and improve training and inference performance on AMD GPUs
  • Collaborate with Open-Source Maintainers: Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
  • Work in Distributed Computing Environments: Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
  • Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance
  • Optimize Deep Learning Pipeline: Enhance the full pipeline, including integrating graph compilers
  • Software Engineering Best Practices: Apply sound engineering principles to ensure robust, maintainable solutions

Senior Principal AI Infrastructure Architect

The Senior Principal AI Infrastructure Architect is a highly skilled and advance...
Location: Italy, Milano
Salary: Not provided
NTT DATA
Expiration Date: Until further notice
Requirements
  • Significant experience in a consulting, presales or architecture role within a large-scale (preferably multi-national) technology services environment, with a track record of leading AI infrastructure pursuits
  • Demonstrable experience designing and delivering production AI platforms — from single multi-GPU servers through to multi-rack training clusters and inference factories
  • Strong working knowledge of the AI hardware vendor landscape (NVIDIA, AMD, Intel, Dell, HPE, Lenovo, Supermicro, Cisco, Pure, VAST, WEKA, DDN, NetApp) and how to position partner ecosystems competitively
  • Proven ability to translate AI workload requirements (model size, parameter count, sequence length, throughput SLOs, latency targets) into accurate hardware bills of materials and sizing justifications
  • Significant client engagement and consulting experience, including client needs assessment, change management and the ability to identify whitespace for follow-on AI infrastructure and managed-services work
  • Significant business development and presales experience on infrastructure-led deals, ideally including sovereign AI, AI Factory or regulated-industry GenAI programmes
  • Strong understanding of how AI infrastructure integrates with business processes, applications, data platforms and existing enterprise architecture
  • Bachelor's degree or equivalent in Information Technology, Engineering, Computer Science or a related field
  • Deep, hands-on knowledge of AI hardware: GPU and accelerator portfolios (NVIDIA Hopper / Blackwell, AMD MI300/MI325, Intel Gaudi 3, emerging custom silicon), host CPU platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), system topologies (HGX, DGX, MGX, OAM) and how each choice maps to specific AI workloads
  • Strong understanding of AI-class storage: parallel filesystems, all-flash NVMe platforms, S3-class object stores, checkpoint and dataset pipelines and the I/O patterns of large-scale training and inference (VAST, WEKA, DDN EXAScaler, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
Job Responsibility
  • Lead the end-to-end design of large, complex AI infrastructure solutions — covering accelerated compute (NVIDIA H100/H200/B200 and GB200 NVL72, AMD Instinct MI300X/MI325X, Intel Gaudi 3), CPU host platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), high-throughput storage tiers and lossless AI fabric — for enterprise, sovereign AI and AI Factory clients
  • Architect reference designs built on NVIDIA DGX/HGX SuperPOD, Dell AI Factory with NVIDIA, Cisco Nexus HyperFabric AI, HPE / Lenovo / Supermicro accelerated compute and equivalent platforms, balancing single-node performance with cluster-scale efficiency
  • Size and validate GPU clusters against real workloads — foundation-model pre-training, distributed fine-tuning, RAG, real-time and batch inference — using the right combination of NVLink/NVSwitch domains, InfiniBand NDR/XDR or Ultra Ethernet / NVIDIA Spectrum-X fabrics and tiered NVMe and parallel storage (VAST, WEKA, DDN, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
  • Define the supporting datacenter design: high-density power (50–140 kW/rack), direct-to-chip and rear-door liquid cooling, structured cabling for AI fabrics and modular deployment models across on-prem, colo and sovereign-cloud footprints
  • Work closely with the sales team to drive the presales process for AI infrastructure pursuits — client discovery, technical workshops, proposal writing, executive presentations and bid defence
  • Translate clients' AI ambitions and business outcomes into a hardware and platform roadmap, positioning NTT DATA's end-to-end portfolio — silicon, systems, storage, fabric, MLOps stack and managed services — to land service-led AI solutions
  • Lead integration of compute, storage, networking, the AI software stack (CUDA, ROCm, Triton, NIM, NVIDIA AI Enterprise, Run:ai, Slurm, Kubernetes / Kubeflow) and managed-service operating models across multiple domains, delivery units and geographies
  • Build business cases, TCO and unit-economics models (cost per token, cost per training run, GPU-hour economics) and end-to-end transition roadmaps for cloud-to-private AI migrations and sovereign AI deployments
  • Define architectural principles for AI infrastructure — accelerator utilisation, data gravity, multi-tenancy, model lifecycle, energy efficiency — and apply them to influence architectural outcomes and governance
  • Develop As-Is, Vision, FMO and To-Be AI platform architectures, identify gaps and develop transition roadmaps
Employment type: Full-time