CrawlJobs Logo

Senior GPU Engineer

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
China , Beijing

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are seeking an expert Senior GPU Engineer to join our AI Infrastructure team. In this role, you will architect and optimize the core inference engine that powers our large-scale AI models. You will be responsible for pushing the boundaries of hardware performance, reducing latency, and maximizing throughput for Generative AI and Deep Learning workloads. You will work at the intersection of Deep Learning algorithms and low-level hardware, designing custom operators and building a highly efficient training/inference execution engine from the ground up.

Job Responsibility:

  • Custom Operator Development: Design and implement highly optimized GPU kernels (CUDA/Triton) for critical deep learning operations (e.g., FlashAttention, GEMM, LayerNorm) to outperform standard libraries
  • Inference Engine Architecture: Contribute to the development of our high-performance inference engine, focusing on graph optimizations, operator fusion, and dynamic memory management (e.g., KV Cache optimization)
  • Performance Optimization: Deeply analyze and profile model performance using tools like Nsight Systems/Compute. Identify bottlenecks in memory bandwidth, instruction throughput, and kernel launch overheads
  • Model Acceleration: Implement advanced acceleration techniques such as Quantization (INT8, FP8, AWQ), Kernel Fusion, and continuous batching
  • Distributed Computing: Optimize communication primitives (NCCL) to enable efficient multi-GPU and multi-node inference (Tensor Parallelism, Pipeline Parallelism)
  • Hardware Adaptation: Ensure the software stack fully utilizes modern GPU architecture features (e.g., NVIDIA Hopper/Ampere Tensor Cores, Asynchronous Copy)

Requirements:

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years of experience in systems programming, HPC, or GPU software development, featuring at least 5 years of hands-on CUDA/C++ kernel development
  • Expertise in the CUDA programming model and NVIDIA GPU architectures (specifically Ampere/Hopper)
  • Deep understanding of the memory hierarchy (Shared Memory, L2 cache, Registers), warp-level primitives, occupancy optimization, and bank conflict resolution
  • Familiarity with advanced hardware features: Tensor Cores, TMA (Tensor Memory Accelerator), and asynchronous copy
  • Proven ability to navigate and modify complex, large-scale codebases (e.g., PyTorch internals, Linux kernel)
  • Experience with build and binding ecosystems: CMake, pybind11, and CI/CD for GPU workloads
  • Mastery of NVIDIA Nsight Systems/Compute
  • Ability to mathematically reason about performance using the Roofline Model, memory bandwidth utilization, and compute throughput

Nice to have:

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 5+ years of experience in systems programming, HPC, or GPU software development, featuring at least 5 years of hands-on CUDA/C++ kernel development
  • Working knowledge of state-of-the-art inference/training stacks: sglang, vLLM, TensorRT-LLM, DeepSpeed, or Megatron-LM
  • Deep understanding of optimization patterns: PagedAttention, RadixAttention (Prefix Caching), continuous batching, and speculative decoding
  • Practical experience with CUTLASS, CuTe, or OpenAI Triton
  • Expertise in high-performance linear algebra (GEMM) optimization, including tiling strategies, data layouts, and mixed-precision accumulation
  • Proficiency in multi-GPU/multi-node scaling using NCCL and parallelism strategies (Tensor, Pipeline, and Sequence parallelism)
  • An AI-native mindset: Expert at using vibe coding tools to bypass boilerplate and accelerate the development lifecycle
  • The technical intuition to architect systems rapidly, moving from 'vibe' to 'highly-optimized production code' with extreme velocity

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior GPU Engineer

Senior Software Engineer - Rendering

Develop custom GPU technologies for the fastest 64-bit painting engine on iOS. Y...
Location
Location
Australia , Hobart; Sydney
Salary
Salary:
Not provided
procreate.com Logo
Procreate
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Firm understanding of systems programming in C/C++
  • Experience with CPU/GPU optimization and debugging
  • Experience with a modern graphics API (Metal/Vulkan/DirectX 11)
  • Deep understanding of modern rendering pipelines and hardware in a realtime domain
  • Competence in linear algebra and trigonometry
  • Experience writing vertex/fragment and compute shaders
  • Excellent verbal and written communication skills
Job Responsibility
Job Responsibility
  • Work on our next generation virtual texturing based rendering and painting engine
  • Solve complex and unique technical problems in a realtime and memory constrained environment
  • Develop next generation software that pushes the hardware to its limits
  • Work with our product designers, product owner and engineers to deliver industry-leading graphics experiences and functionality
What we offer
What we offer
  • Visa Support
  • Fulltime
Read More
Arrow Right

Senior Physical Design Engineer

We are looking for an adaptive, self-motivative physical design engineer to join...
Location
Location
Malaysia , Penang
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MSEE with 2+ years or Bachelor with 5+ years of industrial experience in ASIC design
  • Familiar with Back-End (physical design) EDA tools
  • Hands on experience in large scale ASIC chip physical design
  • Knowledgeable in all aspects of deep submicron ASIC design flow
  • Successfully gone through several complete product development cycles
  • Demonstrate strong problem-solving and work well with cross-functional teams
  • Good listening, writing and speaking English
  • Good communication skills, strong interpersonal skills and flexibility
  • Dedicated, hardworking and good team player
  • Familiar with Unix/Linux environment and good at scripts
Job Responsibility
Job Responsibility
  • Drive and improve AMD's abilities to deliver the MI/Navi series of GPU products to market
Read More
Arrow Right

Senior Systems Software Engineer

As Senior Systems Software Engineer for Vision, you will design, develop, and op...
Location
Location
United States , Palo Alto
Salary
Salary:
137861.00 - 240000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience with Linux and C++ programming
  • Proven experience building large software applications in relevant domains
  • Proficiency in managing memory and hardware resources in C++ and/or Rust
  • Strong skills using C‑APIs to access low‑level system resources (e.g. V4L, POSIX, OpenGL, CUDA)
  • Deep understanding of OS primitives and experience applying them in real‑time systems
  • Demonstrated ability to optimize for low latency and high throughput on modern CPUs
  • Experience working with hardware accelerators through low‑level C‑APIs
  • Background in systems where latency is critical (such as robotics, real‑time control, gaming engines, or similar)
Job Responsibility
Job Responsibility
  • Create and implement novel techniques to minimize system latency and computational load
  • Utilize GPU acceleration for image processing and inference tasks, ensuring high performance and responsiveness in real‑time applications
  • Develop low‑latency vision systems by integrating cameras, sensors, and image‑processing algorithms to enhance robotic perception
  • Build and optimize real‑time streaming solutions, with efficient compression, decompression, and data transmission under limited compute resources
  • Collaborate closely with cross‑functional teams to integrate vision systems into the overall robot architecture, contributing to teleoperation and autonomy stacks
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location
Location
Salary
Salary:
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility
Job Responsibility
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
Read More
Arrow Right

Senior Research Engineer

We are seeking a highly skilled Senior Research Engineer to collaborate closely ...
Location
Location
United States
Salary
Salary:
210000.00 - 309000.00 USD / Year
assembly.ai Logo
Assembly
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong expertise in the Python ecosystem and major ML frameworks (PyTorch, JAX)
  • Experience with lower-level programming (C++ or Rust preferred)
  • Deep understanding of GPU acceleration (CUDA, profiling, kernel-level optimization)
  • TPU experience is a strong plus
  • Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies
  • Solid understanding of the deep learning lifecycle: model design, large-scale training, data processing pipelines, and inference deployment
  • Strong debugging, profiling, and optimization skills in large-scale distributed environments
  • Excellent communication and collaboration skills, with the ability to clearly prioritize and articulate impact-driven technical solutions
Job Responsibility
Job Responsibility
  • Investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems
  • Develop and implement both low-level (operator/kernel) and high-level (system/architecture) optimization strategies
  • Translate research models and prototypes into highly optimized, production-ready inference systems
  • Explore and integrate inference compilers such as TensorRT, ONNX Runtime, AWS Neuron and Inferentia, or similar technologies
  • Design, test, and deploy scalable solutions for parallel and distributed workloads on heterogeneous hardware
  • Facilitate knowledge transfer and bidirectional support between Research and Engineering teams, ensuring alignment of priorities and solutions
What we offer
What we offer
  • competitive equity grants
  • 100% employer-paid benefits
  • flexibility of being fully remote
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Computational Geometry

We are seeking a Senior Software Engineer, Computational Geometry who will be a ...
Location
Location
United States , Chatsworth
Salary
Salary:
140000.00 - 200000.00 USD / Year
machinalabs.ai Logo
Machina Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS or PhD. in Data Science, Computer Science, Machine Learning, Statistics, or a related field
  • 5+ years of experience in computational geometry, path planning, or related fields
  • Strong background in linear algebra, trigonometry, differential equations, and algorithm design
  • Proficiency in both Python and C++, with hands-on use of libraries such as Numpy (Python) or Eigen (C++)
  • Proven ability to manipulate meshes, vectors, and transforms in 2D and 3D
  • Experience with CAD file formats (STEP, IGES) and graphics-adjacent work (GPU acceleration, ray tracing, geometry kernels)
  • A track record of solving complex problems in fast-paced, iterative environments—independently and within teams
  • Excellent communication skills
  • able to explain technical concepts clearly to both engineers and non-engineers
Job Responsibility
Job Responsibility
  • Shape our geometry pipeline by leading the development of robust geometry ingestion and processing systems to transform complex CAD files into robot-ready parts
  • Innovate our path planning through the design and implementation of algorithms that expand the operational envelope of our robotic forming cells
  • Prototype and validate by rapidly testing new approaches while building high-quality, production-ready code for proven solutions
  • Bridge disciplines partnering with robotics and manufacturing engineers to troubleshoot, optimize, and validate geometry and path strategies on physical hardware
  • Model and simulate by building kinematic models, optimizers, and analysis tools to validate paths before they hit the robot floor
  • Uphold excellence through ensuring code quality with testing, documentation, and reviews aligned to Machina’s engineering standards
What we offer
What we offer
  • Medical, Dental, Vision
  • PTO
  • Stock Options
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

This is an exciting new role at Efficy, central to our next-generation CRM innov...
Location
Location
France , Toulouse; Paris
Salary
Salary:
Not provided
efficy.com Logo
efficy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong knowledge of the technical stack with excellent command of AI-related languages and frameworks
  • Experience architecting distributed, orchestrated AI systems (protocols, pipelines, integration into complex workflows)
  • Skills in cost-performance optimization for models (latency, GPU usage, scalability)
  • Strong analytical mindset, technical rigor, and ability to prioritize
  • Autonomy, proactivity, and ability to drive complex initiatives in a rapidly evolving environment
  • Excellent written and verbal communication skills in French and English (fluency required)
Job Responsibility
Job Responsibility
  • Define and execute the AI agents strategy within the CRM platform, ensuring strong business relevance and smooth integration into existing workflows
  • Design agent architectures, including MCP protocol, orchestration, tool calling, memory management, evaluation, contextualization, RAG, and prompt engineering
  • Oversee deployment of AI solutions in both multi-tenant SaaS and on-premise environments
  • Select, adapt, and industrialize European open-source models, optimizing cost-performance trade-offs
  • Ensure security, GDPR compliance, and explainability of all developed agents
  • Implement observability and evaluation tools, including tracing, A/B testing, and cost monitoring
  • Collaborate closely with Data Science, Product, and Engineering teams to guarantee technical and functional alignment
What we offer
What we offer
  • A stable and growing company with an entrepreneurial mindset, where your ideas are valued, and we support you in making them happen
  • High flexibility—hybrid work is part of our DNA
  • State-of-the-art offices where teamwork is the norm
  • International growth opportunities and internal mobility
  • A competitive salary package with and a referral program
  • Engaging events: team lunches, after-work gatherings, sports activities, and trips
  • Learning opportunities: languages, technology, product knowledge, sales techniques, and leadership development
Read More
Arrow Right

Senior Supplier Quality Engineer

The Senior Supplier Quality Engineer role is designed as onsite and focuses on e...
Location
Location
United States , Spring
Salary
Salary:
92600.00 - 213500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8 + years of experience in supplier quality management
  • Must have Lean, Six Sigma
  • Black belt or equivalent preferred
  • Bachelor's or Master's degree in Industrial Engineering, Manufacturing Engineering, Electrical Engineering, or equivalent
  • Extensive experience with using engineering tools and software packages to test, design, and evaluate products and processes
  • Evaluating forms of empirical and root-cause analysis, modeling, testing methodologies, and quality standards to validate product designs, specifications, and processes
  • Excellent analytical, statistical, and problem-solving skills
  • Conducting reviews of material specifications, hardware and electrical component design and processes, and industry quality standards and methodologies
  • Excellent written and verbal communication skills
  • Ability to effectively communicate quality architectures, plans, proposals, and results, and negotiate options at senior management levels
Job Responsibility
Job Responsibility
  • Ensure that products and materials from suppliers meet HPE’s quality standards and their Quality performance meets the set goals
  • Perform supplier site audits in person or virtual for new products, new sites, and new suppliers
  • Collaborate with other HPE functional teams: Sourcing, Planning, R&D Engineering, and Manufacturing Operations to evaluate supplier performance, support other functional team audits, and drive continuous improvements
  • Manage GPU, I/O Adapters, and Storage Controllers quality performance at the supplier’s manufacturing sites, at the HPE’s server integration RSPs and at the customer sites
  • Lead project teams of other quality engineers and internal and outsourced development and manufacturing partners responsible for the design and integration of quality requirements, processes, and practices into development, design, manufacturing, and continuous improvement
  • Review and evaluate product and solution designs and production activities for quality issues
  • Provide feedback based on results and root-cause analysis
  • Design and implement solutions to improve product quality and mitigate risk of failures
  • Drive innovation and integration of new technologies and quality initiatives
  • Provide guidance and mentoring to less-experienced staff members
What we offer
What we offer
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Programs to support career goals
  • Flexibility to manage work and personal needs
  • Unconditional inclusion
Read More
Arrow Right