Low Level HPC Developer Job at Randstad (Kuala Lumpur)

Platform Specialist (Quant)

My client is a top-ranked, technology-driven trading firm that pushes tech to th...

Location

Singapore , Singapore

Salary:

220000.00 USD / Year

Hunter Bond

Expiration Date

Until further notice

Requirements

Linux Expertise: 2+ Years of professional experience working deeply within Linux environments
Observability: Hands-on experience with monitoring and observability stacks like Prometheus and Grafana
Development: Strong scripting and development ability, specifically in Python
Configuration Management: Experience managing large-scale environments using configuration management tools
Specialization: Practical experience with HPC (High-Performance Computing) environments

Job Responsibility

Infrastructure Design: Designing and researching the next generation of low-latency and High-Performance Computing (HPC) infrastructure
Systems Support: Providing high-tier support for the firm's mission-critical trading platforms
Technical Research: Investigating low-level system performance bottlenecks and hardware/software optimizations
Project Leadership: Assisting on high-end infrastructure projects with a clear path to leading your own technical workstreams

Fulltime

Senior Software Systems Designer

We are looking for a highly skilled Senior System Software Designer to design an...

Location

India , Bangalore

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Hands-on experience with performance profiling tools (e.g., AMD uProf, perf, VTune, rocProfiler)
Strong understanding of microarchitecture concepts (pipelines, caches, branch prediction, memory hierarchy)
Experience working with hardware performance counters (PMC), IBS, or similar sampling techniques
Familiarity with OS internals (Linux kernel, schedulers, memory management, tracing frameworks)
Experience with distributed/HPC workloads (MPI, OpenMP, large-scale systems)
Exposure to trace analysis, call stacks, and sampling-based profiling models
Knowledge of container environments and system-level debugging is a plus
Experience contributing to cross-platform tools and frameworks
Bachelors or master's degree in electrical or computer engineering.

Job Responsibility

Design and develop system-level profiling tools spanning CPU, memory, IO, and power analysis
Build and optimize data collection frameworks leveraging hardware counters (PMC), IBS, and OS tracing
Develop low-overhead profiling infrastructure for large-scale and long-running workloads
Enhance performance analysis pipelines including data processing, correlation, and visualization
Enable cross-platform profiling support across Linux, Windows, and emerging OS ecosystems (e.g., FreeBSD)
Work on advanced analysis techniques such as top-down microarchitecture analysis, pipeline utilization, and bottleneck detection
Contribute to CLI and GUI-based tools for performance debugging and visualization
Integrate support for runtime and framework-level tracing (OpenMP, MPI, Java, Python, etc.)
Collaborate with CPU, GPU, kernel, and compiler teams to enable new hardware features in profiling tools
Drive automation and intelligent analysis, including AI/ML-assisted performance insights

Fulltime

Senior HPC Software Engineer (C++)

The role focuses on the high- and detailed-level design of scientific processing...

Location

Malaysia , Kuala Lumpur

Salary:

Not provided

Randstad

Expiration Date

July 04, 2026

Requirements

Expert-level software development skills in C or C++
Deep knowledge of low-level optimization, including threading, concurrency, and loop unrolling
A history of advanced work in highly-parallel computing, numerical processing, or large I/O
Excellent written and spoken technical English for client interaction

Job Responsibility

High- and detailed-level design of scientific processing software
Implementation, testing, and optimization using C, C++, Python, and Java
Making heavy use of CPUs and GPUs to solve scientific problems
Making critical decisions on when to hand-vectorize code for performance gains
Acting as 3rd-level technical support for unresolvable customer issues

Fulltime

Senior Software Development Engineer

We are seeking an experienced and highly technical SMTS Software Development Eng...

Location

United Kingdom

Salary:

Not provided

AMD

Expiration Date

Until further notice

Requirements

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or related technical field
8+ years of software engineering experience in systems software, runtime libraries, GPU programming, or compiler/runtime interfaces
Strong proficiency in modern C++ (C++14/C++17 or newer), templates, memory models, and low‑level systems programming
Deep understanding of at least one GPU computing model (HIP, CUDA, SYCL, OpenCL, OpenMP offload)
Hands‑on experience with runtime systems, driver interfaces, or high‑performance compute libraries
Strong debugging skills using tools such as gdb, sanitizers, profilers, and GPU debugging tools
Solid understanding of parallel programming concepts—memory hierarchy, synchronization, concurrency, thread scheduling

Job Responsibility

Architect, implement, and optimize features in the HIP runtime, including memory management, kernel dispatch, device abstraction, multi‑GPU coordination, and synchronization primitives
Contribute to the evolution of the HIP programming model and interoperability with ROCr, HSA runtime, and compiler toolchains
Ensure functional correctness, performance, and scalability of runtime APIs across different GPU generations
Conduct root‑cause analysis and systems‑level debugging across the runtime, driver, compiler, and hardware layers
Profile GPU applications and internal runtime components to identify bottlenecks and design performance improvements
Optimize HIP runtime behavior for large-scale AI, HPC, and cloud workloads
Work closely with compiler teams (LLVM/Clang), driver teams, GPU architecture, and systems engineers to deliver end‑to‑end GPU software solutions
Contribute to API specifications and collaborate with upstream open-source communities where appropriate
Define and drive technical strategy for correctness, reliability, and conformance of the HIP runtime
Support enhancements in automated testing, CI, and stress/failure scenarios in the HIP test suite

Firmware Engineering Manager - HPC/AI

Develop and maintain embedded software for HPC platforms, including low-level dr...

Location

United States

Salary:

137000.00 - 315000.00 USD / Year

Hewlett Packard Enterprise

Expiration Date

Until further notice

Requirements

First level university degree or equivalent experience required
May have advanced university degree
Typically 5 or more years of related work experience, including 0 -2 years of people management experience
Strong leadership skills, including coaching, team-building, and conflict resolution
Advanced project management skills including time and risk management, resource prioritization, and project structuring
Strong analytical and problem solving skills
Ability to manage human capital across geographies to drive workforce development and achieve desired results
Strong verbal and written communication skills, including negotiation, presentation, and influence skills
Advanced business acumen, technical knowledge, and extensive knowledge in applications and technologies
Strong multi-tasking and prioritization skills

Job Responsibility

Provides direct and ongoing leadership for a team of individual contributors designing and developing new products, enhancements and updates. and coordinating projects for systems software, including low-level drivers, hardware interface layers, monitoring agents, networking stacks, and development/test tooling
Manages headcount, deliverables, schedules, and costs for multiple ongoing projects, ensuring that resources are appropriately allocated and that goals, objectives, timelines, and budgets are met in accordance with program and organizational roadmaps
Communicates project status and escalates issues to direct managers, program managers, and internal and external development partners
Manages relationships with outsourced partners and suppliers, including setting expectations regarding deliverables, product quality, schedules, and costs
ensures that team members are effectively communicating and collaborating with outsourced resources
Proactively identifies opportunities for process improvement and cost reductions opportunities
Provides people-care management for assigned team members, including hiring, setting and monitoring of annual performance plans, coaching, and career development
ensures that proper knowledge and career development tools are in place to support ongoing team member and process development

What we offer

Health & Wellbeing
Personal & Professional Development
Unconditional Inclusion

Fulltime

Platform Architect

As a Platform Architect, you will lead the definition and realization of our AI ...

Location

United States , San Jose

Salary:

150000.00 - 275000.00 USD / Year

Etched

Expiration Date

Until further notice

Requirements

8+ years of experience in system or server hardware architecture, ideally in HPC, AI infrastructure, or hyperscale data centers
Deep understanding of PCIe protocols and topologies, including bifurcation, retimer tuning, switch fabrics, and accelerator communication
Experience with rack-level and multi-rack system design, including shared power and networking infrastructure
Strong expertise in BMC systems, control buses, telemetry integration, and orchestration tooling
Familiarity with modern high-speed networking technologies: 400G Ethernet, InfiniBand, CXL fabrics, and NIC-switch integration
Proven background in power architecture for dense compute systems, including power budgeting, sequencing logic, and VRM optimization
Rack-level management infrastructure design experience, including CDU layout, telemetry aggregation, and rack controller implementation
Proven track record of building infrastructure for at-scale deployment, such as automated diagnostics, health monitoring, and fleet orchestration frameworks
Understanding of thermal design principles such as airflow, heatsink selection, and liquid cooling systems
A systems-level perspective with the ability to design scalable, maintainable, and high-performance platforms

Job Responsibility

Architect the end-to-end hardware system stack, including server-level components, rack-scale systems, and multi-rack POD designs optimized for AI and high-performance workloads
Design and implement advanced PCIe Gen5/Gen6 topologies: root complex architecture, retimer placement, switch hierarchy, and accelerator fan-out strategies
Define scalable BMC architecture and platform management features across fleet deployments, including telemetry pipelines, orchestration hooks, and API integrations (e.g., Redfish, IPMI)
Specify and lead the implementation of chip-to-chip interconnects such as NVLink, UCIe, and other emerging high-bandwidth, low-latency fabrics
Develop integration strategies for power distribution, control planes, cooling systems (air and liquid), and shared interconnect fabrics at the rack level
Own the networking architecture across servers and racks, including 400G/800G Ethernet, leaf-spine switching, NIC-to-ToR planning, and cross-rack topology
Specify power delivery systems for high-density, multi-kilowatt platforms: VRM selection, power trees, sequencing, and protection logic
Guide system design decisions with awareness of mechanical and thermal constraints to ensure performance, manufacturability, and serviceability
Contribute to rack-level management infrastructure: CDU planning, telemetry aggregation, rack controller architecture, and out-of-band control
Support bring-up and validation teams in debugging complex issues at the system, rack, and POD levels

What we offer

Medical, dental, and vision packages with generous premium coverage
$500 per month credit for waiving medical benefits
Housing subsidy of $2k per month for those living within walking distance of the office
Relocation support for those moving to San Jose (Santana Row)
Various wellness benefits covering fitness, mental health, and more
Daily lunch + dinner in our office

Fulltime

Infrastructure Intern

Our infrastructure role drives the development and adoption of next-generation i...

Location

United States , San Jose

Salary:

Not provided

Etched

Expiration Date

Until further notice

Requirements

Progress towards a Bachelor’s, Master’s, or PhD degree in Computer Science, Engineering, or a related technical field
Proficiency in C/C++ or Rust
Proficiency in Python (for interns interested in systems software)
Strong fundamentals in data structures and algorithms
Strong understanding of low-level software engineering
Strong understanding of hardware/software co-design
Excellent communication and collaboration skills

Job Responsibility

Drives the development and adoption of next-generation infrastructure tooling, enabling Etched ASIC, Software, and Platform engineers to iterate faster, build more reliably, and push the boundaries of AI performance
Working on our hybrid-cloud high-performance compute (HPC) clusters, massively parallel CI executions, Infrastructure-as-code, scale-out observability platform with LLM integration, and building high-quality tools that engineers love using

What we offer

12-week paid internship (June - August 2026)
Generous housing support for those relocating
Daily lunch and dinner in our office
Direct mentorship from industry leaders and world-class engineers
Opportunity to work on one of the most important problems of our time

Performance Engineer - Inference

Engineers on the inference performance team operate at the intersection of hardw...

Location

Canada , Toronto

Salary:

Not provided

Cerebras Systems

Expiration Date

Until further notice

Requirements

Bachelors / Masters / PhD in Electrical Engineering or Computer Science
Strong background in computer architecture
Exposure to and understanding of low-level deep learning / LLM math
Strong analytical and problem-solving mindset
3+ years of experience in a relevant domain (Computer Architecture, CPU/GPU Performance, Kernel Optimization, HPC)
Experience working on CPU/GPU simulators
Exposure to performance profiling and debug on any system pipeline
Comfort with C++ and Python

Job Responsibility

Build performance models (kernel-level, end-to-end) to estimate the performance of state of the art and customer ML models
Optimize and debug our kernel micro code and compiler algorithms to elevate ML model inference speed, throughput and compute utilization on the Cerebras WSE
Debug and understand runtime performance on the system and cluster
Develop tools and infrastructure to help visualize performance data collected from the Wafer Scale Engine and our compute cluster

What we offer

Build a breakthrough AI platform beyond the constraints of the GPU
Publish and open source their cutting-edge AI research
Work on one of the fastest AI supercomputers in the world
Enjoy job stability with startup vitality
Our simple, non-corporate work culture that respects individual beliefs

Select Country

Low Level HPC Developer

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?