This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Platform Architect, you will lead the definition and realization of our AI server platform architecture, from server board design to rack-level integration and multi-rack POD-scale system orchestration. This is a hands-on technical leadership role that requires deep expertise in PCIe and fabric topologies, power and thermal constraints, system controls, and high-speed networking. Key responsibilities will include creating advanced new platform architecture for next generation Sohu AI servers as part of a future new product development roadmap. You will collaborate cross-functionally with electrical, mechanical, thermal, firmware, and operations teams to architect systems that scale from a single server to full-rack and multi-rack POD deployments.
Job Responsibility:
Architect the end-to-end hardware system stack, including server-level components, rack-scale systems, and multi-rack POD designs optimized for AI and high-performance workloads
Design and implement advanced PCIe Gen5/Gen6 topologies: root complex architecture, retimer placement, switch hierarchy, and accelerator fan-out strategies
Define scalable BMC architecture and platform management features across fleet deployments, including telemetry pipelines, orchestration hooks, and API integrations (e.g., Redfish, IPMI)
Specify and lead the implementation of chip-to-chip interconnects such as NVLink, UCIe, and other emerging high-bandwidth, low-latency fabrics
Develop integration strategies for power distribution, control planes, cooling systems (air and liquid), and shared interconnect fabrics at the rack level
Own the networking architecture across servers and racks, including 400G/800G Ethernet, leaf-spine switching, NIC-to-ToR planning, and cross-rack topology
Specify power delivery systems for high-density, multi-kilowatt platforms: VRM selection, power trees, sequencing, and protection logic
Guide system design decisions with awareness of mechanical and thermal constraints to ensure performance, manufacturability, and serviceability
Contribute to rack-level management infrastructure: CDU planning, telemetry aggregation, rack controller architecture, and out-of-band control
Support bring-up and validation teams in debugging complex issues at the system, rack, and POD levels
Requirements:
8+ years of experience in system or server hardware architecture, ideally in HPC, AI infrastructure, or hyperscale data centers
Deep understanding of PCIe protocols and topologies, including bifurcation, retimer tuning, switch fabrics, and accelerator communication
Experience with rack-level and multi-rack system design, including shared power and networking infrastructure
Strong expertise in BMC systems, control buses, telemetry integration, and orchestration tooling
Familiarity with modern high-speed networking technologies: 400G Ethernet, InfiniBand, CXL fabrics, and NIC-switch integration
Proven background in power architecture for dense compute systems, including power budgeting, sequencing logic, and VRM optimization
Rack-level management infrastructure design experience, including CDU layout, telemetry aggregation, and rack controller implementation
Proven track record of building infrastructure for at-scale deployment, such as automated diagnostics, health monitoring, and fleet orchestration frameworks
Understanding of thermal design principles such as airflow, heatsink selection, and liquid cooling systems
A systems-level perspective with the ability to design scalable, maintainable, and high-performance platforms
Excellent communication skills and experience collaborating with hardware, firmware, validation, and mechanical engineering teams
What we offer:
Medical, dental, and vision packages with generous premium coverage
$500 per month credit for waiving medical benefits
Housing subsidy of $2k per month for those living within walking distance of the office
Relocation support for those moving to San Jose (Santana Row)
Various wellness benefits covering fitness, mental health, and more