Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Machine Learning models are at the core of Waymo's fully autonomous driving technology. Our models allow the Waymo Driver to perceive the world around it, make the right decision for every situation, and deliver people safely to their destinations. We think deeply and solve complex technical challenges in areas like perception, planning and control while collaborating with hardware and systems engineers. Waymo interns work with leaders in the industry on projects that deliver significant impact to the company.
Job Responsibilities:
Design and implement CUDA C++ kernels for tasks such as graph processing, tree traversal, and raw sensor data preparation
Conduct performance profiling of existing critical path workloads to identify bottlenecks and propose improvements
Prototype and evaluate novel approaches for accelerating existing CPU workloads on GPUs
Requirements:
Currently enrolled in a Master's or PhD program focused on High-Performance Computing, GPU Architecture, Systems, or a related field
Strong understanding of CUDA C++ and parallel algorithm design
Experience with CPU/GPU performance profiling tools such as Nsight Compute, pprof, and Perfetto