About the Role
We are building the next generation of GPU‑accelerated recommendation tools, redefining how models are trained and deployed at scale. Our mission is to make developing and productizing GPU‑based recommender systems as seamless, efficient, and powerful as possible. As part of this effort, you will join a world‑class team of ML, HPC, and Software Engineers focused on maximizing training and inference speed while enabling effortless scalability.
What You’ll Be Doing:
+ Profile, analyze, and optimize GPU‑accelerated code to improve training and inference performance for large‑scale recommender systems.
+ Design, implement, and maintain high‑performance C++/CUDA components within our core recommendation framework.
+ Develop and execute tests (unit, integration, and performance) to ensure numerical correctness, stability, and regression prevention in GPU workloads.
+ Collaborate closely with CUDA and ML engineers to interpret profiling results, refine designs, and ...
What You’ll Be Doing:
+ Profile, analyze, and optimize GPU‑accelerated code to improve training and inference performance for large‑scale recommender systems.
+ Design, implement, and maintain high‑performance C++/CUDA components within our core recommendation framework.
+ Develop and execute tests (unit, integration, and performance) to ensure numerical correctness, stability, and regression prevention in GPU workloads.
+ Collaborate closely with CUDA and ML engineers to interpret profiling results, refine designs, and ...
Ready to Apply?
Submit your application today and take the next step in your career journey with NVIDIA.
Apply Now