Full-time

Senior Deep Learning Kernel Software Performance Architect

Posted by NVIDIA • Santa Clara, CA, United States

📍 Santa Clara, CA 🕒 March 03, 2026

About the Role

We are now looking for a Senior Kernel Performance Architect for Deep Learning Software!


NVIDIA is seeking extraordinary architects to develop processor and system architectures that accelerate machine learning, data analytics and high-performance computing applications. This position offers the chance to create a meaningful impact in a dynamic, technology-focused company.


What you will be doing:
+ Craft GPU-accelerated system architectures that push the boundaries of deep learning performance.
+ Prototype high-performance software for deep learning and data analytics workloads.
+ Analyze, visualize, and optimize software performance using analytical models, simulators, and test suites.
+ Collaborate closely across NVIDIA teams such as:
+ CUDA Compiler teams to identify performance issues.
+ AI/ML training and inference performance teams to identify and optimize critical deep learning layers.
+ hardware architecture performance team...

Ready to Apply?

Submit your application today and take the next step in your career journey with NVIDIA.

Apply Now