Full-time
Senior Software Engineer, Deep Learning Inference
Posted by NVIDIA • Tel Aviv, Israel, Israel
About the Role
NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.
We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.
What you’ll be doing:
+ Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
+ Optimize inference workloads using sophisticated profiling and simulation tools
+ Build SOLID, extendab...
We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.
What you’ll be doing:
+ Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
+ Optimize inference workloads using sophisticated profiling and simulation tools
+ Build SOLID, extendab...
Ready to Apply?
Submit your application today and take the next step in your career journey with NVIDIA.
Apply Now