Full-time

AI Computing Development Engineer, TensorRT and TensorRT-LLM

Posted by NVIDIA • Shanghai, China, China

📍 Shanghai, China 🕒 June 17, 2026

Apply for this Job Similar Jobs

About the Role

                    NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like generative AI, computer vision, speech recognition, recommender systems, and large-scale language and multimodal models. Join the team building the inferencing software (TensorRT/TensorRT-LLM) that will be used across our product lines. The ability to work in a fast-paced, delivery-focused environment is required, and excellent interpersonal skills are a must.
  
  
What you'll be doing:
+ Design and develop robust inferencing software (TensorRT/TensorRT-LLM) optimized for functionality and performance across platforms
+ Perform performance analysis, optimization, and tuning of deep learning inference workloads
+ Track and integrate academic and industry advancements in AI and feature-update TensorRT/TensorRT-LLM accordingly
+ Provide feedback into archit...
                

Job Details

Location Shanghai, China
Job Type Full-time
Category other-general
Posted June 17, 2026
Deadline June 23, 2026

Ready to Apply?

Submit your application today and take the next step in your career journey with NVIDIA.

Apply Now