Full-time

ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

Posted by Amazon • toronto, on, Canada

📍 toronto, on 🕒 May 26, 2026

Apply for this Job Similar Jobs

About the Role

                    ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and generative AI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS’s custom ML accelerators by crafting high-performance kernels for ML functions at the hardware-software boundary.

Key Responsibilities

Design and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming models.

Analyze and optimize kernel-level performance across multiple generations of Neuron hardware.

Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.

Implement compiler optimizations such as fusion, sharding, tiling, and scheduling.

Work directly with customers t...

Job Details

Location toronto, on
Job Type Full-time
Category Other-General
Posted May 26, 2026
Deadline July 05, 2026

Ready to Apply?

Submit your application today and take the next step in your career journey with Amazon.

Apply Now