Full-time

Inference Performance Engineer: AI Speed & Scale

Posted by Cerebras Systems • Toronto, ON, Canada

📍 Toronto, ON 🕒 February 17, 2026

Apply for this Job Similar Jobs

About the Role

                    A leading AI hardware provider in Toronto seeks an engineer for the inference performance team. Candidates will work at the intersection of hardware and software, enhancing model inference speed. The role demands a strong background in computer architecture and requires a degree in Electrical Engineering or Computer Science. Ideal applicants should have at least 3 years of experience in relevant domains, including CPU/GPU performance and kernel optimization, along with proficiency in C++ and Python.
#J-18808-Ljbffr
                

Job Details

Location Toronto, ON
Job Type Full-time
Category Other-General
Posted February 17, 2026
Deadline March 29, 2026

Ready to Apply?

Submit your application today and take the next step in your career journey with Cerebras Systems.

Apply Now