Full-time

Inference Performance Engineer: AI Speed & Scale

Posted by Cerebras Systems • Toronto, ON, Canada

📍 Toronto, ON 🕒 February 17, 2026

About the Role

A leading AI hardware provider in Toronto seeks an engineer for its inference performance team. The engineer will work at the intersection of hardware and software to improve model inference speed. The role requires a strong background in computer architecture and a degree in Electrical Engineering or Computer Science. Ideal applicants have at least 3 years of experience in relevant domains, including CPU/GPU performance and kernel optimization, along with proficiency in C++ and Python.

Ready to Apply?

Submit your application today and take the next step in your career journey with Cerebras Systems.
