Full-time

Engineering Lead – On-Prem LLM Inference

Posted by Cerebras Systems • Toronto, ON, Canada

📍 Toronto, ON 🕒 March 04, 2026

About the Role

A cutting-edge AI technology company in Toronto is seeking a hands-on technical engineering leader to oversee their Inference Service Platform. Your role will involve leading a team to scale LLM inference on advanced compute clusters, ensuring high availability and performance. Candidates should have significant experience in distributed systems and managing ML frameworks, with a focus on creating enterprise-ready solutions. Join a forward-thinking team dedicated to innovation and excellence in AI development.
#J-18808-Ljbffr

Ready to Apply?

Submit your application today and take the next step in your career journey with Cerebras Systems.

Apply Now