Full-time

Engineering Lead – On-Prem LLM Inference

Posted by Cerebras Systems • Toronto, ON, Canada

📍 Toronto, ON 🕒 March 04, 2026

About the Role

A cutting-edge AI technology company in Toronto is seeking a hands-on engineering leader to oversee its Inference Service Platform. You will lead a team that scales LLM inference on advanced compute clusters while ensuring high availability and performance. Candidates should have significant experience with distributed systems and with managing ML frameworks, with a focus on building enterprise-ready solutions. Join a forward-thinking team dedicated to innovation and excellence in AI development.

Job Details

Location: Toronto, ON
Job Type: Full-time
Category: Other-General
Posted: March 04, 2026
Deadline: April 13, 2026

Ready to Apply?

Submit your application today and take the next step in your career journey with Cerebras Systems.