Full-time

Senior LLM Inference & Model Optimization Engineer

Posted by Red Hat • Toronto, ON, Canada

📍 Toronto, ON 🕒 February 19, 2026

About the Role

A leading open-source software company seeks a Machine Learning Engineer in Toronto, Canada. You will focus on model optimization algorithms, working closely with product and research teams. Responsibilities include designing and implementing model compression pipelines and optimizing LLM performance. Ideal candidates should have a strong background in machine learning, programming skills in Python, and familiarity with LLM Inference Optimizations. This position offers a collaborative environment fostering continuous learning and innovation.
#J-18808-Ljbffr

Ready to Apply?

Submit your application today and take the next step in your career journey with Red Hat.

Apply Now