Intern Engineer – RL Post-Training for LLMs
Posted by Huawei Technologies Canada Co., Ltd. • Vancouver, British Columbia, Canada
About the Role
Job description
Huawei Canada has an immediate 6-12 months internship opening for an Intern Researcher.
About the team:
The Computing Data Application Acceleration Lab aims to create a leading global data analytics platform organized into three specialized teams using innovative programming technologies. This team focuses on full-stack innovations, including software-hardware co-design and optimizing data efficiency at both the storage and runtime layers. This team also develops next-generation GPU architecture for gaming, cloud rendering, VR/AR, and Metaverse applications. One of the goals of this lab are to enhance algorithm performance and training efficiency across industries, fostering long-term competitiveness.
About the job:
Develop and optimize RL post-training pipelines for LLMs (e.g., GRPO, reward modeling).
Conduct experiments to improve model performance, reasoning, and alignment.
Ready to Apply?
Submit your application today and take the next step in your career journey with Huawei Technologies Canada Co., Ltd..
Apply Now