Full-time
Senior AI Systems Engineer for Model Development
Posted by AMD • markham, york region, Canada
About the Role
Drive AI model development focusing on large-scale training optimization and high-performance inference. Enhance GPU capabilities to achieve world-class computational efficiency and reliability.
This senior engineering role is designed for candidates who excel in developing AI infrastructure and optimizing GPU performance. You will manage comprehensive training environments while addressing issues that arise during distributed processing. Experience with LLMs and GPU kernel development is essential.
Key Responsibilities:
• Ensure efficient large-scale model training on GPUs
• Architect solutions for complex inference serving frameworks
• Optimize pipeline reliability and performance monitoring
• Debug training issues across GPU generations
• Collaborate with architecture teams on performance enhancements
Requirements:
• Significant experience in AI/ML technologies and infrastructure
• Proven expertise in G...
This senior engineering role is designed for candidates who excel in developing AI infrastructure and optimizing GPU performance. You will manage comprehensive training environments while addressing issues that arise during distributed processing. Experience with LLMs and GPU kernel development is essential.
Key Responsibilities:
• Ensure efficient large-scale model training on GPUs
• Architect solutions for complex inference serving frameworks
• Optimize pipeline reliability and performance monitoring
• Debug training issues across GPU generations
• Collaborate with architecture teams on performance enhancements
Requirements:
• Significant experience in AI/ML technologies and infrastructure
• Proven expertise in G...
Ready to Apply?
Submit your application today and take the next step in your career journey with AMD.
Apply Now