H

Intern Engineer RL Post-Training for LLMs

salary Salary :

$58,000 - 104,000 yearly

icon briefcase Job Type : Internship

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Intern Engineer RL Post-Training for LLMs

Job description

Huawei Canada has an immediate 6-12 months internship opening for an Intern Researcher.

About the team: 

The Computing Data Application Acceleration Lab aims to create a leading global data analytics platform organized into three specialized teams using innovative programming technologies. This team focuses on full-stack innovations, including software-hardware co-design and optimizing data efficiency at both the storage and runtime layers. This team also develops next-generation GPU architecture for gaming, cloud rendering, VR/AR, and Metaverse applications. One of the goals of this lab are to enhance algorithm performance and training efficiency across industries, fostering long-term competitiveness.

About the job:

  • Develop and optimize RL post-training pipelines for LLMs (e.g., GRPO, reward modeling).

  • Conduct experiments to improve model performance, reasoning, and alignment.

  • Build scalable training, evaluation, and data generation systems.

  • Collaborate with researchers and engineers on cutting-edge LLM projects

  • Stay current with advancements in RL, LLMs, and post-training research.

The total target annual compensation (based on 2,080 hours per year) ranges from $58,000 to $104,000 depending on education, experience, and demonstrated expertise.

Job requirements

About the ideal candidate:

  • Enrolled as Master or Ph.D. student in Computer Science, AI, or related field.

  • Strong background in machine learning, reinforcement learning, and deep learning. Familiarity with Large Language Models, transformer architectures, and post-training methods.

  • Proficiency in Python, PyTorch, and LLM frameworks.

  • Hands-on experience with LLMs and RL training algorithms (e.g., GRPO) is an asset.

  • Familiarity with RL frameworks, such as VeRL.

  • Experience with open-source LLM frameworks such as Hugging Face, DeepSpeed, vLLM, or SGLang is an asset.

  • Knowledge of domain-specific languages used with AI accelerators.

  • Experience with distributed training frameworks, large-scale experimentation, or LLM infrastructure is an asset.

  • Strong problem-solving and communication skills

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team, we do not use artificial intelligence tools to screen or select candidates.

Original job Intern Engineer RL Post-Training for LLMs posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

About the Company

Huawei Technologies Canada Co.,

Huawei is a global leader of ICT solutions. Continuously innovating based on customer needs, we are committed to enhancing customer experiences and creating maximum value for telecom carriers, enterprises, and consumers. Our telecom network equipment, IT products and solutions, and smart devices are...

Read more about the company

Auto-Apply to Intern Engineer RL Post-Training for LLMs Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Intern Engineer RL Post-Training for LLMs Jobs in Canada

GrabJobs is the no1 job portal in Canada, connecting you to thousands of jobs fast! Find the best jobs in Canada, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.