Agentic RL Researcher Distributed Computing

Salary :

$106,000 - 156,000 yearly

Company : Huawei Technologies Canada Co.

Job Type : Full Time

10 Aviva Way Markham, Ontario

Number of Applicants

000+

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Job Description - Agentic RL Researcher Distributed Computing

Job description

Huawei Canada has an immediate permanent opening for a Researcher.

About the team:

The Distributed Data Storage and Management Lab leads research in distributed data systems, aiming to develop next-generation cloud serverless products that encompass core infrastructure and databases. This lab addresses various data challenges, including cloud-native disaggregated databases, pay-by-query user models, and optimizing low-level data transfers via RDMA. Teams within this lab create advanced cloud serverless data infrastructure and implement cutting-edge networking technologies for Huawei's global AI infrastructure.

About the job:

Design and develop advanced Agentic Reinforcement Learning (RL) and Multi-Agent Reinforcement Learning (MARL) algorithms for cooperative, competitive, and mixed-agent environments, including CTDE, decentralized learning, and hierarchical agent systems.
Build scalable simulation and training platforms for large-scale agent systems, supporting self-play, population-based training, curriculum learning, and emergent behavior analysis.
Optimize multi-agent learning performance on distributed compute clusters, improving sample efficiency, credit assignment, agent coordination, communication learning, and training stability.
Research and prototype new approaches for multi-agent intelligence, including communication protocols, credit assignment, game-theoretic learning dynamics, meta-learning, and adaptive agent populations.
Translate cutting-edge research in agentic AI and MARL into production-ready systems for real-world or high-fidelity simulated environments.
Develop benchmarking frameworks and evaluation metrics for agent coordination, robustness, scalability, and safety.
Collaborate with research, infrastructure, and product teams to deploy scalable agentic learning systems in real-world applications.
Contribute to technical leadership and innovation through publications, patents, open-source contributions, and conference presentations.

The total target annual compensation for this position ranges from $106,000 to $156,000 depending on education, experience, and demonstrated expertise.

Job requirements

About the ideal candidate:

MS or PhD in Computer Science, Electrical Engineering, or a related field, with a focus on Reinforcement Learning, Multi-Agent Systems, Agentic AI, or Distributed AI.
Strong expertise in reinforcement learning algorithms, particularly in multi-agent settings (e.g., policy gradients, value-based methods, CTDE, credit assignment, and coordination in non-stationary environments).
Solid foundations in optimization, probability, and game theory, with the ability to design and analyze complex learning systems.
Experience building scalable RL training infrastructure, including distributed rollouts, large-scale simulation, and experiment pipelines.
Strong programming skills in Python and/or C++, with experience developing high-performance or distributed ML systems.
Demonstrated impact through research publications, open-source contributions, patents, or production ML systems in reinforcement learning, multi-agent learning, or large-scale AI systems.

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team, we do not use artificial intelligence tools to screen or select candidates.

Original job Agentic RL Researcher Distributed Computing posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

About the Company

Huawei Technologies Canada Co.

Huawei is a global leader of ICT solutions. Continuously innovating based on customer needs, we are committed to enhancing customer experiences and creating maximum value for telecom carriers, enterprises, and consumers. Our telecom network equipment, IT products and solutions, and smart devices are...

Similar Researcher Jobs in Canada

Get your Resume Reviewed for Free

Email address

Why are you reporting this job?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.

Setup your job alert:

Frequency

By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime. Skip