Logo-of-Rakuten-hiring-for-jobs-in-Singapore-on-GrabJobs

EDB-IPP Project: Advancing GPU Optimization for Large Language Models

icon building Company : Rakuten
icon briefcase Job Type : Internship

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - EDB-IPP Project: Advancing GPU Optimization for Large Language Models

Job Description:

Rakuten Asia, in partnership with the Economic Development Board (EDB) through the Industrial Postgraduate Programme (IPP), is seeking new PhD students. We are looking for individuals with a robust understanding of deep learning, machine learning, and natural language processing to contribute to our innovative research projects.

Essential requirements include proven hands-on expertise and strong engineering skillsets, specifically in the development and training of PyTorch models.

IPP Programme Benefits
Candidates successfully selected for this programme will receive full sponsorship for their postgraduate studies and will be hired by Rakuten Asia upon successful completion.

Collaboration Model

The collaboration will include joint PhD student supervision, shared access to computational resources for large-scale model compression experiments, and regular research exchanges. Output will include high-impact publications, open-source tools, and demonstrable prototypes of efficient AI.

Project Outline

Introduction

Rakuten is committed to advancing the frontier of AI infrastructure, with a strong focus on optimizing large-scale GPU clusters for training and serving Large Language Models (LLMs). As models grow in size and complexity—ranging from dense architectures to mixture-of-experts (MoE)—achieving efficiency across training, inference, and deployment has become increasingly critical. Our GPU Optimization department combines deep system expertise and significant computational assets, and we are seeking strategic collaborations with leading universities to jointly tackle these challenges.

Proposed Research Areas

We propose collaborative research in the following areas, with flexibility to refine topics based on mutual expertise:

  • Efficient Scheduling for Sparse & Dense LLMs:

Design token-aware, load-balanced scheduling algorithms for MoE and hybrid LLM workloads that reduce inter-GPU communication and optimize heterogeneous cluster utilization.

  • Efficient Inference for State Space Models

Develop high-throughput, low-latency inference techniques for state space models, leveraging their linear-time properties to outperform traditional attention mechanisms in long-context scenarios.

  • Memory-Aware Training & Serving

Explore advanced quantization, memory-efficient checkpointing, offloading strategies, and dynamic memory management techniques to support training and inference of ultra-large models.

  • Scalable Parallelism for LLMs

Investigate hybrid parallelism (data, model, pipeline, expert) and communication-reduction strategies tailored for scaling LLMs across thousands of GPUs.

  • Hardware-Aware Optimization

Develop compiler, kernel, and data layout optimizations that fully exploit features of modern GPU architectures, improving throughput for both dense and sparse model operations.

  • High-Throughput, Low-Latency Inference

Create optimized model serving strategies using speculative decoding, continuous batching, expert routing, and adaptive computation for production-grade LLM applications.

Rakuten is an equal opportunities employer and welcomes applications regardless of sex, marital status, ethnic origin, sexual orientation, religious belief or age.
Original job EDB-IPP Project: Advancing GPU Optimization for Large Language Models posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

About the Company

Rakuten

楽天市場はインターネット通販が楽しめる総合ショッピングモール。楽天ポイントがどんどん貯まる!使える!毎日お得なクーポンも。食品から家電、ファッション、ベビー用品、コスメまで、充実の品揃え。

Read more about the company

Auto-Apply to Project: Advancing GPU Optimization for Large Language Models Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Project: Advancing GPU Optimization for Large Language Models Jobs in Singapore

GrabJobs is the no1 job portal in Singapore, connecting you to thousands of jobs fast! Find the best jobs in Singapore, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.