Research Scientist, Reinforcement Learning

Company: Deeproute.ai
Job Type: Full Time

Job Description - Research Scientist, Reinforcement Learning

Description

We are building next-generation end-to-end autonomous driving systems powered by reinforcement learning.

You will work on applying RL in closed-loop, safety-critical environments, leveraging large-scale simulation and real-world driving data to improve safety, comfort, and robustness.

  • Train and deploy RL policies in closed-loop driving environments
  • Scale RL training using massively parallel simulation systems
  • Design and optimize reward functions for complex driving behaviors
  • Improve sim-to-real transfer for real-world robustness
  • Collaborate with cross-functional teams to integrate models into production systems


Requirements

Core Technical Skills

  • Proficiency in modern RL algorithms: DQN, PPO, SAC, TD3, etc.
  • Proficiency in modern RLHF algorithms: PPO, DPO, GRPO, etc.
  • Hands-on experience training reward models and fine-tuning LLM/VLM/VLA models
  • Knowledge of distributed RL training at scale
  • Proficiency with massively parallel simulation environments
  • Knowledge of sim-to-real transfer techniques and domain randomization
  • Proficiency in Python; comfortable with C++
  • Proficiency in deep learning frameworks such as PyTorch
  • Experience with distributed training frameworks (Ray, Horovod, etc.)
  • Knowledge of model optimization (quantization, pruning) and CUDA is a plus
  • Knowledge of traffic rules and driving behavior modeling

Preferred Qualifications

  • Publications in top-tier venues (ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, ICRA, IROS, etc.)
  • Open-source contributions to RL libraries or autonomous driving projects
  • Previous experience with LLM fine-tuning using RLHF
  • Knowledge of safe RL, interpretable AI, or robustness techniques
  • Familiarity with autonomous vehicle regulations and safety standards

Original job posting: Research Scientist, Reinforcement Learning, posted on GrabJobs.

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.