Member of Technical Staff, Post-Training, RL Infra

Salary: $350,000 - $500,000 yearly

Company: Mirendil
Job Type: Full Time

Job Description - Member of Technical Staff, Post-Training, RL Infra

Mirendil

Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration across science and technology. Our first goal is to democratize frontier AI R&D across scientific disciplines. We believe accelerating scientific discovery is one of the most powerful ways to improve the future of humanity, and that AI will play a central role in making that possible.

We are building a frontier AI research company and training our own models end-to-end. Our work spans areas such as model training, reinforcement learning, reasoning systems, and infrastructure for large-scale experiments. Our team includes researchers and engineers from Anthropic, Google DeepMind, xAI, OpenAI, Microsoft, Apple, and MIT.

The Role

We are looking for engineers to help build the post-training stack for frontier reasoning models. This role sits at the intersection of research and infrastructure. You will push the scale of our RL stack, whether through novel recipe ideas, reliability improvements, or performance work. Example areas you might work on include, but are not limited to:

  • Design and build reliable infrastructure for large-scale RL training

  • Implement novel performance optimizations across the training stack

  • Develop evaluation and benchmarking infrastructure to measure model progress, throughput, and uptime

  • Build data collection and feedback pipelines that close the loop between human signal, reward modeling, and training

  • Collaborate with multiple teams to rapidly iterate on RL algorithms and get experiments into production training runs

If you're excited about building the infrastructure that makes frontier RL research possible at scale, we'd love to hear from you.

We offer a base salary of $350,000–$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.

Original job posting: Member of Technical Staff, Post-Training, RL Infra, posted on GrabJobs.

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.