Logo-of-Bespoke-Labs-hiring-for-jobs-in-US-on-GrabJobs

Member of Technical Staff: RL Environments

icon building Company : Bespoke Labs
icon briefcase Job Type : Full Time
icon remote-alt Remote / Work from Home

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
icon loader

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Member of Technical Staff: RL Environments

About Bespoke Labs

Bespoke Labs is an applied AI research lab pioneering data curation and RL environment curation for the modern agentic world. We curated Open Thoughts, one of the best open reasoning datasets used by multiple frontier labs, trained SOTA specialized models such as Bespoke-MiniChart-7B and Bespoke-MiniCheck, and taught agents to do multi-turn tool-calling with reinforcement learning.

Bespoke is uniquely positioned to capture a large market share of data and RL environment curation.

About the Role

As a member of our technical staff, you will work on designing RL environment curation strategies. This involves coming up with recipes and strategies of creating RL environments. Ideal candidates are problem solvers who can understand the problem in a scientific way and can solve the problem practically.

What you will do

  1. Build our curation platforms for building/collecting/curating RL environments and data curation.

  2. Do research on cutting edge curation strategies, especially for RL environments.

  3. Come up with data and environment recipes, and work with contractors to create RL environments.

  4. Verify whether environments are high quality, by checking for reward hacking, and training small scale agents.

  5. Do data analysis to uncover insights about the environments.

Who you are

  1. PhD/MS in ML, and/or industry experience

  2. Proficiency in languages like Python and experience with cloud platforms (GCP, AWS, etc.).

  3. Ability to design systems that scale to handle large volumes of data and complex workflows.

  4. Have extreme patience reading transcripts of rollouts.

  5. A self-starter who is excited about working on hard technical problems in AI and data-centric platforms.

  6. Have experience designing robust CI/CD pipelines, automated testing, observability, and monitoring.

  7. Passionate about data curation, AI, RL environments, and post-training.

Original job Member of Technical Staff: RL Environments posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Share Job
Share Job

Auto-Apply to Member of Technical Staff: RL Environments Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Member of Technical Staff: RL Environments Jobs in the US

GrabJobs is the no1 job portal in the US, connecting you to thousands of jobs fast! Find the best jobs in the US, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.