RL Training Environment Engineer Intern – Reinforcement Learning Environments

Software Development

San Francisco, CA

Posted 2 months ago

Internship

Apply Now

Are you applying to the internship?

Job Description

Machine Learning Engineer, RL Environments – Internship | Preference Model

The Tone:
This is an internship at Preference Model, located in San Francisco, CA (remote considered). Preference Model is building automated ML research engineering, focusing on creating high-quality RL training environments for large language models to address the brittleness of existing frontier models. This role is crucial for developing the complex, real-world RL environments needed to advance AI closer to achieving its transformative potential, in collaboration with leading AI labs.

The TL;DR
• Role: Internship
• Type: Temporary
• Location: In-person, San Francisco, CA

• Mission: This person will design, implement, and evaluate novel RL training environments for large language models.
• Tech Stack: Python, Docker, CUDA kernels, low-level GPU programming

What You’ll Actually Do
• Design: Design and build RL environments that test LLM reasoning on ML, systems, and research problems.
• Code: Write clean, production-grade Python code, not just research prototypes.
• Operate: Work with Docker, build reproducible environments, and debug when things break.
• Translate: Translate ML papers and concepts into concrete training tasks.
• Evaluate: Conduct experiments and evaluations, delivering your work into production training runs.

The Must-Haves
• Background: Student (undergrad or PhD) in Computer Science, Machine Learning, Math, Physics, or a related field.
• Experience: Ability to write real code, not just research prototypes; familiarity with how LLMs work, what they’re good at, and where they fall short; ability to work independently, take feedback, and iterate fast.
• Skills: Strong Python skills.
• Bonus: Understanding of transformer internals and experience with training or inference code; experience writing CUDA kernels or working with low-level GPU programming; deep knowledge in a research area (evidenced by publications, public code, or strong coursework); broad understanding across ML subfields and ability to connect ideas; experience building interactive environments, simulations, or complex software systems.

Software Engineer Intern – Low-Latency Trading Systems Internship

Tower Research Capital

Posted 2 weeks ago New York, NY Software Development $3.5K - $5.7K / week

View

Engineering Internship – AI Engineering Internship

Notion

Posted 3 weeks ago San Francisco, CA Software Development $57 - $61 / hour

View

Full Stack Engineering Intern – Mobile App Development Full TimeInternship

CloutCred

Posted 3 weeks ago Remote Software Development

View

Machine Learning Engineer Full TimeInternship

Tesla

Posted 3 weeks ago Fremont, CA Software Development $40 - $56 / hour

View

Date Posted

2 months ago
Location

San Francisco, CA
Expiration date

--
Gender

Neutral
Qualification

Doctorate Degree
Career Level

Student

AI Resume Builder

LinkedIn Optimizer

AI Cover Letter Trending

AI Mock Interview Trending

EzApply Chrome Extension

AI Pitch Generator New

RL Training Environment Engineer Intern – Reinforcement Learning Environments

Are you applying to the internship?

Job Description

Related Jobs

Products

For Candidates

Company

Welcome to Internexxus

Reset Password

Welcome to Internexxus

RL Training Environment Engineer Intern – Reinforcement Learning Environments

Are you applying to the internship?

Job Description

Share this post

Related Jobs

Login to Internexxus

Reset Password

Create a free Internexxus account

Products

For Candidates

Company