Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

This simulated robot is being trained to do ballet via a human giving feedback. It’s not obvious how to specify a reward function to achieve the same behavior.

The release contains three main components:

The entire system consists of less than 1,000 lines of Python code (excluding the agents). After you’ve set up your web server you can launch an experiment by running:

`1$ python rl_teacher/teach.py -p human --pretrain_labels 175 -e Reacher-v1 -n human-175`

Humans can give feedback via a simple web interface (shown above), which can be run locally (not recommended) or on a separate machine. Full documentation is available on the project’sGitHub repository⁠(opens in a new window). We’re excited to see what AI researchers and engineers do with this technology—pleaseget in touch⁠with any experimental results!

Tom Brown, Dario Amodei, Paul Christiano

Scaling laws for reward model overoptimization Publication Oct 19, 2022

Introducing Whisper Release Sep 21, 2022

Learning to play Minecraft with Video PreTraining Conclusion Jun 23, 2022

Research * Research Index * Research Overview * Economic Research

Latest Advancements * GPT-5.5 * GPT-5.4 * GPT-5.3 Instant

Safety * Safety Approach * Deployment Safety(opens in a new window) * Security & Privacy * Trust & Transparency

Products * ChatGPT(opens in a new window) * ChatGPT Business(opens in a new window) * ChatGPT Enterprise(opens in a new window) * ChatGPT for Education(opens in a new window) * Codex

API Platform * Overview * API Log In(opens in a new window) * Docs(opens in a new window)

Business * Overview * Solutions * Resources * Contact Sales

Developers * Apps SDK(opens in a new window) * Open Models * Docs(opens in a new window) * Resources(opens in a new window) * Developer Forum(opens in a new window)

Company * About Us * Our Charter * Careers * News

Support * Help Center(opens in a new window)

More * Stories * Academy * Livestreams * Podcast * RSS

Terms & Policies * Terms of Use * Privacy Policy * Other Policies

(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)(opens in a new window)

English United States

Gathering human feedback

Lightning Engine Boosts Apache Spark Performance by Up to 4.9x

Trump Anticipates AI Companies Will Contribute to Public Good

US Agencies Face Three-Day Deadline for Cybersecurity Fixes Amid Rising Threats

The Indian government got cold feet on Starlink just before SpaceX’s IPO

Latest Briefs