Gathering human feedback (article)

Details

Source site: OpenAI Blog
Article type: Blog
Language: en
Published: 2017-08-03

Summary

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
