Gathering human feedback 文章

OpenAI Blog2017-08-03BLOGen

摘要

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

相关事件查看全部 (1)

Gathering human feedback
2017-08-03OPEN_SOURCE影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据