Faulty reward functions in the wild (Article)

OpenAI Blog · 2016-12-21 · Blog · en

Abstract

Reinforcement learning algorithms can break in surprising, counterintuitive ways. This post explores one such failure mode: a misspecified reward function.
