Learning from human preferences

OpenAI Blog, 2017-06-13 (blog post, English)

Summary

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.
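To make the idea of learning from "which of two behaviors is better" concrete, here is a minimal sketch of fitting a reward function from pairwise comparisons. It is a toy illustration, not the algorithm from the post: it assumes each behavior is summarized by a small hand-made feature vector, that reward is linear in those features, and that the probability of preferring one behavior over another is a logistic function of the reward difference (a Bradley-Terry-style assumption). The names `fit_reward`, `reward`, and the example data are all hypothetical.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def reward(weights, features):
    # Assumed linear reward: a weighted sum of behavior features.
    return sum(w * f for w, f in zip(weights, features))


def fit_reward(comparisons, n_features, lr=0.5, epochs=200):
    """Fit reward weights from (preferred, rejected) feature-vector pairs.

    Maximizes the log-likelihood of the observed preferences under
    P(preferred beats rejected) = sigmoid(reward(preferred) - reward(rejected)).
    """
    weights = [0.0] * n_features
    for _ in range(epochs):
        for preferred, rejected in comparisons:
            p = sigmoid(reward(weights, preferred) - reward(weights, rejected))
            # Gradient of log(p) with respect to the weights is
            # (1 - p) * (preferred - rejected); take one ascent step.
            for i in range(n_features):
                weights[i] += lr * (1.0 - p) * (preferred[i] - rejected[i])
    return weights


# Hypothetical data: in each pair, the human preferred the first behavior.
comparisons = [
    ([1.0, 0.0], [0.0, 1.0]),
    ([0.8, 0.1], [0.2, 0.9]),
    ([0.9, 0.3], [0.1, 0.2]),
]
weights = fit_reward(comparisons, n_features=2)

# The fitted reward should now rank a behavior like the preferred ones higher.
print(reward(weights, [1.0, 0.0]) > reward(weights, [0.0, 1.0]))
```

The point of the sketch is that no one writes the goal function by hand: the weights are recovered only from comparisons, and the learned reward could then be handed to any optimizer or agent.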
