Evolved Policy Gradients 文章

OpenAI Blog2018-04-18BLOGen

详细信息

来源站点: OpenAI Blog
文章类型: BLOG
语言: en
发布日期: 2018-04-18

摘要

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate to an object on a different side of the room from where it was placed during training.

Evolved Policy Gradients 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (1)