Evolved Policy Gradients 文章

OpenAI Blog2018-04-18BLOGen

详细信息

来源站点
OpenAI Blog
文章类型
BLOG
语言
en
发布日期
2018-04-18

摘要

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate to an object on a different side of the room from where it was placed during training.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据