Noisy Networks For Exploration 论文

2018arXiv (Cornell University)引用 274

Reinforcement Learning in RoboticsAdversarial Robustness in Machine LearningExplainable Artificial Intelligence (XAI)

机器人 Reinforcement Learning in Robotics Adversarial Robustness in Machine Learning Explainable Artificial Intelligence (XAI)

关系图谱

作者

摘要

We introduce NoisyNet, a deep reinforcement learning agent with parametric noise added to its weights, and show that the induced stochasticity of the agent's policy can be used to aid efficient exploration. The parameters of the noise are learned with gradient descent along with the remaining network weights. NoisyNet is straightforward to implement and adds little computational overhead. We find that replacing the conventional exploration heuristics for A3C, DQN and dueling agents (entropy reward and $\epsilon$-greedy respectively) with NoisyNet yields substantially higher scores for a wide range of Atari games, in some cases advancing the agent from sub to super-human performance.

作者查看全部 (12)

Demis Hassabis

Shane Legg

Charles Blundell

Olivier Pietquin

Noisy Networks For Exploration 论文

摘要

作者查看全部 (12)

相关技术查看全部 (2)

相关事件

相关文章