PyTorch-RL 产品

来源: githubOPEN_SOURCE开源PythonMIT发布于 2017-10-17

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.