Plan online, learn offline: Efficient learning and exploration via model-based control 文章

OpenAI Blog2018-11-05BLOGen