Monte Carlo POMDPs 论文

1999Neural Information Processing Systems引用 234

Reinforcement Learning in RoboticsBayesian Modeling and Causal InferenceMachine Learning and Algorithms

机器人 Machine Learning and Algorithms Reinforcement Learning in Robotics Bayesian Modeling and Causal Inference

作者

摘要

We present a Monte Carlo algorithm for learning to act in partially observable Markov decision processes (POMDPs) with real-valued state and action spaces. Our approach uses importance sampling for representing beliefs, and Monte Carlo approximation for belief propagation. A reinforcement learning algorithm, value iteration, is employed to learn value functions over belief states. Finally, a sample-based version of nearest neighbor is used to generalize across states. Initial empirical results suggest that our approach works well in practical applications.

作者查看全部 (1)

Sebastian Thrun

Monte Carlo POMDPs 论文

摘要

作者查看全部 (1)

相关技术查看全部 (3)

相关事件

相关文章