Planning in the Presence of Cost Functions Controlled by an Adversary 论文

2018Research Showcase @ Carnegie Mellon University (Carnegie Mellon University)引用 225

Reinforcement Learning in RoboticsOptimization and Search ProblemsGame Theory and Applications

机器人 Reinforcement Learning in Robotics Game Theory and Applications Optimization and Search Problems

作者

摘要

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a robot path planning problem where costs are influenced by sensors that an adversary places in the environment. We formulate the problem as a zero-sum matrix game where rows correspond to deterministic policies for the planning player and columns correspond to cost vectors the adversary can select. For a fixed cost vector, fast algorithms (such as value iteration) are available for solving MDPs. We develop efficient algorithms for matrix games where such best response oracles exist. We show that for our path planning problem these algorithms are at least an order of magnitude faster than direct solution of the linear programming formulation.

作者查看全部 (3)

阿

阿夫里姆·布魯姆

Geoffrey J. Gordon

H. Brendan McMahan

Planning in the Presence of Cost Functions Controlled by an Adversary 论文

详细信息

摘要

作者查看全部 (3)

相关技术查看全部 (2)

相关事件

相关文章