On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming 论文

2004Mathematics of Operations Research引用 381

Reinforcement Learning in RoboticsOptimization and Search ProblemsMachine Learning and Algorithms

机器人 Machine Learning and Algorithms Reinforcement Learning in Robotics Optimization and Search Problems

作者

摘要

In the linear programming approach to approximate dynamic programming, one tries to solve a certain linear program—the ALP—that has a relatively small number K of variables but an intractable number M of constraints. In this paper, we study a scheme that samples and imposes a subset of m≪M constraints. A natural question that arises in this context is: How must m scale with respect to K and M in order to ensure that the resulting approximation is almost as good as one given by exact solution of the ALP? We show that, given an idealized sampling distribution and appropriate constraints on the K variables, m can be chosen independently of M and need grow only as a polynomial in K. We interpret this result in a context involving controlled queueing networks.

作者查看全部 (2)

Benjamin Van Roy

Daniela Pucci de Farias

On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming 论文

摘要

作者查看全部 (2)

相关技术查看全部 (3)

相关事件

相关文章