The Knowledge-Gradient Policy for Correlated Normal Beliefs (Paper)

2009 · INFORMS Journal on Computing · Citations: 434

Artificial Intelligence · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Reservoir Engineering and Simulation Methods

Abstract

We consider a Bayesian ranking and selection problem with independent normal rewards and a correlated multivariate normal belief on the mean values of these rewards. Because this formulation of the ranking and selection problem models dependence between alternatives' mean values, algorithms may use this dependence to perform efficiently even when the number of alternatives is very large. We propose a fully sequential sampling policy called the knowledge-gradient policy, which is provably optimal in some special cases and has bounded suboptimality in all others. We then demonstrate how this policy may be applied to efficiently maximize a continuous function on a continuous domain while constrained to a fixed number of noisy measurements.
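The belief model described in the abstract can be illustrated with a minimal sketch, assuming the standard conjugate update for a multivariate normal prior and a single normally distributed observation. The function names `update_belief` and `kg_factor_mc` are hypothetical and do not come from the paper. The value of measuring an alternative is estimated here by plain Monte Carlo as a stand-in, which is not necessarily how the authors compute it.

```python
import math
import random

def update_belief(mu, sigma, x, y, noise_var):
    """Condition the belief N(mu, sigma) on one noisy observation y of
    alternative x, where the observation noise has variance noise_var."""
    n = len(mu)
    col = [sigma[i][x] for i in range(n)]      # column x of sigma
    denom = noise_var + sigma[x][x]            # predictive variance of y
    gain = (y - mu[x]) / denom
    new_mu = [mu[i] + gain * col[i] for i in range(n)]
    new_sigma = [[sigma[i][j] - col[i] * col[j] / denom for j in range(n)]
                 for i in range(n)]
    return new_mu, new_sigma

def kg_factor_mc(mu, sigma, x, noise_var, n_samples=20000, seed=0):
    """Monte Carlo estimate of E[max_i mu'_i] - max_i mu_i, the expected
    one-step gain in the best posterior mean if x is measured next."""
    rng = random.Random(seed)
    n = len(mu)
    scale = math.sqrt(noise_var + sigma[x][x])
    # Before y is seen, the updated mean is distributed as mu + sigma_tilde * Z
    # with Z standard normal.
    sigma_tilde = [sigma[i][x] / scale for i in range(n)]
    total = 0.0
    for _ in range(n_samples):
        z = rng.gauss(0.0, 1.0)
        total += max(mu[i] + sigma_tilde[i] * z for i in range(n))
    return total / n_samples - max(mu)

def choose_next(mu, sigma, noise_var):
    """Pick the alternative with the largest estimated one-step gain."""
    gains = [kg_factor_mc(mu, sigma, x, noise_var) for x in range(len(mu))]
    return max(range(len(mu)), key=lambda x: gains[x])
```

Because the covariance matrix links the alternatives, one observation of alternative `x` shifts the means of every alternative correlated with it. This is the dependence that the abstract says algorithms may use when the number of alternatives is large.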

Authors (3)

Savaş Dayanik

Warren B. Powell

Peter I. Frazier
