Nearly Tight Bounds for the Continuum-Armed Bandit Problem 论文

2004引用 300

Advanced Bandit Algorithms ResearchOptimization and Search ProblemsMachine Learning and Algorithms

人工智能 Machine Learning and Algorithms Advanced Bandit Algorithms Research Optimization and Search Problems

作者

摘要

In the multi-armed bandit problem, an online algorithm must choose from a set of strategies in a sequence of n trials so as to minimize the total cost of the chosen strategies. While nearly tight upper and lower bounds are known in the case when the strategy set is finite, much less is known when there is an infinite strategy set. Here we consider the case when the set of strategies is a subset of R d, and the cost functions are continuous. In the d = 1 case, we improve on the best-known upper and lower bounds, closing the gap to a sublogarithmic factor. We also consider the case where d&gt; 1 and the cost functions are convex, adapting a recent online convex optimization algorithm of Zinkevich to the sparser feedback model of the multi-armed bandit problem. 1

作者查看全部 (1)

Robert Kleinberg

Nearly Tight Bounds for the Continuum-Armed Bandit Problem 论文

摘要

作者查看全部 (1)

相关技术查看全部 (3)

相关事件

相关文章