The Epoch-Greedy algorithm for contextual multi-armed bandits 论文

2007引用 330
Advanced Bandit Algorithms ResearchMachine Learning and AlgorithmsReinforcement Learning in Robotics

The Epoch-Greedy algorithm for contextual multi-armed bandits · 相关技术