The Sample Complexity of Multiclass and Sparse Contextual Bandits
Event: PRODUCT_LAUNCH | 2026-05-29 | Impact: MEDIUM
arXiv:2605.29645v1 Announce Type: cross
Abstract: We study contextual bandits in the stochastic i.i.d. setting, where a learner observes contexts drawn from an unknown distribution, selects actions from a finite set $A$, and aims to identify an approximately optimal policy from a given class based on bandit feedback. Motivated by bandit multiclass classification with zero-one rewards, we focus on the $s$-sparse setting in