Global Policy-Space Response Oracles for Two-Player Zero-Sum Games 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Global Policy-Space Response Oracles for Two-Player Zero-Sum Games arXiv:2605.28273v1 Announce Type: new Abstract: The Policy-Space Response Oracles (PSRO) framework scales equilibrium computation to large zero-sum games by iteratively expanding a restricted strategy set using deep reinforcement learning (DRL). A central challenge is to construct, under limited computational budgets, a small strategy population whose induced game well approximates the full game. Existing PSRO variants typically

Global Policy-Space Response Oracles for Two-Player Zero-Sum Games · 相关公司

F
FrameworkCOMPANY
E
EARNNONPROFIT
I
IterRESEARCH_INSTITUTE
A
ANDINONPROFIT
I
ITABCOMPANY
A
ACTNONPROFIT
E
EGINONPROFIT
R
RatioRESEARCH_INSTITUTE
I
iterativeCOMPANY