Improving Diffusion Planners by Self-Supervised Action Gating with Energies 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Improving Diffusion Planners by Self-Supervised Action Gating with Energies arXiv:2603.02650v2 Announce Type: replace-cross Abstract: Diffusion planners are a strong approach for offline reinforcement learning, but they can fail when value-guided selection favours trajectories that score well yet are locally inconsistent with the environment dynamics, resulting in brittle execution. We propose Self-supervised Action Gating with Energies (SAGE), an inference-time re-ranking method that penalises
相关公司查看全部 (10)
相关产品查看全部 (10)
相关报道查看全部 (1)
Improving Diffusion Planners by Self-Supervised Action Gating with Energies
ArXiv CS.AI2026-06-02