Offline Reinforcement Learning with Generative Trajectory Policies 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

Offline Reinforcement Learning with Generative Trajectory Policies arXiv:2510.11499v2 Announce Type: replace-cross Abstract: Generative models have emerged as a powerful class of policies for offline reinforcement learning (RL) due to their ability to capture complex, multi-modal behaviors. However, existing methods face a stark trade-off: slow, iterative models like diffusion policies are computationally expensive, while fast, single-step models like consistency policies often suffer from degr