On Distributional Reinforcement Learning in Chaotic Dynamical Systems

ArXiv CS.AI, 2026-05-29 (news, en)
Authors: James Rudd-Jones, Mirco Musolesi, María Pérez-Ortiz

Abstract

arXiv:2605.30160v1, announce type: cross

Chaotic dynamical systems pose a fundamental challenge for Reinforcement Learning (RL): exponential sensitivity to initial conditions induces high-variance bootstrap targets and poorly conditioned gradient updates. Chaotic dynamics arise across scientific and engineering domains, from fluid flows and climate systems to multi-agent systems, where reliable learning is highly desirable. Standard RL methods optimise expected returns through scalar value functions, implicitly averaging over diverging trajectories and entangling trajectory-level instability with the learning objective. We show that under mild statistical stability assumptions, the return distribution evolves more regularly than individual trajectories when measured under the $1$-Wasserstein metric, yielding a smoother distributional Bellman objective. By aligning optimisation with this measure-level structure, distributional RL provides better-conditioned learning.
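
The contrast between trajectory-level and measure-level sensitivity can be illustrated with a toy sketch. This is not the paper's experiment. It assumes a logistic map with r = 4 as the chaotic system, takes the state itself as the reward, and uses hypothetical helper names (logistic_step, discounted_return, wasserstein_1). It perturbs every initial condition by a tiny amount and compares the mean gap between paired returns with the 1-Wasserstein distance between the two empirical return distributions.

```python
import numpy as np

def logistic_step(x, r=4.0):
    # One step of the logistic map; r = 4 is the standard chaotic regime (assumed toy system).
    return r * x * (1.0 - x)

def discounted_return(x0, horizon=200, gamma=0.95):
    # Discounted sum of rewards along each trajectory, with reward = state (an assumption).
    x = np.asarray(x0, dtype=float).copy()
    g = np.zeros_like(x)
    discount = 1.0
    for _ in range(horizon):
        g += discount * x
        discount *= gamma
        x = logistic_step(x)
    return g

def wasserstein_1(a, b):
    # 1-Wasserstein distance between two equal-size, equally weighted empirical
    # distributions on the real line: mean absolute difference of sorted samples.
    a = np.sort(np.asarray(a, dtype=float))
    b = np.sort(np.asarray(b, dtype=float))
    return float(np.mean(np.abs(a - b)))

rng = np.random.default_rng(0)
eps = 1e-6  # tiny perturbation of every initial condition

x0 = rng.uniform(0.1, 0.9, size=5000)
returns_a = discounted_return(x0)
returns_b = discounted_return(np.clip(x0 + eps, 0.0, 1.0))

# Trajectory-level view: how far apart are the returns of paired initial conditions?
pointwise_gap = float(np.mean(np.abs(returns_a - returns_b)))

# Measure-level view: how far apart are the two return distributions as a whole?
distribution_gap = wasserstein_1(returns_a, returns_b)

print("mean paired return gap:", pointwise_gap)
print("W1 between return distributions:", distribution_gap)
```

The distribution-level gap can never exceed the mean paired gap, because pairing samples by index is one admissible coupling and the 1-Wasserstein distance minimises over all couplings. How much smaller it is in practice depends on the system, the reward, and the discount factor chosen here.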
