On Distributional Reinforcement Learning in Chaotic Dynamical Systems

ArXiv CS.AI | 2026-05-29 | Authors: James Rudd-Jones, Mirco Musolesi, María Pérez-Ortiz

Abstract

arXiv:2605.30160v1 (cross-listed)

Chaotic dynamical systems pose a fundamental challenge for Reinforcement Learning (RL): exponential sensitivity to initial conditions induces high-variance bootstrap targets and poorly conditioned gradient updates. Chaotic dynamics arise across scientific and engineering domains, from fluid flows and climate systems to multi-agent systems, where reliable learning is highly desirable. Standard RL methods optimise expected returns through scalar value functions, implicitly averaging over diverging trajectories and entangling trajectory-level instability with the learning objective. We show that, under mild statistical stability assumptions, the return distribution evolves more regularly than individual trajectories when measured under the $1$-Wasserstein metric, yielding a smoother distributional Bellman objective. By aligning optimisation with this measure-level structure, distributional RL provides better-conditioned learning.
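The contrast between trajectory-level and distribution-level sensitivity can be illustrated with a toy experiment. The sketch below is not taken from the paper: the logistic map, the reward (the state itself), the discount factor, the truncated horizon, and the helper names `discounted_returns` and `wasserstein1` are all assumptions chosen for illustration. It perturbs an ensemble of initial states by a tiny amount and compares the mean per-trajectory return gap with the $1$-Wasserstein distance between the two empirical return distributions.

```python
import numpy as np


def discounted_returns(x0, gamma=0.95, horizon=200, r=4.0):
    """Truncated discounted return under the logistic map x <- r*x*(1-x).

    Assumption for illustration: the reward at each step is the state itself.
    """
    x = np.asarray(x0, dtype=float).copy()
    g = np.zeros_like(x)
    discount = 1.0
    for _ in range(horizon):
        g += discount * x
        x = r * x * (1.0 - x)  # chaotic for r = 4
        discount *= gamma
    return g


def wasserstein1(a, b):
    """1-Wasserstein distance between two equal-size empirical 1D samples.

    In one dimension the optimal coupling matches sorted samples.
    """
    a_sorted = np.sort(np.asarray(a, dtype=float))
    b_sorted = np.sort(np.asarray(b, dtype=float))
    return float(np.mean(np.abs(a_sorted - b_sorted)))


rng = np.random.default_rng(0)
n = 5000
starts = rng.uniform(0.1, 0.9, size=n)  # ensemble of initial states
perturbed = starts + 1e-6               # tiny shift, still inside (0, 1)

g_a = discounted_returns(starts)
g_b = discounted_returns(perturbed)

# Trajectory-level view: compare each return with its perturbed twin.
pathwise_gap = float(np.mean(np.abs(g_a - g_b)))
# Measure-level view: compare the two return distributions as a whole.
w1_gap = wasserstein1(g_a, g_b)

print(f"mean per-trajectory return gap: {pathwise_gap:.4f}")
print(f"1-Wasserstein gap between return distributions: {w1_gap:.4f}")
```

For equal-size one-dimensional samples the sorted matching is the optimal coupling, so the $1$-Wasserstein gap can never exceed the mean per-trajectory gap. How much smaller it is in practice depends on the assumed system and parameters.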
