Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network 事件

PRODUCT_LAUNCH2026-06-06影响: MEDIUM

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network arXiv:2606.05326v1 Announce Type: cross Abstract: We study the dynamics of gradient descent in the Edge of Stability regime, where the learning rate is large enough to induce persistent oscillations in the loss and the sharpness. We propose a continuous-time effective model that tracks the evolution of the average trajectory coupled with the time-averaged covariance of its fast oscillat