A Geometric Characterization of the Stationary Plateau for Two-Layer Neural Networks 文章

ArXiv CS.AI2026-06-04NEWSen作者: Tian Ding, Dawei Li, Ruoyu Sun

摘要

arXiv:2606.04327v1 Announce Type: cross Abstract: We investigate the geometric structure of stationary plateaus that arise in the loss landscape of two-layer neural networks with smooth activation functions. We focus on the phenomenon of "neuron splitting" where duplicating a hidden neuron yields an affine set of stationary points in a wider network. We provide a comprehensive classification of all stationary points on these plateaus, determining under what conditions they constitute local minima or saddle points. Our characterization hinges on a per-neuron curvature object we term the "inner Hessian" matrix. Our analysis reveals that the definiteness of the inner Hessian and the choice of splitting coefficients jointly dictate the local geometry of the plateau. We show that "splitting" a local minimum can yield either a mixture of local minima and saddles or an all-saddle plateau, with a concrete sure-saddle region identified under mild assumptions.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据