PILOT: Policy-Informed Learned Optimization for Adaptive Deep Network Training

ArXiv CS.CV | 2026-05-26 | Authors: Sattam Altuuaim, Lama Ayash, Muhammad Mubashar, Naeemullah Khan

Abstract

arXiv:2605.24570v1 (announce type: cross)

Despite the central role of optimization in deep learning, most optimizers rely on update structures whose functional form is fixed before training begins. This static design can limit their ability to respond to changing gradient behavior across the loss landscape, where training may shift between stable, noisy, and inconsistent regimes. This study proposes PILOT (Policy-Informed Learned OpTimizer), an online optimizer that adapts its update behavior during training. Rather than using a fixed balance between momentum, normalization, and sign-based updates, PILOT uses gradient-direction agreement as a signal of local training stability. Conditioning the update rule on this agreement signal allows the optimizer to adjust its behavior when gradients become stable, noisy, or inconsistent.
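
The abstract does not give PILOT's actual update rule, so the following is only an illustrative sketch of the general idea it describes: measuring gradient-direction agreement and using it to shift the balance between a momentum-style update and a sign-based update. Everything here is an assumption made for illustration, including the use of cosine similarity as the agreement signal, the linear blend, and the names `cosine_agreement` and `agreement_conditioned_step`. It should not be read as the authors' method.

```python
import math


def cosine_agreement(grad, momentum, eps=1e-12):
    """Cosine similarity between the current gradient and the running momentum.

    Assumption: cosine similarity stands in for the "gradient-direction
    agreement" signal; the abstract does not define the signal precisely.
    """
    dot = sum(g * m for g, m in zip(grad, momentum))
    norm_g = math.sqrt(sum(g * g for g in grad))
    norm_m = math.sqrt(sum(m * m for m in momentum))
    return dot / (norm_g * norm_m + eps)


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def agreement_conditioned_step(params, grad, momentum, lr=0.1, beta=0.9):
    """One hypothetical update step conditioned on direction agreement.

    High agreement (stable gradients): lean on the normalized momentum.
    Low or negative agreement (noisy or inconsistent gradients): fall back
    to a sign-based step on the raw gradient.
    """
    agreement = cosine_agreement(grad, momentum)
    w = max(0.0, agreement)  # blend weight in [0, 1]

    # Standard exponential moving average of gradients.
    new_momentum = [beta * m + (1 - beta) * g for m, g in zip(momentum, grad)]
    norm_m = math.sqrt(sum(m * m for m in new_momentum)) + 1e-12

    # Illustrative linear blend of the two update directions.
    direction = [
        w * (m / norm_m) + (1 - w) * sign(g)
        for m, g in zip(new_momentum, grad)
    ]
    new_params = [p - lr * d for p, d in zip(params, direction)]
    return new_params, new_momentum, agreement


# Usage on f(x) = 0.5 * ||x||^2, whose gradient is x itself.
params, momentum = [1.0, -2.0], [0.0, 0.0]
for _ in range(3):
    grad = list(params)
    params, momentum, agreement = agreement_conditioned_step(params, grad, momentum)
```

On the first step the momentum is zero, so the agreement is zero and the update is purely sign-based. As the momentum aligns with the gradient on later steps, the blend weight rises and the normalized momentum term dominates.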
