Path Channels and Plan Extension Kernels: a Mechanistic Description of Planning in a Sokoban RNN 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Path Channels and Plan Extension Kernels: a Mechanistic Description of Planning in a Sokoban RNN arXiv:2506.10138v3 Announce Type: replace-cross Abstract: We partially reverse-engineer a convolutional recurrent neural network (RNN) trained with model-free reinforcement learning to play the box-pushing game Sokoban. We find that the RNN stores future moves (plans) as activations in particular channels of the hidden state, which we call path channels. A high activation in a particular location me