Word Class Representations Spontaneously Emerge from Successor Representations Trained on Natural Language

ArXiv cs.CL | 2026-05-26 | Authors: Mathis Immertreu, Achim Schilling, Thomas Kinfe, Patrick Krauss

Abstract

arXiv:2605.24585v1 (announce type: new)

Language models are typically trained to predict the next token in a sequence. Here, we explore an alternative predictive principle from reinforcement learning: Successor Representations (SRs), which model the expected discounted distribution of future states rather than the immediate next state. We transfer this framework to natural language and train neural networks to predict future word distributions across multiple temporal horizons, thereby learning representations of long-range transition structure. We train a deep residual neural network on WikiText-103 (103 million tokens; 20,000-word vocabulary) and optimize successor representations as probability distributions using KL divergence. Without explicit linguistic supervision, structured language representations emerge spontaneously.
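
To make the idea of a successor representation over words concrete, here is a minimal sketch that builds count-based discounted future-word distributions from a toy sentence and compares them with KL divergence. It is not the authors' method: the abstract describes a deep residual network trained on WikiText-103, whereas this example only estimates empirical targets with NumPy. The function names, the discount `gamma`, the cutoff `horizon`, the row normalization, and the toy corpus are all assumptions made for illustration.

```python
import numpy as np


def successor_targets(tokens, vocab_size, gamma=0.9, horizon=10):
    """Empirical discounted future-word distribution for each word id.

    Assumption: a future token at offset k gets weight gamma**(k - 1),
    and each row is normalized to sum to 1 so it can be used with KL.
    """
    counts = np.zeros((vocab_size, vocab_size))
    for t, current in enumerate(tokens):
        for k in range(1, horizon + 1):
            if t + k >= len(tokens):
                break
            counts[current, tokens[t + k]] += gamma ** (k - 1)
    row_sums = counts.sum(axis=1, keepdims=True)
    # Words that never have a follower keep an all-zero row.
    return np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) over the support of p; q is floored at eps to avoid log(0)."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / np.maximum(q[mask], eps))))


# Hypothetical toy corpus, only for illustration.
words = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(words))
ids = {w: i for i, w in enumerate(vocab)}
tokens = [ids[w] for w in words]

sr = successor_targets(tokens, len(vocab), gamma=0.9, horizon=10)

# Compare the future-word distributions of two nouns and of a noun and a verb.
print("KL(cat || dog):", kl_divergence(sr[ids["cat"]], sr[ids["dog"]]))
print("KL(cat || sat):", kl_divergence(sr[ids["cat"]], sr[ids["sat"]]))
```

In a trained model, a network would output a predicted distribution for each input word and the KL term would serve as the loss against targets like these. Setting `horizon=1` reduces the targets to ordinary next-word frequencies, which shows how the successor view generalizes next-token prediction.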