Word Class Representations Spontaneously Emerge from Successor Representations Trained on Natural Language 文章

ArXiv CS.CL2026-05-26NEWSen作者: Mathis Immertreu, Achim Schilling, Thomas Kinfe, Patrick Krauss

摘要

arXiv:2605.24585v1 Announce Type: new Abstract: Language models are typically trained to predict the next token in a sequence. Here, we explore an alternative predictive principle from reinforcement learning: Successor Representations (SRs), which model the expected discounted distribution of future states rather than the immediate next state. We transfer this framework to natural language and train neural networks to predict future word distributions across multiple temporal horizons, thereby learning representations of long-range transition structure. We train a deep residual neural network on WikiText-103 (103 million tokens; 20,000-word vocabulary) and optimize successor representations as probability distributions using KL divergence. Without explicit linguistic supervision, structured language representations emerge spontaneously.

Word Class Representations Spontaneously Emerge from Successor Representations Trained on Natural Language 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (1)

相关技术查看全部 (3)