Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

ArXiv CS.AI, 2026-06-03. Authors: Ali Behrouz, Farnoosh Hashemi, Vahab Mirrokni

Abstract

arXiv:2606.03979v1. Announce type: cross.

The past few decades have witnessed significant advances in the design of machine learning algorithms, from early studies on task-specific shallow models to more general deep Large Language Models (LLMs). Although existing models show promising results on tasks that require instant prediction or in-context learning, they lack the ability to learn continually and to effectively transfer their temporal in-context knowledge into their long-term parameters. Inspired by the human learning process, we introduce a "Sleep" paradigm that allows models to learn continually, distill their fragile short-term memories into stable long-term knowledge through replay, and recursively improve themselves through a "Dreaming" process.
