On the Generalization Gap in Self-Evolving Language Model Reasoning 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
On the Generalization Gap in Self-Evolving Language Model Reasoning arXiv:2606.01075v1 Announce Type: new Abstract: Recent work suggests that large language models (LLMs) can improve through self-evolution (SE), using supervision signals generated by the model itself. In this work, we ask: under a strict closed-loop setup, where the self-evolution algorithm has access only to an unlabeled prompt set and a base model, how close can internally generated supervision come to oracle-supervised train