Self-Verified Distillation: Your Language Model Is Secretly Its Own Synthetic Data Pipeline 事件

PRODUCT_LAUNCH2026-05-27影响: MEDIUM

Self-Verified Distillation: Your Language Model Is Secretly Its Own Synthetic Data Pipeline arXiv:2605.26132v1 Announce Type: new Abstract: Can post-trained large language models (LLMs) further improve themselves using only unlabeled prompts, without external teachers or feedback from tools? We study this setting starting only from unlabeled seed questions with no ground-truth solutions, across three reasoning domains: math, science, and coding. We propose Self-Verified Distillation, a simple p