Zipping the Thought: When and How Compressed Reasoning Data Works in LLM Post-Training 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
Zipping the Thought: When and How Compressed Reasoning Data Works in LLM Post-Training arXiv:2605.28008v1 Announce Type: new Abstract: Large language models (LLMs) can now solve complex problems through long chain-of-thought (CoT) reasoning, but the trade-off between performance and token cost remains a central challenge. To address this issue, supervised fine-tuning (SFT) often uses compressed reasoning data, where CoT traces are shortened into compact forms. However, the effect of such compre