LoRi: Low-Rank Distillation for Implicit Reasoning 文章

ArXiv CS.CL2026-06-05NEWSen作者: Ryan Solgi, Jiayi Tian, Zheng Zhang

摘要

arXiv:2606.05315v1 Announce Type: new Abstract: Implicit chain-of-thought (iCoT) methods aim to internalize reasoning in large language models, but often underperform explicit CoT prompting. We empirically find that hidden-state reasoning trajectories exhibit low-rank structure. Motivated by this observation, we propose a low-rank distillation framework that transfers reasoning by aligning teacher and student trajectories in a shared low-rank tensor subspace using first- and second-order statistics. The resulting formulation captures the global structure of reasoning while supporting a compact latent reasoning process. We evaluate the method across multiple model families, including LLaMA and Qwen, at different scales on mathematical reasoning benchmarks. Our approach consistently improves performance, especially on challenging multi-step tasks, approaching explicit CoT accuracy and outperforming prior iCoT distillation methods.

相关事件查看全部 (1)

LoRi: Low-Rank Distillation for Implicit Reasoning
2026-06-05PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据