Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility 文章

ArXiv CS.AI2026-05-29NEWSen作者: Jiahao Huang, Fei Cheng, Junfeng Jiang, Akiko Aizawa

摘要

arXiv:2605.29229v1 Announce Type: new Abstract: Reasoning distillation transfers complex reasoning abilities from large language models (LLMs) to smaller ones, yet its success depends on how well the training data align with the student model. This paper introduces the Data-Model Compatibility (DMC) metric, which can be used to assess the suitability of a dataset for reasoning distillation on a student model. DMC provides an assessment by jointly considering data quality, relative difficulty, and student capability. We validated the effectiveness of DMC from two perspectives: (1) DMC exhibits a strong correlation with reasoning distillation performance; and (2) using DMC as the criterion for data selection leads to improved reasoning distillation performance. Both findings are consistently demonstrated across multiple student models and tasks.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据