LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation 事件

Name: LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation
Start: 2026-06-01

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation arXiv:2605.30651v1 Announce Type: cross Abstract: We study trajectory selection for reasoning distillation, where teacher-generated reasoning trajectories are selectively used as supervision for a student model. Existing methods rely on heuristics such as trajectory quality or model confidence, but they often overlook whether a trajectory is learnable by the student. In this paper, we present LARK, a learnabil

人工智能

关系图谱

LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation · 相关公司

veRLNONPROFIT

RonCOMPANY

Abstract

arXivNONPROFIT

TERINONPROFIT

EARNNONPROFIT

ACTNONPROFIT

CharacterNONPROFIT

Rho