GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning
Event: PRODUCT_LAUNCH | Date: 2026-06-02 | Impact: MEDIUM
arXiv:2605.13136v2 | Announce Type: replace
Abstract: Distilling multi-step reasoning abilities from large language models (LLMs) into compact student models remains challenging due to noisy rationales, hallucinated supervision, and static teacher-student interactions. Existing reasoning distillation methods, including mentor-based approaches, predominantly operate in an open-loop manner, implicitly assuming uniform teacher re
Related companies
ACTION (NONPROFIT)
InterAction (NONPROFIT)
Framework (COMPANY)
Anis (NONPROFIT)
InterMedia (NONPROFIT)
EAT (NONPROFIT)
ACT (NONPROFIT)
Mentor (NONPROFIT)
Unifor (NONPROFIT)
Ratio (RESEARCH_INSTITUTE)