GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning arXiv:2605.13136v2 Announce Type: replace Abstract: Distilling multi-step reasoning abilities from large language models (LLMs) into compact student models remains challenging due to noisy rationales, hallucinated supervision, and static teacher-student interactions. Existing reasoning distillation methods, including mentor-based approaches, predominantly operate in an open-loop manner, implicitly assuming uniform teacher re

GateKD: Confidence-Gated Closed-Loop Distillation for Robust Reasoning · 相关人物

暂无数据