Invariant Gradient Alignment for Robust Reasoning Distillation 事件
PRODUCT_LAUNCH2026-06-04影响: MEDIUM
Invariant Gradient Alignment for Robust Reasoning Distillation arXiv:2606.05025v1 Announce Type: cross Abstract: Large language models (LLMs) suffer from shortcut learning: they systematically fail on out-of-distribution (OOD) inputs whose semantic surface differs from training data, even when the logical structure is identical. This undermines knowledge distillation pipelines that transfer chain-of-thought reasoning to smaller students. We introduce Invariant Gradient Alignment (IGA), a traini
相关人物
暂无数据
相关产品查看全部 (10)
相关报道查看全部 (1)
Invariant Gradient Alignment for Robust Reasoning Distillation
ArXiv CS.AI2026-06-04