Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts arXiv:2601.02144v2 Announce Type: replace Abstract: Mixture-of-Experts (MoE) architectures scale large language models efficiently by employing a parametric ``router'' to dispatch tokens to a sparse subset of experts. Typically, this router is trained once and then frozen, rendering routing decisions brittle under distribution shifts. We address this limitation by introducing kNN-MoE, a retrieval-augmented routing framew
相关产品查看全部 (10)
相关报道查看全部 (1)
Routing by Analogy: kNN-Augmented Expert Assignment for Mixture-of-Experts
ArXiv CS.CL2026-05-26