MobileMoE: Scaling On-Device Mixture of Experts 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
MobileMoE: Scaling On-Device Mixture of Experts arXiv:2605.27358v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) has become the de facto architecture for hundred-billion-parameter language models, yet its advantages at sub-billion scales for on-device deployment remain largely unexplored. To close this gap, we present MobileMoE, a family of on-device MoE language models with sub-billion active parameters (0.3-0.9B active and 1.3-5.3B total) that establish a new Pareto frontier for on-
相关人物
暂无数据
相关产品查看全部 (10)
相关报道查看全部 (1)
MobileMoE: Scaling On-Device Mixture of Experts
ArXiv CS.CL2026-05-27