ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression 事件

Name: ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression
Start: 2026-05-29

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression arXiv:2605.29350v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) language models reduce per-token computation but still require storing and serving all experts, making deployment memory-intensive. Existing post-training compression methods mainly shrink this cost by pruning experts or merging their weights. We formulate post-training MoE compression as expert-pool consolidation: retaining a smaller set of p

人工智能

关系图谱

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression 事件

相关公司查看全部 (7)

相关人物

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)