DAG-MoE: From Simple Mixture to Structural Aggregation in Mixture-of-Experts
Event: PRODUCT_LAUNCH
Date: 2026-06-02
Impact: MEDIUM
arXiv:2606.01062v1
Announce Type: new
Abstract: Mixture-of-Experts (MoE) models have become a leading approach for decoupling parameter count from computational cost in large language models, yet effectively scaling MoE performance remains a challenge. Prior work shows that fine-grained experts enlarge the space of expert combinations and improve flexibility, but they also impose substantial routing overhead, creating