DAG-MoE: From Simple Mixture to Structural Aggregation in Mixture-of-Experts

arXiv cs.AI · 2026-06-02 · Authors: Jiarui Feng, Hanqing Zeng, Karish Grover, Ruizhong Qiu, Yinglong Xia, Qiang Zhang, Qifan Wang, Ren Chen, Dongqi Fu, Jiayi Liu, Zhoukai Zhao, Xiangjun Fan, Benyu Zhang, Yixin Chen
