dMoE: dLLMs with Learnable Block Experts 事件
PRODUCT_LAUNCH2026-06-01影响: MEDIUM
dMoE: dLLMs with Learnable Block Experts arXiv:2605.30876v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have recently emerged as a promising alternative to autoregressive models, offering competitive performance while naturally supporting parallel decoding. However, as dLLMs are increasingly integrated with Mixture-of-Experts (MoE) architectures to scale model capacity, a fundamental mismatch arises between block parallel decoding and token-level expert selection. Speci
相关产品查看全部 (10)
相关报道查看全部 (1)
dMoE: dLLMs with Learnable Block Experts
ArXiv CS.CL2026-06-01