dMoE: dLLMs with Learnable Block Experts 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

dMoE: dLLMs with Learnable Block Experts arXiv:2605.30876v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) have recently emerged as a promising alternative to autoregressive models, offering competitive performance while naturally supporting parallel decoding. However, as dLLMs are increasingly integrated with Mixture-of-Experts (MoE) architectures to scale model capacity, a fundamental mismatch arises between block parallel decoding and token-level expert selection. Speci