From AR to Diffusion: Efficiently Adapting Large Language Models with Strictly Causal and Elastic Horizons | Event

PRODUCT_LAUNCH | 2026-05-28 | Impact: MEDIUM

arXiv:2605.27387v1 | Announce Type: new

Abstract: Diffusion models promise efficient parallel text generation but rely on bidirectional attention, creating a structural mismatch with pre-trained autoregressive (AR) models. This incompatibility precludes reusing robust AR priors, necessitating prohibitive pre-training from scratch. To bridge this gap, we propose FLUID, a framework that effici