Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models · Event

ACQUISITION · 2026-05-29 · Impact: HIGH

arXiv:2601.14758v4 · Announce type: replace-cross

Abstract: Post-training pretrained autoregressive models (ARMs) into masked diffusion models (MDMs) has emerged as a cost-effective way to overcome the limitations of sequential generation. Yet it remains unclear whether post-trained MDMs acquire genuinely new computational mechanisms or merely re-express autoregressive computation in a non-autoregressive

Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models · Related People