Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models 文章

ArXiv CS.CL2026-05-29NEWSen作者: Injin Kong, Hyoungjoon Lee, Yohan Jo

Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models · 相关技术