Reconsidering Positional Supervision in Masked Diffusion Language Model Training 文章

ArXiv CS.CL2026-06-02NEWSen作者: Mengyu Ye, Keito Kudo, Ryosuke Takahashi, Jun Suzuki

Reconsidering Positional Supervision in Masked Diffusion Language Model Training · 相关技术