OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification 文章

ArXiv CS.CL2026-06-02NEWSen作者: Yuhang Zhou, Lizhu Zhang, Yifan Wu, Mingyi Wang, Peng Bo, Jiayi Liu, Xiangjun Fan, Zhuokai Zhao

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification · 相关技术