Deterministic Inference across Tensor Parallel Sizes That Eliminates Training-Inference Mismatch

ArXiv CS.CL · 2026-06-01 · Authors: Ziyang Zhang, Xinheng Ding, Jiayi Yuan, Rixin Liu, Huizi Mao, Jiarong Xing, Zirui Liu
