Decomposing MXFP4 quantization error for LLM reinforcement learning: reducible bias, recoverable deadzone, and an irreducible floor

ArXiv CS.AI · 2026-06-03 · Authors: Xiaocan Li, Shiliang Wu, Zheng Shen
