MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Any-Precision LLM 文章
ArXiv CS.CL2026-05-26NEWSen作者: Dongwei Wang, Jinhee Kim, Seokho Han, Denis Gudovskiy, Yohei Nakata, Tomoyuki Okuno, KhayTze Peong, Kang Eun Jeon, Jong Hwan Ko, Yiran Chen, Huanrui Yang