Channel-Wise Mixed-Precision Quantization for Large Language Models 文章

ArXiv CS.CL2026-06-05NEWSen作者: Zihan Chen, Bike Xie, Jundong Li, Cong Shen

Channel-Wise Mixed-Precision Quantization for Large Language Models · 相关技术