LFQ: Logit-aware Final-block Quantization for Boosting the Generation Quality of Low-Bit Quantized LLMs

ArXiv CS.AI · 2026-05-29 · Authors: Jung Hyun Lee, June Yong Yang, Jungwook Choi, Eunho Yang
