CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM 文章

ArXiv CS.AI2026-05-26NEWSen作者: Yubo Li, Yidi Miao

CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM · 相关技术