From Rigid to Dynamic: Entropy-Guided Adaptive Inference for Long-Context LLMs 事件

Name: From Rigid to Dynamic: Entropy-Guided Adaptive Inference for Long-Context LLMs
Start: 2026-06-09

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

From Rigid to Dynamic: Entropy-Guided Adaptive Inference for Long-Context LLMs arXiv:2606.09508v1 Announce Type: new Abstract: Existing sparse attention and KV cache compression methods for long-context LLM inference typically apply fixed sparsity patterns or uniform budgets across all attention heads, overlooking the substantial variation in attention behavior among heads and contexts. We observe two distinct entropy patterns among attention heads: Rigid Heads, whose entropy stays near zero ac

人工智能

关系图谱

From Rigid to Dynamic: Entropy-Guided Adaptive Inference for Long-Context LLMs 事件

相关公司查看全部 (10)

相关人物查看全部 (1)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)