SemanticZip: A Pilot Framework for Lossy Text Compression with LLMs as Semantic Decompressors 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

SemanticZip: A Pilot Framework for Lossy Text Compression with LLMs as Semantic Decompressors arXiv:2605.24541v1 Announce Type: cross Abstract: Text compression for large language model (LLM) systems is usually framed as token deletion, retrieval, summarization, or exact reconstruction. We study a more aggressive but explicitly lossy setting: compress text into compact codes that an LLM can expand into task-relevant meaning. We call this setting SemanticZip. Unlike lossless compression, Semanti