MicroSpec: Accelerating Speculative Decoding with Lightweight In-Context Vocabularies 文章

ArXiv CS.CL2026-05-27NEWSen作者: Zhiyang Chen, Daliang Xu, Yinyuan Zhang, Chenghua Wang, Mengwei Xu, Yun Ma

MicroSpec: Accelerating Speculative Decoding with Lightweight In-Context Vocabularies · 相关事件