Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing 事件
PRODUCT_LAUNCH2026-05-29影响: MEDIUM
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing arXiv:2603.17942v2 Announce Type: replace Abstract: Large Language Models (LLMs) possess latent multi-token prediction (MTP) abilities despite being trained only for next-token generation. We introduce ESP (Embedding-Space Probing), a simple and training-free MTP method that probes an LLM using on-the-fly mask tokens drawn from its embedding space, enabling parallel future-token prediction without modifying weights or re
相关产品查看全部 (10)
相关报道查看全部 (1)
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing
ArXiv CS.CL2026-05-29