Event: Autoregressive Visual Generation Needs a Prologue

PRODUCT_LAUNCH | 2026-06-01 | Impact: MEDIUM

arXiv:2605.06137v2 | Announce Type: replace

Abstract: In this work, we propose Prologue, an approach to bridging the reconstruction-generation gap in autoregressive (AR) image generation. Instead of modifying visual tokens to satisfy both reconstruction and generation, Prologue generates a small set of prologue tokens prepended to the visual token sequence. These prologue tokens are trained exclusively with the AR cross-entropy (CE) loss, while vi