Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing arXiv:2604.18170v2 Announce Type: replace Abstract: LLMs edit text and code by autoregressively regenerating the full output, even when most tokens appear verbatim in the input. We study Copy-as-Decode, a decoding-layer mechanism that recasts edit generation as structured decoding over a two-primitive grammar: references an input line range, ... emits new content. A token-level FSM guarantees syntactic validity, and a serving