DODO: Discrete OCR Diffusion Models

Event: PRODUCT_LAUNCH | Date: 2026-05-28 | Impact: MEDIUM

arXiv:2602.16872v2 (announce type: replace)

Abstract: Optical Character Recognition (OCR) is a fundamental task for digitizing information, serving as a critical bridge between visual data and textual understanding. While modern Vision-Language Models (VLMs) have achieved high accuracy in this domain, they predominantly rely on autoregressive decoding, which becomes computationally expensive and slow for long documents as it requires a sequential forward pass fo