DODO: Discrete OCR Diffusion Models 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
DODO: Discrete OCR Diffusion Models arXiv:2602.16872v2 Announce Type: replace Abstract: Optical Character Recognition (OCR) is a fundamental task for digitizing information, serving as a critical bridge between visual data and textual understanding. While modern Vision-Language Models (VLM) have achieved high accuracy in this domain, they predominantly rely on autoregressive decoding, which becomes computationally expensive and slow for long documents as it requires a sequential forward pass fo
相关产品查看全部 (10)
相关报道查看全部 (1)
DODO: Discrete OCR Diffusion Models
ArXiv CS.CV2026-05-28