MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing arXiv:2605.24973v1 Announce Type: new Abstract: VLM-based OCR models have become the de facto choice for document parsing, as they can accurately extract page-level elements (e.g., paragraphs within individual pages) together with their bounding boxes and textual content. However, downstream applications such as RAG require coherent document-level information, whereas these models often break cross-page continuity and