Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing 事件

BREAKTHROUGH2026-06-02影响: HIGH

Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing arXiv:2606.01393v1 Announce Type: cross Abstract: Document parsing and recognition are fundamental capabilities for vision-language models (VLMs) and document processing systems. However, existing Optical Character Recognition (OCR) and document parsing benchmarks are increasingly limited in coverage and difficulty: many focus on common document genres or uniformly sampled pages where modern parsers already