DTBench: A Synthetic Benchmark for Document-to-Table Extraction 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

DTBench: A Synthetic Benchmark for Document-to-Table Extraction arXiv:2602.13812v3 Announce Type: replace-cross Abstract: Document-to-table (Doc2Table) extraction derives structured tables from unstructured documents under a target schema, enabling reliable and verifiable SQL-based data analytics. Although large language models (LLMs) have shown promise in flexible information extraction, their ability to produce precisely structured tables remains insufficiently understood, particularly for in

DTBench: A Synthetic Benchmark for Document-to-Table Extraction · 相关技术