Structure-Aware Text Recognition for Ancient Greek Critical Editions

ArXiv cs.CV · 2026-05-29 · Authors: Nicolas Angleraud, Antonia Karamolegkou, Benoît Sagot, Thibault Clérice

Abstract

arXiv:2603.02803v2

Recent advances in visual language models (VLMs) have transformed end-to-end document understanding. However, their ability to interpret the complex layout semantics of historical scholarly texts remains limited. This paper investigates structure-aware text recognition for Ancient Greek critical editions, which have dense reference hierarchies and extensive marginal annotations. We introduce two novel resources: (i) a large-scale synthetic corpus of 185,000 page images generated from TEI/XML sources with controlled typographic and layout variation, and (ii) a curated benchmark of real scanned editions spanning more than a century of editorial and typographic practices. Using these datasets, we evaluate three state-of-the-art VLMs under both zero-shot and fine-tuning regimes. Our experiments reveal substantial limitations in current VLM architectures when confronted with highly structured historical documents.
