HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers 文章

ArXiv CS.CV2026-06-02NEWSen作者: Issa Sugiura, Shuhei Kurita, Yusuke Oda, Naoaki Okazaki

摘要

arXiv:2606.01132v1 Announce Type: new Abstract: Understanding chart and table images is essential for applying vision-language models (VLMs) to real-world document understanding. While English benchmarks have advanced rapidly, non-English counterparts remain scarce, leaving it unclear whether this progress generalizes across languages. A key obstacle is the difficulty of collecting realistic and diverse non-English chart and table images at scale. To address this, we leverage governmental white papers as a scalable source for benchmark construction beyond English, as they contain naturally occurring charts and tables across diverse formats and domains and are freely accessible in many countries. As a first instantiation, we introduce HakushoBench, a challenging Japanese chart and table VQA benchmark built from 33 governmental white papers.

相关公司

暂无数据

相关人物

暂无数据