Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models 文章

ArXiv CS.CV2026-05-27NEWSen作者: Yifan Jiang, Dae Yon Hwang, Jesse C. Cresswell, Freda Shi

摘要

arXiv:2605.27311v1 Announce Type: cross Abstract: Chart question-answering (QA) benchmarks aim to pose questions that require visual reasoning to correctly answer, but models can often reach solutions through shortcuts or prior familiarity with a chart based on their own background knowledge. To strictly evaluate visual reasoning, we propose counterfactual charts where the chart-question task remains fixed, but underlying chart and the corresponding answer are varied. We introduce Chartographer, a framework to reverse engineer charts into executable code, validate reconstruction fidelity, generate seed-controlled counterfactual variants, and derive new answers from executable QA logic. We apply this framework to existing chart QA datasets and evaluate proprietary and open-source vision-language models (VLMs), measuring variation sensitivity and generalizability.