EpiCurveBench: Evaluating VLMs on Epidemic Curve Digitization 事件

PRODUCT_LAUNCH2026-05-27影响: MEDIUM

EpiCurveBench: Evaluating VLMs on Epidemic Curve Digitization arXiv:2605.27195v1 Announce Type: new Abstract: Chart-to-data extraction with vision-language models (VLMs) is increasingly evaluated on benchmarks that show diminishing headroom (frontier VLMs exceed 89% on ChartQA) and with metrics that treat extracted points as unordered key-value pairs, ignoring the temporal structure of time series and penalizing small alignment shifts as catastrophic failures. We address both gaps with EpiCurve

EpiCurveBench: Evaluating VLMs on Epidemic Curve Digitization · 相关人物

暂无数据