A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation arXiv:2605.31351v1 Announce Type: cross Abstract: AI-based Visually Impaired Assistance (VIA) remains challenging, largely due to the high cost of human evaluation. The VLM-as-a-Judge paradigm may offer a promising alternative, although it has mostly been studied in general domains. We therefore ask whether such judges can be trusted for VIA tasks. To investigate this question, we introduce VIABLE (Visually Impaired Assistan

A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation · 相关人物