ERQA-Plus: A Diagnostic Benchmark for Reasoning in Embodied AI 文章

ArXiv CS.CV2026-06-17NEWSen作者: Hong Yang, Basura Fernando

详细信息

来源站点
ArXiv CS.CV
作者
Hong Yang, Basura Fernando
文章类型
NEWS
语言
en
发布日期
2026-06-17

摘要

arXiv:2606.17639v1 Announce Type: cross Abstract: Generalist embodied agents require more than object recognition: they must reason about spatial relations, actions, procedures, human intentions, environmental constraints, and commonsense consequences from situated visual observations. Yet existing visual and embodied question answering benchmarks often provide limited control over the reasoning dependencies being tested, making it difficult to distinguish grounded embodied reasoning from shortcut-driven visual or linguistic pattern matching. We present ERQA-Plus, a diagnostic benchmark for reasoning in embodied AI. ERQA-Plus contains 1,766 question-answer instances grounded in 711 robot-centric images and organized according to a structured taxonomy spanning perceptual, action-centric, social-interaction, navigation-environmental, and contextual commonsense reasoning.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据