ERQA-Plus: A Diagnostic Benchmark for Reasoning in Embodied AI 文章

ArXiv CS.CV2026-06-17NEWSen作者: Hong Yang, Basura Fernando

详细信息

来源站点: ArXiv CS.CV
作者: Hong Yang, Basura Fernando
文章类型: NEWS
语言: en
发布日期: 2026-06-17

摘要

arXiv:2606.17639v1 Announce Type: cross Abstract: Generalist embodied agents require more than object recognition: they must reason about spatial relations, actions, procedures, human intentions, environmental constraints, and commonsense consequences from situated visual observations. Yet existing visual and embodied question answering benchmarks often provide limited control over the reasoning dependencies being tested, making it difficult to distinguish grounded embodied reasoning from shortcut-driven visual or linguistic pattern matching. We present ERQA-Plus, a diagnostic benchmark for reasoning in embodied AI. ERQA-Plus contains 1,766 question-answer instances grounded in 711 robot-centric images and organized according to a structured taxonomy spanning perceptual, action-centric, social-interaction, navigation-environmental, and contextual commonsense reasoning.

ERQA-Plus: A Diagnostic Benchmark for Reasoning in Embodied AI 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (1)

相关技术