EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents | Event

PRODUCT_LAUNCH | 2026-05-28 | Impact: MEDIUM

arXiv:2605.27820v1 Announce Type: new

Abstract: As AI agents increasingly operate in open, real-world environments, they require a deep synergy of multimodal perception, tool invocation with multi-hop reasoning, and dynamic interaction with users. However, existing benchmarks fail to jointly evaluate these capabilities due to challenges in designing strictly coupled multi-capability tasks, simulating natural and task