EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents arXiv:2605.27820v1 Announce Type: new Abstract: As AI agents increasingly operate in open, real-world environments, they require a deep synergy of multimodal perception, tool invocation with multi-hop reasoning, and dynamic interaction with users. However, existing benchmarks fail to jointly evaluate these capabilities due to challenges in designing strictly coupled multi-capability tasks, simulating natural and task

EgoBench: An Interactive Egocentric Multimodal Benchmark for Tool-Using Agents · 相关公司

W
World LabsRESEARCH_INSTITUTE
R
RonCOMPANY
I
IDGCOMPANY
A
arXivNONPROFIT
I
ISESNONPROFIT
A
ACTIONNONPROFIT
I
InterActionNONPROFIT
C
CATIRESEARCH_INSTITUTE
A
ACTNONPROFIT