SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking 事件
OPEN_SOURCE2026-05-26影响: MEDIUM
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking arXiv:2605.25160v1 Announce Type: new Abstract: Mobile GUI agents powered by large language models have progressed rapidly, creating urgent needs for realistic and comprehensive evaluation. Existing benchmarks prioritize reproducibility but are often limited to open-source apps or file-operation tasks for the difficulty of constructing rewards on real applications, leaving a gap between benchmark settings an
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking · 相关公司
W
World LabsCOMPANY
A
arXivNONPROFIT
S
SpanNONPROFIT
A
ACTIONNONPROFIT
I
InterActionNONPROFIT
F
FrameworkCOMPANY
C
CATIRESEARCH_INSTITUTE
E
EATNONPROFIT
A
ACTNONPROFIT
R
RatioRESEARCH_INSTITUTE