Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation 事件

Name: Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation
Start: 2026-05-27

OPEN_SOURCE2026-05-27影响: MEDIUM

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation arXiv:2605.27134v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have shown rapid progress in mobile GUI navigation. This paper presents a systematic study of data scaling, benchmarking, and reasoning for VLM-based agents in this domain. To facilitate rigorous evaluation, we introduce HyperTrack, a large-scale dataset with over 16000 real-world tasks across more than 650 Chinese mobile applicat

人工智能

关系图谱

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation 事件

相关公司查看全部 (8)

相关人物

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)