Does Visual Information Play a Decisive Role in Vision-Language-Action Model Driving Behavior?

Event: PRODUCT_LAUNCH | Date: 2026-06-01 | Impact: MEDIUM

arXiv:2605.31041v1 | Announce Type: new

Abstract: Vision-Language-Action (VLA) models have demonstrated promising capability in autonomous driving, highlighting the potential of unified multimodal architectures for jointly modeling perception and planning. However, how current VLA-based driving behavior is grounded in visual information remains poorly understood. Existing evaluation protocols mainly foc