DriveMA: Driving Vision-Language-Action Models with verifiable Meta-Actions 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

DriveMA: Driving Vision-Language-Action Models with verifiable Meta-Actions arXiv:2605.31271v1 Announce Type: new Abstract: Driving Vision-Language-Action Models (Driving VLAs) aim to use language to improve end-to-end planning, but the language-action gap limits this promise. We propose DriveMA, a Driving VLA framework built on verifiable meta-actions, which summarize future ego motion into compact language-domain intentions and can be constructed from expert trajectories with a trajectory-gro