Bridging the Semantic-Action Gap in Visual Token Pruning for Efficient VLA Inference 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Bridging the Semantic-Action Gap in Visual Token Pruning for Efficient VLA Inference arXiv:2511.16449v4 Announce Type: replace Abstract: Vision-Language-Action (VLA) models have shown great potential for embodied AI by integrating visual perception, language understanding, and action execution. In real-time deployment, these models must process continuous visual streams, incurring substantial computational overhead. Visual token pruning -- a mainstream technique for accelerating Vision-Language