AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Event

PRODUCT_LAUNCH | 2026-06-05 | Impact: MEDIUM

arXiv:2606.06155v1 | Announce Type: cross

Abstract: Vision-Language-Action (VLA) models leverage the rich world knowledge of pretrained vision-language models (VLMs) to enable instruction-following robotic manipulation. However, the structural mismatch between VLM semantic spaces and embodied control policies often hinders the learning of precise perception-action mappings. To addres