Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration arXiv:2605.31196v1 Announce Type: new Abstract: Safe human--robot collaboration requires more than visual description: a monitor must determine whether the robot body is safely separated, already colliding with the scene or a person, or about to collide. We call this capability collision grounding: binding visual observations to robot body geometry, camera viewpoint, scene layout, human proximity, and tempo