RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes arXiv:2606.00828v1 Announce Type: new Abstract: Vision-Language Models (VLMs) have shown strong visual understanding and are increasingly deployed in embodied AI systems, where reliable perception under real conditions is essential. However, existing benchmarks assess VLMs using clean images or isolated perturbations rather than stresses caused by physical scene formation. This design has two limitations: