RoboTrustBench: Benchmarking the Trustworthiness of Video World Models for Robotic Manipulation
Event: PRODUCT_LAUNCH, 2026-06-02
Impact: MEDIUM
arXiv:2606.01600v1
Announce Type: new
Abstract: Video world models are increasingly used in robotic manipulation, yet existing benchmarks mostly evaluate them under valid, feasible, and safe instructions. We introduce RoboTrustBench, a benchmark for evaluating the trustworthiness of video world models under four scenarios: Normal, Constraint-Sensitive, Counterfactual, and Adversarial. Built from real