BadWorld: Adversarial Attacks on World Models 文章

ArXiv CS.CV2026-06-16NEWSen作者: Linghui Shen, Mingyue Cui, Xingyi Yang

详细信息

来源站点: ArXiv CS.CV
作者: Linghui Shen, Mingyue Cui, Xingyi Yang
文章类型: NEWS
语言: en
发布日期: 2026-06-16

摘要

arXiv:2606.16519v1 Announce Type: new Abstract: Visual world models (VWMs) synthesize interactive, action-conditioned rollouts from a single context image. However, it remains an open question how robust these models are to adversarial perturbations. Standard adversarial attacks fail to assess this vulnerability because attackers lack ground-truth future videos and cannot predict subsequent user controls. We introduce BadWorld, a label-free adversarial framework tailored for autoregressive VWMs that systematically overcomes both constraints. First, to bypass the need for future supervision, we propose a self-supervised velocity attack that directly disrupts the early denoising dynamics of the model. Second, to ensure the attack generalizes across unpredictable user actions, we formulate a trajectory-adaptive bi-level optimization that actively mines hard control sequences to forge control-agnostic perturbations.

BadWorld: Adversarial Attacks on World Models 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (1)

相关技术查看全部 (5)