BadWorld: Adversarial Attacks on World Models 文章

ArXiv CS.CV2026-06-16NEWSen作者: Linghui Shen, Mingyue Cui, Xingyi Yang

详细信息

来源站点
ArXiv CS.CV
作者
Linghui Shen, Mingyue Cui, Xingyi Yang
文章类型
NEWS
语言
en
发布日期
2026-06-16

摘要

arXiv:2606.16519v1 Announce Type: new Abstract: Visual world models (VWMs) synthesize interactive, action-conditioned rollouts from a single context image. However, it remains an open question how robust these models are to adversarial perturbations. Standard adversarial attacks fail to assess this vulnerability because attackers lack ground-truth future videos and cannot predict subsequent user controls. We introduce BadWorld, a label-free adversarial framework tailored for autoregressive VWMs that systematically overcomes both constraints. First, to bypass the need for future supervision, we propose a self-supervised velocity attack that directly disrupts the early denoising dynamics of the model. Second, to ensure the attack generalizes across unpredictable user actions, we formulate a trajectory-adaptive bi-level optimization that actively mines hard control sequences to forge control-agnostic perturbations.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据