Jailbreaking Multimodal Large Language Models using Multi-Clip Video

ArXiv cs.CV, 2026-06-02. Authors: Choongwon Kang, Seungjong Sun, Hyunmin Jun, Jang Hyun Kim

Abstract

arXiv:2606.02111v1 (announce type: new)

As multimodal large language models (MLLMs) have advanced to process video inputs, concerns have emerged about their potential for malicious misuse. Prior jailbreak studies have shown that safety alignment in MLLMs can be bypassed through visual inputs, yet it remains unclear which properties of video inputs induce this vulnerability. To address this gap, we introduce Multi-Clip Video (MCV) SafetyBench, a dataset of 2,920 videos designed to evaluate how the diversity of video inputs affects the vulnerability of MLLMs. Each video consists of multiple short clips depicting diverse contexts related to a harmful query. Experiments on eight representative video MLLMs show that attack success consistently increases with the number of clips.
