Chain-of-Thought Hijacking (Event)
PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM
arXiv:2510.26418v4 (Announce Type: replace)
Abstract: Large Reasoning Models (LRMs) improve task performance through extended inference-time reasoning. Although previous studies suggest that longer reasoning should lead to more robust safety behavior, we find evidence to the contrary: over-extended reasoning can instead be exploited to systematically weaken refusal behavior. We propose Chain-of-Thought Hijacking, a simple yet effective black-box jailbreak attack that in
Related coverage: Chain-of-Thought Hijacking (ArXiv CS.AI, 2026-05-26)