Persona Attack: Incremental Memory Injection Jailbreak Attack against Large Language Models (Event)

PRODUCT_LAUNCH | 2026-06-02 | Impact: MEDIUM

arXiv:2606.00150v1 Announce Type: cross

Abstract: As Large Language Models evolve for user convenience, vulnerability to jailbreak attacks continues to be reported despite ongoing efforts in safety training. Traditional jailbreak techniques typically focus on a single prompt injection, neglecting the models' ability to remember the flow of conversation and the user's instructions. In this paper, we propo