D-Judge: Disrupting Multi-Turn Jailbreaks using Semantics-Preserving Output Rewriting

ArXiv CS.AI · 2026-06-03 · Authors: Huanli Gong, Zhipeng Wei, Yu Fu, Haz Sameen Shahgir, Ananya Gupta, Yue Dong, N. Benjamin Erichson
