ConsisGuard: Aligning Safety Deliberation with Policy Enforcement in LLM Guardrails 文章

ArXiv CS.CL2026-06-01NEWSen作者: Yan Wang, Zhixuan Chu, Zihao Xue, Zhen Bi, Bingyu Zhu, YueFeng Chen, Zeyu Yang, Jungang Lou, Longtao Huang, Ningyu Zhang, Kui Ren, Hui Xue

ConsisGuard: Aligning Safety Deliberation with Policy Enforcement in LLM Guardrails · 相关技术