CHASE: Adversarial Red-Blue Teaming for Improving LLM Safety using Reinforcement Learning 文章

ArXiv CS.CL2026-06-05NEWSen作者: Rahul Markasserithodi, Aditya Joshi, Yuekang Li, Ishmanbir Singh, Chris Yoo, Alan Niu

CHASE: Adversarial Red-Blue Teaming for Improving LLM Safety using Reinforcement Learning · 相关技术

相关技术