Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications 文章

ArXiv CS.AI2026-05-26NEWSen作者: Xiaoyue Lu, Xianglin Yang, Haijun Liu, Jiahao Liu, Kuntai Cai, Yan Xiao, Jin Song Dong

Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications · 相关技术