Position: Retire the "Positive Backdoor" Label -- Secret Alignment Requires Strict and Systematic Evaluation

ArXiv CS.AI | 2026-05-28 | Authors: Jianwei Li, Jung-Eun Kim
