Cross-Generational Transfer of Adversarial Attacks Reveals Non-Monotonic Safety Alignment in LLMs 文章

ArXiv CS.CL2026-06-02NEWSen作者: Subhadip Mitra

Cross-Generational Transfer of Adversarial Attacks Reveals Non-Monotonic Safety Alignment in LLMs · 相关事件