Efficient Preference Poisoning Attack on Offline RLHF · Article

ArXiv CS.AI · 2026-05-26 · NEWS · en · Authors: Chenye Yang, Weiyu Xu, Lifeng Lai
