The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF 文章

ArXiv CS.AI2026-05-29NEWSen作者: Zeli Su, Zhankai Xu, Tianlei Chen, Longfei Zheng, Xiaolu Zhang, Jun Zhou, Wentao Zhang

The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF · 相关技术