Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks 文章

ArXiv CS.CL2026-05-27NEWSen作者: Lin Mu, Guowei Chu, Li Ni, Lei Sang, Yiwen Zhang

摘要

arXiv:2506.03627v2 Announce Type: replace Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various tasks by effectively utilizing a prompting strategy. However, they are highly sensitive to input perturbations, such as typographical errors or slight character order errors, which can significantly impair their performance. Despite advances in prompting techniques such as Chain-of-Thought and automatic prompt generation, developing a prompting strategy that explicitly mitigates the negative impact of such perturbations remains an open challenge. To bridge this gap, we propose Robustness of Prompting (RoP), a novel prompting strategy aimed at enhancing the robustness of LLMs. RoP consists of two stages: Error Correction and Guidance. In the Error Correction stage, RoP applies diverse perturbation methods to generate adversarial examples, which are used to generate prompts that correct input errors automatically.