Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models 文章

ArXiv CS.AI2026-06-11NEWSen作者: Malikeh Ehghaghi, Bogl\'arka Ecsedi, Marsha Chechik, Colin Raffel

详细信息

来源站点: ArXiv CS.AI
作者: Malikeh Ehghaghi, Bogl\'arka Ecsedi, Marsha Chechik, Colin Raffel
文章类型: NEWS
语言: en
发布日期: 2026-06-11

摘要

arXiv:2606.11409v1 Announce Type: cross Abstract: Adversarial robustness evaluations of large language models (LLMs) typically report attack success rate (ASR) under fixed query budgets, implicitly treating all attacks as equally costly. In practice, the computational expense of different attack strategies can vary by orders of magnitude. Consequently, ASR at a fixed budget can obscure the true effort required to jailbreak a model, thereby making it hard to determine whether an attack's cost justifies its payoff to the attacker. We propose a compute-aware evaluation framework based on computational pressure, measured in cumulative floating-point operations (FLOPs), as a proxy for adversarial effort. We introduce risk-compute curves, which map compute budgets to attack risk, and derive two metrics that summarize the average pressure required for a given attack to succeed.

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术