Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

ArXiv CS.AI | 2026-06-11 | NEWS | en | Authors: Malikeh Ehghaghi, Boglárka Ecsedi, Marsha Chechik, Colin Raffel

Details

Source site
ArXiv CS.AI
Authors
Malikeh Ehghaghi, Boglárka Ecsedi, Marsha Chechik, Colin Raffel
Article type
NEWS
Language
en
Publication date
2026-06-11

Abstract

arXiv:2606.11409v1 (cross-listed)

Adversarial robustness evaluations of large language models (LLMs) typically report attack success rate (ASR) under fixed query budgets, implicitly treating all attacks as equally costly. In practice, the computational expense of different attack strategies can vary by orders of magnitude. Consequently, ASR at a fixed budget can obscure the true effort required to jailbreak a model, making it hard to determine whether an attack's cost justifies its payoff to the attacker. We propose a compute-aware evaluation framework based on computational pressure, measured in cumulative floating-point operations (FLOPs), as a proxy for adversarial effort. We introduce risk-compute curves, which map compute budgets to attack risk, and derive two metrics that summarize the average pressure required for a given attack to succeed.
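To make the idea of a risk-compute curve concrete, here is a minimal sketch in Python. It is not the authors' implementation: the abstract does not give the exact definitions, so this sketch assumes that "risk" at a budget is the fraction of attacked prompts jailbroken within that many cumulative FLOPs. The function names `risk_compute_curve` and `mean_pressure_to_success` are hypothetical, the summary statistic is only one plausible reading of "average pressure required to succeed", and the numbers are invented for illustration.

```python
import math
from typing import Sequence


def risk_compute_curve(
    flops_to_success: Sequence[float], budgets: Sequence[float]
) -> list[float]:
    """Empirical risk at each compute budget (assumed definition).

    flops_to_success[i] is the cumulative FLOPs an attack spent on prompt i
    up to its first success, or math.inf if it never succeeded.
    Risk at budget b is the fraction of prompts broken using at most b FLOPs.
    """
    n = len(flops_to_success)
    if n == 0:
        raise ValueError("need at least one attacked prompt")
    return [sum(f <= b for f in flops_to_success) / n for b in budgets]


def mean_pressure_to_success(flops_to_success: Sequence[float]) -> float:
    """Hypothetical summary: mean FLOPs over the prompts that were broken."""
    finite = [f for f in flops_to_success if math.isfinite(f)]
    return sum(finite) / len(finite) if finite else math.inf


# Invented illustrative values, not results from the paper.
flops = [1e12, 5e12, 2e13, math.inf]  # the last prompt was never jailbroken
budgets = [1e12, 1e13, 1e14]

curve = risk_compute_curve(flops, budgets)
print(curve)  # [0.25, 0.5, 0.75]
print(mean_pressure_to_success(flops))
```

Under these assumptions, two attacks with the same ASR at a fixed query budget can still produce very different curves, since one may reach that success rate at a far lower FLOP count than the other.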
