Aligning Language Model Benchmarks with Pairwise Preferences 文章

ArXiv CS.CL2026-06-30PAPERen作者: Marco Gutierrez, Xinyi Leng, Hannah Cyberey, Jonathan Richard Schwarz, Ahmed Alaa, Thomas Hartvigsen

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.CL
作者: Marco Gutierrez, Xinyi Leng, Hannah Cyberey, Jonathan Richard Schwarz, Ahmed Alaa, Thomas Hartvigsen
文章类型: PAPER
语言: en
发布日期: 2026-06-30

原文

摘要

arXiv:2602.02898v3 Announce Type: replace-cross Abstract: Language model benchmarks are pervasive and computationally-efficient proxies for real-world performance. However, many recent works find that benchmarks often fail to predict real utility. Towards bridging this gap, we introduce benchmark alignment, where we use limited amounts of information about model performance to automatically update offline benchmarks, aiming to produce new static benchmarks that predict model pairwise preferences in given test settings. We then propose BenchAlign, the first solution to this problem, which learns preference-aligned weightings for benchmark questions using the question-level performance of language models alongside ranked pairs of models that could be collected during deployment, producing new benchmarks that rank previously unseen models according to these preferences.

Aligning Language Model Benchmarks with Pairwise Preferences 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (2)