Pairwise Reference Alignment as a Model-Level Ordinal Observable 文章

ArXiv CS.CL2026-06-01NEWSen作者: Mujing Li

详细信息

来源站点: ArXiv CS.CL
作者: Mujing Li
文章类型: NEWS
语言: en
发布日期: 2026-06-01

摘要

arXiv:2605.30758v1 Announce Type: new Abstract: Pairwise preference data is widely used in language-model evaluation and alignment, often for model ranking, reward modeling, or preference optimization. This note formulates a more basic measurement question: given a reference distribution of pairwise preferences, what model-level quantity is estimated when we test whether a model ranks preferred responses above rejected responses? We define pairwise reference alignment as an ordinal observable induced by a model scoring function. Given a reference pair distribution $P_{\mathrm{pair}}$ over triples $(x,y^+,y^-)$, and a scalar model score $S_M(x,y)$, we define the alignment observable as the probability that the model-induced ordering agrees with the reference preference ordering. We further define a centered order-parameter-like statistic and discuss a margin-based extension.

Pairwise Reference Alignment as a Model-Level Ordinal Observable 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术