SenseJudge: Human-Centric Preference-Driven Judgment Framework 文章

ArXiv CS.CL2026-06-03NEWSen作者: Rui Li, Junfeng Liu, Xiangwen Kong, Linhai Xu, Zhifang Sui

摘要

arXiv:2606.03189v1 Announce Type: new Abstract: Large Language Models (LLMs) as judges across various scenarios such as assessing model responses is becoming an increasingly accepted paradigm. However, existing judgment approaches often rely on trained judgers using fixed preference data, which tend to overlook diverse user preferences and struggle to adapt to real-world human-AI dialogue scenarios. To address these limitations, we propose SenseJudge, a customizable judgment framework driven by human preferences and SenseBench, a diverse and challenging instruction-following benchmark derived from real-world multi-turn interactions. We applied the automatic judgment framework and benchmark to two tasks: (1) LLMs as personalized judges, and (2) model ranking.

SenseJudge: Human-Centric Preference-Driven Judgment Framework 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (8)

相关技术查看全部 (1)