The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment 事件

Name: The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment
Start: 2026-06-03

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment arXiv:2606.03043v1 Announce Type: new Abstract: LMs-as-judges are now standard, yet judges agree strongly with one another while agreeing only weakly with humans. We test whether this reflects shared signal or shared bias by measuring four geometric quantities on the standard LLM-as-judge stack across four community-built Indic datasets, eight Indic languages, and 41 LLM judges: score spread, effective rank, principal

人工智能

关系图谱

The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment 事件

相关公司查看全部 (10)

相关人物查看全部 (1)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)