Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge 事件

Name: Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge
Start: 2026-06-03

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge arXiv:2512.03019v2 Announce Type: replace-cross Abstract: Thinking Large Language Models (LLMs) used as judges for pairwise preferences remain noisy at the single-sample level, and common aggregation rules (majority vote, soft self-consistency, or instruction-based self-aggregation) are inconsistent when ties are allowed. We study inference-time compute (ITC) for evaluators that generate n independent thinking--rating sa

人工智能

关系图谱

Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge 事件

相关公司查看全部 (10)

相关人物查看全部 (2)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)