SLMJury: Can Small Language Models Judge as Well as Large Ones? 事件

Name: SLMJury: Can Small Language Models Judge as Well as Large Ones?
Start: 2026-06-09

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

SLMJury: Can Small Language Models Judge as Well as Large Ones? arXiv:2606.07810v1 Announce Type: cross Abstract: Large language models (LLMs) are widely used as judges for evaluating model outputs, but their high cost, latency, and opacity limit scalability. We introduce SLMJury, a framework for evaluating small language models (SLMs) as judges across two paradigms: closed-ended binary correctness and open-ended quality scoring. We benchmark 16 SLM judges (0.6B-14B parameters) from four model

人工智能

关系图谱

SLMJury: Can Small Language Models Judge as Well as Large Ones? 事件

相关公司查看全部 (10)

相关人物查看全部 (2)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)