PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers arXiv:2605.26730v1 Announce Type: new Abstract: The rapid growth in submissions to machine learning venues has strained the scientific peer-review system and intensified interest in LLM-based automated peer reviewers. However, how good these systems are actually, especially compared to human reviewers at catching scientific gaps, remains poorly understood. In this work, we introduce PRISM (Peer Review Intelligence via Struct