On the Limits of LLM-as-Judge for Scientific Novelty Assessment

Event

PRODUCT_LAUNCH 2026-06-11 Impact: MEDIUM

arXiv:2606.12071v1 (announce type: cross)

Abstract: LLMs are increasingly used to generate and judge scientific ideas. This makes novelty evaluation a central problem. Full idea evaluation is difficult because it often requires judging a method, its feasibility, and its empirical promise. We therefore study a cleaner upstream object: the research question (RQ). RQ generation is a prerequisite for scientific ideation, and RQs can be c