E3: Issue-Level Backtesting for Automated Research Critique

ArXiv CS.CL | 2026-05-27 | NEWS | en | Authors: Yashwardhan Chaudhuri, Sanyam Jain, Paridhi Mundra

Details

Source: ArXiv CS.CL
Authors: Yashwardhan Chaudhuri, Sanyam Jain, Paridhi Mundra
Article type: NEWS
Language: en
Published: 2026-05-27

Abstract

arXiv:2605.27072v1. Announce type: new.

We present E3, an automated review assistant that augments reviewers and engineering teams by identifying decision-relevant technical concerns in research papers. For each concern, E3 reports its nature, its location, its bearing on the contribution, and the analysis or evidence that would resolve it. The concerns covered include unsupported claims, missing ablations, weak baselines, hidden assumptions, threats to validity, and leakage risks. To evaluate E3 without contamination confounds, we adopt an issue-level backtesting protocol: the corpus is restricted to papers postdating the training cutoff of every automated source, and for each paper a meta-judge that observes only anonymised reviews labels every issue-source pair as Caught, Partial, or Missed. The protocol is applied to 100 ICLR 2026 papers and 4598 judged issue rows, comparing E3 against the ICLR human reviews and two prompt-matched LLM baselines built on gpt-5.
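The backtesting protocol described in the abstract has two mechanical steps that a short sketch may make concrete: filtering the corpus to papers published after every automated source's training cutoff, and tallying the meta-judge's Caught / Partial / Missed labels per source. The Python below is only an illustration under stated assumptions, not the authors' implementation. The names `Paper`, `eligible_papers`, and `tally_labels`, the source names, and all dates and rows are hypothetical toy values; the abstract does not say how labels are aggregated into a score, so none is computed here.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date
from enum import Enum


class Label(Enum):
    """The three meta-judge labels named in the abstract."""
    CAUGHT = "Caught"
    PARTIAL = "Partial"
    MISSED = "Missed"


@dataclass(frozen=True)
class Paper:
    # Hypothetical record type; the field names are assumptions.
    paper_id: str
    published: date


def eligible_papers(papers, training_cutoffs):
    """Keep papers published strictly after the latest cutoff of any automated source."""
    latest_cutoff = max(training_cutoffs.values())
    return [p for p in papers if p.published > latest_cutoff]


def tally_labels(judged_rows):
    """Count labels per source from (issue_id, source, label) rows."""
    counts = {}
    for _issue_id, source, label in judged_rows:
        counts.setdefault(source, Counter())[label] += 1
    return counts


# Toy inputs: every date and row below is invented for illustration.
papers = [
    Paper("paper-a", date(2025, 9, 1)),
    Paper("paper-b", date(2024, 1, 1)),
]
training_cutoffs = {
    "e3": date(2025, 6, 1),
    "llm_baseline_1": date(2024, 10, 1),
}
judged_rows = [
    ("issue-1", "e3", Label.CAUGHT),
    ("issue-1", "human_reviews", Label.CAUGHT),
    ("issue-2", "e3", Label.PARTIAL),
    ("issue-2", "human_reviews", Label.MISSED),
]

kept = eligible_papers(papers, training_cutoffs)
counts = tally_labels(judged_rows)
```

Taking the maximum over all cutoffs is what makes the filter hold for every automated source at once: a paper that postdates the latest cutoff necessarily postdates the others. Human reviews carry no cutoff in this sketch, since the restriction in the abstract applies to automated sources only.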
