Evaluating AI-based Scientific Knowledge Synthesis with Epidemiological Systematic Reviews 事件

PRODUCT_LAUNCH2026-06-08影响: MEDIUM

Evaluating AI-based Scientific Knowledge Synthesis with Epidemiological Systematic Reviews arXiv:2603.22327v2 Announce Type: replace-cross Abstract: Systematic literature reviews (SLRs) are a demanding and high-stakes form of scientific knowledge synthesis that remains underspecified as an evaluation setting for large language models (LLMs). We introduce AgentSLR, a large-scale evaluation harness comprising an SLR automation workflow and an expert annotated dataset covering 16,248 articles, des