Position: State-of-the-Art Claims Require State-of-the-Art Evidence 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Position: State-of-the-Art Claims Require State-of-the-Art Evidence arXiv:2605.17273v2 Announce Type: replace-cross Abstract: State-of-the-Art (SOTA) claims pervade Artificial Intelligence (AI) and Machine Learning (ML) research. These claims rest on benchmark evaluations, where models are ranked by aggregate scores across tasks. Public benchmarks or leaderboards are the most visible instance, but the same structure appears in paper tables throughout the literature. However, such minimal eviden