LiveK12Bench: Have Large Multimodal Models Truly Conquered High School-level Examinations? · Event

PRODUCT_LAUNCH · 2026-05-27 · Impact: MEDIUM

arXiv:2605.26781v1 · Announce Type: new

Abstract: Advanced Large Multimodal Models (LMMs) have demonstrated impressive performance in K-12 reasoning tasks, exhibiting great promise as intelligent tutors. Realizing this potential requires models to navigate real-world examinations effectively, yet most existing benchmarks fail to capture the complexity of authentic testing environments. Specifically, most da
