LiveK12Bench: Have Large Multimodal Models Truly Conquered High School-level Examinations? · Event

PRODUCT_LAUNCH · 2026-05-27 · Impact: MEDIUM

arXiv:2605.26781v1 · Announce Type: new

Abstract: Advanced Large Multimodal Models (LMMs) have demonstrated impressive performance in K-12 reasoning tasks, exhibiting great promise as intelligent tutors. Realizing this potential requires models to navigate real-world examinations effectively, yet most existing benchmarks fail to capture the complexity of authentic testing environments. Specifically, most da

LiveK12Bench: Have Large Multimodal Models Truly Conquered High School-level Examinations? · Related companies

World Labs · RESEARCH_INSTITUTE
TERI · NONPROFIT
Telligen · COMPANY
Amina · NONPROFIT
Span · NONPROFIT
Tam · COMPANY
EAT · NONPROFIT
Iter · RESEARCH_INSTITUTE
ACT · NONPROFIT