OpenCompass: A Universal Evaluation Platform for Large Language Models 事件
OPEN_SOURCE2026-05-29影响: MEDIUM
OpenCompass: A Universal Evaluation Platform for Large Language Models arXiv:2605.19276v2 Announce Type: replace Abstract: In recent years, the field of artificial intelligence has undergone a paradigm shift from task-specific small-scale models to general-purpose large language models (LLMs). With the rapid iteration of LLMs, objective, quantitative, and comprehensive evaluation of their capabilities has become a critical link in advancing technological development. Currently, the mainstream s
相关产品查看全部 (10)
相关报道查看全部 (1)
OpenCompass: A Universal Evaluation Platform for Large Language Models
ArXiv CS.CL2026-05-29