OpenCompass: A Universal Evaluation Platform for Large Language Models 事件

OPEN_SOURCE2026-05-29影响: MEDIUM

OpenCompass: A Universal Evaluation Platform for Large Language Models arXiv:2605.19276v2 Announce Type: replace Abstract: In recent years, the field of artificial intelligence has undergone a paradigm shift from task-specific small-scale models to general-purpose large language models (LLMs). With the rapid iteration of LLMs, objective, quantitative, and comprehensive evaluation of their capabilities has become a critical link in advancing technological development. Currently, the mainstream s