Measuring Massive Multitask Chinese Understanding · Event

PRODUCT_LAUNCH · 2026-05-28 · Impact: MEDIUM

arXiv:2304.12986v3 · Announce Type: replace

Abstract: The development of large-scale Chinese language models is flourishing, yet there is a lack of corresponding capability assessments. Therefore, we propose a test to measure the multitask accuracy of large Chinese language models. This test encompasses four major domains, including medicine, law, psychology, and education, with 15 subtasks in medicine and 8 subtasks in education. We found that th

Measuring Massive Multitask Chinese Understanding · Related coverage
