Confidence Calibration in Large Language Models 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
Confidence Calibration in Large Language Models arXiv:2605.23909v1 Announce Type: new Abstract: We investigate the calibration of large language models' (LLMs') confidence across diverse tasks. The results of our preregistered study show that the current crop of LLMs are, like people, too sure they are right: confidence exceeds accuracy, on average. Importantly, however, this tendency is moderated by a powerful hard-easy effect, wherein overconfidence is greatest on difficult tests; by contrast
相关人物
暂无数据
相关产品查看全部 (10)
相关报道查看全部 (1)
Confidence Calibration in Large Language Models
ArXiv CS.AI2026-05-26