Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth
Event: PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM
arXiv:2605.25052v1 Announce Type: new
Abstract: Chains of thought (CoTs) have become central in interpreting and auditing behaviors of large language models. Yet growing evidence suggests that these traces often fail to faithfully represent the computations behind a model's predictions. Several faithfulness metrics have been proposed, but whether they indeed measure faithfulness remains unknown. Answering this ...
Source: arXiv cs.CL, 2026-05-26