Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM

arXiv:2605.25052v1 · Announce Type: new

Abstract: Chains of thought (CoTs) have become central in interpreting and auditing behaviors of large language models. Yet growing evidence suggests that these traces often fail to faithfully represent the computations behind a model's predictions. Several faithfulness metrics have been proposed, but whether they indeed measure faithfulness remains unknown. Answering this
