摘要
arXiv:2605.24960v1 Announce Type: new Abstract: Chain-of-Thought (CoT) faithfulness, i.e., whether CoTs genuinely reflect large language models' (LLM) underlying behavior, is typically evaluated under two disjoint paradigms: contextual faithfulness, measured by perturbing the input or CoT trace, and parametric faithfulness, assessed by intervening on a model's parametric knowledge. Yet prior work compares them only descriptively. We fill this gap by proposing FaithMate, a unified preference-alignment interface for optimizing models towards either faithfulness paradigm. It enables us to investigate the interplay between the two paradigms, examining whether and to what extent faithfulness gains generalize within and across paradigms.
相关事件查看全部 (1)
Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization
2026-05-26PRODUCT_LAUNCH影响: MEDIUM
相关公司查看全部 (4)
相关人物
暂无数据