Chain-of-Thought Reasoning In The Wild Is Not Always Faithful 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Chain-of-Thought Reasoning In The Wild Is Not Always Faithful arXiv:2503.08679v5 Announce Type: replace-cross Abstract: Recent studies indicate that when faced with explicit biases in prompts, models often omit mentioning these biases in their Chain-of-Thought (CoT) output, revealing that verbalized reasoning can give an incorrect picture of how models arrive at conclusions (unfaithfulness). In this work, we show that unfaithful CoT also occurs on naturally worded, non-adversarial prompts witho