Decomposing Factual Sycophancy in Language Models: How Size and Instruction Tuning Shape Robustness 事件
PRODUCT_LAUNCH2026-06-05影响: MEDIUM
Decomposing Factual Sycophancy in Language Models: How Size and Instruction Tuning Shape Robustness arXiv:2606.06306v1 Announce Type: new Abstract: Factual sycophancy occurs when a language model abandons a correct, verifiable answer under social pressure. Because a flip occurs only when pressure toward a false answer exceeds the model's neutral preference for the truth, flip rates conflate two mechanisms: the strength of that baseline preference (truth margin), and how far pressure shifts it (