Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

ArXiv CS.CL | 2026-06-04 | Authors: Nizar Islah, Istabrak Abbes, Irina Rish, Sarath Chandar, Eilif B. Muller

Abstract

arXiv:2606.05145v1 (announce type: cross). When post-trained language models fail on reasoning problems, the common test-time-scaling response is to spend more compute on additional attempts, and the failed traces play no further role. We argue this discards a crucial signal: some failures come from unlucky sampling, where more rollouts help, while others are structural and resist resampling regardless of budget. We propose that failed traces encode recoverability structure: the inference-time signature of which test-time interventions can rescue a given failure. Three problem-level trajectory features, derived from the structure of available interventions, recover this structure from the distributional signature of failed rollouts, not their text. They cluster failures into stable regimes, characterize the failure topography of different post-training methods ($84.3{\pm}4.
