Real-Time Progress Prediction in Reasoning Language Models 文章

ArXiv CS.AI2026-05-27NEWSen作者: Hans Peter Lyngs{\o}e Raaschou-Jensen, Constanza Fierro, Anders S{\o}gaard

摘要

arXiv:2506.23274v4 Announce Type: replace-cross Abstract: Recent reasoning language models, particularly those that employ long latent chains of thought, achieve strong performance on complex agentic tasks. However, as these models operate over increasingly long time horizons, their internal progress becomes opaque to users, making expectation management and real-time oversight difficult. In this work, we investigate whether real-time progress prediction is feasible for such models. We first test whether hidden states encode progress information by discretizing reasoning trajectories and training a linear probe to classify reasoning states. We then fine-tune models to generate progress estimates from 0--100\% during chain-of-thought reasoning. Our strongest progress-reporting checkpoint reaches 0.161 MAE on mathematical reasoning traces and outperforms position baselines in this setting.

Real-Time Progress Prediction in Reasoning Language Models 文章

摘要

相关事件查看全部 (1)

相关公司查看全部 (3)

相关人物

相关产品查看全部 (12)

相关技术查看全部 (21)