Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs 文章

ArXiv CS.CL2026-06-05NEWSen作者: Sora Miyamoto, Daisuke Oba, Naoaki Okazaki

Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs · 相关事件