LLMs versus the Halting Problem: Characterizing Program Termination Reasoning

Source: ArXiv CS.CL | Published: 2026-05-27 | Type: NEWS | Language: en
Authors: Oren Sultan, Jordi Armengol-Estape, Pascal Kesseli, Julien Vanegue, Dafna Shahaf, Yossi Adi, Peter O'Hearn

Abstract

arXiv:2601.18987v5 (announce type: replace)

Determining whether a program terminates is a central problem in computer science. Turing's Halting Problem established that termination is undecidable: no algorithm can determine termination for all programs and inputs. Hence, verification tools approximate termination and sometimes fail to either prove or disprove it; these tools rely on problem-specific architectures and are usually tied to particular programming languages. Recent advances in LLMs raise a natural question: to what extent can they reason about program termination? We evaluate frontier LLMs on a diverse set of C programs from the International Competition on Software Verification (SV-COMP) 2025. Our results show that GPT-5 and Claude Sonnet 4.5, with test-time scaling, achieve scores comparable to those of top-ranked verification tools.
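
As an illustration of what "approximating termination" can mean, the following minimal Python sketch runs a deterministic program for a bounded number of steps and returns one of three verdicts. It is not the method of the paper or of any SV-COMP tool; the names `check_termination`, `step`, and `budget` are hypothetical, and the sketch assumes program states are hashable and that `step` returns `None` once the program has halted.

```python
from typing import Callable, Hashable, Optional

State = Optional[Hashable]


def check_termination(step: Callable[[Hashable], State],
                      state: State,
                      budget: int = 1000) -> str:
    """Bounded execution with cycle detection (illustrative only).

    Returns "terminates" if the program halts within the budget,
    "loops" if a state repeats (a deterministic program that revisits
    a state runs forever), and "unknown" otherwise.
    """
    seen = set()
    for _ in range(budget):
        if state is None:          # program halted
            return "terminates"
        if state in seen:          # repeated state: provably non-terminating
            return "loops"
        seen.add(state)
        state = step(state)
    # Budget exhausted: halting on the very last step still counts.
    return "terminates" if state is None else "unknown"


def countdown(n: int) -> Optional[int]:
    # Halts when n reaches 0.
    return None if n == 0 else n - 1


def toggle(b: bool) -> bool:
    # Alternates between two states forever.
    return not b


def count_up(n: int) -> int:
    # Never halts and never repeats a state.
    return n + 1


def collatz(n: int) -> Optional[int]:
    # Halts when n reaches 1; no general termination proof is known.
    if n == 1:
        return None
    return n // 2 if n % 2 == 0 else 3 * n + 1


if __name__ == "__main__":
    print(check_termination(countdown, 5))
    print(check_termination(toggle, True))
    print(check_termination(count_up, 0, budget=100))
    print(check_termination(collatz, 27, budget=50))
```

The third verdict is the point of the sketch: `count_up` never halts, yet the checker can only report "unknown" because no state repeats, and the Collatz run starting from 27 is reported as "unknown" under a small budget even though it halts under a larger one.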
