详细信息
- 来源站点
- ArXiv CS.CL
- 作者
- Raoyuan Zhao, Yihong Liu, Yupei Du, Hinrich Sch\"utze, Michael A. Hedderich
- 文章类型
- NEWS
- 语言
- en
- 发布日期
- 2026-05-28
摘要
arXiv:2605.27709v1 Announce Type: new Abstract: Mathematical reasoning benchmarks are vital for evaluating large language models (LLMs), but many are static and repeatedly exposed through public evaluation and training pipelines, making it difficult to separate genuine reasoning from memorization. Meanwhile, manually constructing new math problems with reliable answers remains costly. We introduce ReverseMath, a scalable method for generating new math problems through answer inversion. Given a problem and its answer, ReverseMath masks a numerical value in the original problem, treats the original answer as a known condition, and rewrites the problem so that the masked value becomes the new answer. The generated problem reverses the original input-output relation, making its answer known by construction. We study ReverseMath for both evaluation and training.