Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification 文章

ArXiv CS.AI2026-06-02NEWSen作者: Shihao Ji, Haotao Tan, Zihui Song, Mingyu Li

Expected Value Alignment for Generative Reward Modeling in Formal Mathematics Verification · 相关技术