MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation 文章

ArXiv CS.AI2026-05-28NEWSen作者: Yutong Wang, Pengliang Ji, Chaoqun Yang, Kaixin Li, Ming Hu, Jiaoyang Li, Guillaume Sartoretti

MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation · 相关技术