SEAL: Can Saturated Benchmarks Be Revived by LLM-as-a-Meta-Judge?

arXiv cs.CL · 2026-05-29 · Authors: Jiamin Chen, Yidi Wu, Qiexiang Wang, Qianben Chen, Yuchen Li, Yansen Zhang, Xiaokun Zhang, Wangchunshu Zhou, Chen Ma

Related Technologies