EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation 文章

ArXiv CS.CL2026-05-29NEWSen作者: Xin Guan, Xiaomeng Hu, Shen Huang, Zhenyi Wang, Bo Zhang, Zijian Li, Pengjun Xie, Bo Liu, Jiuxin Cao

EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation · 相关技术