SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment

ArXiv CS.AI · 2026-05-26 · Authors: Sihang Jiang, Lipeng Ma, Zhonghua Hong, Keyi Wang, Zhiyu Lu, Tengfei Wang, Shisong Chen, Jinghao Zhang, Tianjun Pan, Weijia Li, Jiaqing Liang, Yanghua Xiao
