SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents

ArXiv CS.CL · 2026-06-02 · Authors: Yujiong Shen, Yajie Yang, Zhiheng Xi, Binze Hu, Huayu Sha, Jiazheng Zhang, Qiyuan Peng, Junlin Shang, Jixuan Huang, Yutao Fan, Jingqi Tong, Shihan Dou, Ming Zhang, Lei Bai, Zhenfei Yin, Tao Gui, Xingjun Ma, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang
