TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents 文章

ArXiv CS.AI2026-06-06NEWSen作者: Chengqi Dong, Chuhuai Yue, Hang He, yandong liu, Fenghe Tang, S Kevin Zhou, Xiaohan Wang, Jiajun Chai, Guojun Yin

TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents · 相关人物

暂无数据