Skill-Pro: Learning Reusable Skills from Experience via Non-Parametric PPO for LLM Agents 文章

ArXiv CS.AI2026-05-29NEWSen作者: Qirui Mi, Zhijian Ma, Mengyue Yang, Haoxuan Li, Yisen Wang, Haifeng Zhang, Jun Wang

Skill-Pro: Learning Reusable Skills from Experience via Non-Parametric PPO for LLM Agents · 相关技术