Skill-Pro: Learning Reusable Skills from Experience via Non-Parametric PPO for LLM Agents (Event)

Name: Skill-Pro: Learning Reusable Skills from Experience via Non-Parametric PPO for LLM Agents
Start: 2026-05-29

SHUTDOWN 2026-05-29 Impact: LOW

arXiv:2602.01869v3 (Announce Type: replace)

Abstract: LLM-driven agents excel at sequential decision-making but often rely on on-the-fly reasoning, re-deriving solutions even in recurring scenarios. This insufficient experience reuse leads to computational redundancy and instability. To bridge this gap, we propose Skill-Pro, a framework enabling agents to autonomously learn reusable procedural skills from in

Artificial Intelligence
