MARFT: Multi-Agent Reinforcement Fine-Tuning

Details

Source: ArXiv CS.AI
Authors: Junwei Liao, Muning Wen, Jun Wang, Weinan Zhang
Article type: NEWS
Language: en
Published: 2026-06-02

Abstract

arXiv:2504.16129v5

Large Language Model (LLM)-based Multi-Agent Systems (LaMAS) have demonstrated strong capabilities on complex agentic tasks requiring multifaceted reasoning and collaboration, from high-quality presentation generation to scientific research. Meanwhile, Reinforcement Learning (RL) is widely recognized for enhancing agent intelligence, but limited work has studied fine-tuning LaMAS with foundational RL techniques. Directly applying conventional Multi-Agent Reinforcement Learning (MARL) to LaMAS also introduces major challenges due to the unique mechanisms of LaMAS. To address these challenges, this article presents a comprehensive study of LLM-based MARL and proposes Multi-Agent Reinforcement Fine-Tuning (MARFT). We introduce Flex-MG, a new Markov Game formulation aligned with real-world LaMAS optimization, together with a universal algorithmic framework tailored to LaMAS.
