TRACER: Turn-level Regret Matching with Inner Reinforcement Credit for Cooperative Multi-LLM Reasoning 文章

ArXiv CS.AI2026-05-28NEWSen作者: Chusen Li, Zhou Liu, Shuigeng Zhou, Wentao Zhang

TRACER: Turn-level Regret Matching with Inner Reinforcement Credit for Cooperative Multi-LLM Reasoning · 相关事件