Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization 文章

ArXiv CS.AI2026-05-29NEWSen作者: Wenwu Li, Yuran Song, Mingze Zhao, Bo Jin, Wenhao Li

摘要

arXiv:2605.30227v1 Announce Type: cross Abstract: While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of the computation graph and the sparsity of global supervisory signals. Existing black-box optimizers struggle to attribute trajectory-level failure to specific local components, resulting in inefficient, high-variance exploration. We argue that tractable MAS optimization needs structural inductive biases to disentangle error signals. We propose temporal and structural credit assignment, which decomposes the objective along two axes: (i) temporal credit, using state-space bottlenecks to identify critical rounds, and (ii) structural credit, using stationary role policies to isolate agent contributions.