Weak-Driven Learning: How Weak Agents make Strong Agents Stronger 事件
PRODUCT_LAUNCH2026-06-09影响: MEDIUM
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger arXiv:2602.08222v2 Announce Type: replace Abstract: As post-training optimization becomes central to improving large language models, we observe a persistent saturation bottleneck: once models grow highly confident, further training yields diminishing returns. While existing methods continue to reinforce target predictions, we find that informative supervision signals remain latent in models' own historical weak states. Motivated
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger · 相关报道
相关报道
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
ArXiv CS.AI2026-06-09