From Procedural Skills to Strategy Genes: Towards Experience-Driven Test-Time Evolution 文章

ArXiv CS.CL2026-06-03NEWSen作者: Junjie Wang, Yiming Ren, Haoyang Zhang

摘要

arXiv:2604.15097v2 Announce Type: replace-cross Abstract: This beta technical report asks how reusable experience should be represented so that it can function as effective test-time control and as a substrate for iterative evolution. We study this question in 4.590 controlled trials across 45 scientific code-solving scenarios. We find that documentation-oriented Skill packages provide unstable control: their useful signal is sparse, and expanding a compact experience object into a fuller documentation package often fails to help and can degrade the overall average. We further show that representation itself is a first-order factor. A compact Gene representation yields the strongest overall average, remains competitive under substantial structural perturbations, and outperforms matched-budget Skill fragments, while reattaching documentation-oriented material usually weakens rather than improves it.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据