Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas 文章

ArXiv CS.AI2026-05-29NEWSen作者: V\'ictor Gallego

摘要

arXiv:2605.30003v1 Announce Type: cross Abstract: We study two-level autoresearch for cooperation: an outer-loop AI agent autonomously redesigns the inner-loop pipeline of an LLM policy-synthesis system for multi-agent Sequential Social Dilemmas (SSDs). A researcher agent $\mathcal{R}$ (run as a coding agent) reads the inner-loop source code, edits system prompts, feedback functions, helper libraries, and iteration logic, runs evaluations, and decides what to keep, following the autoresearch paradigm. Across two games (Cleanup and Gathering), two policy-synthesizer LLMs, and two welfare objectives (utilitarian efficiency and Rawlsian maximin), the researcher reliably exceeds hand-designed baselines, sharply tightens run-to-run variance, and outperforms prompt-only optimization.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据