SePO: Self-Evolving Prompt Agent for System Prompt Optimization 文章

ArXiv CS.CL2026-06-04NEWSen作者: Wangcheng Tao, Han Wu, Weng-Fai Wong

摘要

arXiv:2606.04465v1 Announce Type: new Abstract: System prompt optimization improves agent behavior without modifying the underlying model, yielding human-readable, model-agnostic instructions. Existing methods build a prompt agent that refines task agents' system prompts, yet leave the prompt agent's own system prompt hand-engineered and fixed. We propose Self-Evolving Prompt Optimization (SePO), which treats the prompt agent's own system prompt as an optimization target alongside task agents' system prompts. SePO adopts a self-referential design. A single prompt agent improves both task agents' system prompts and its own under an open-ended evolutionary search that maintains an archive of candidate prompts as stepping stones. Training proceeds in two stages: pre-training evolves the prompt agent on a multi-task pool, and fine-tuning then applies it to a target task.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据