SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents 文章

ArXiv CS.AI2026-06-02NEWSen作者: Danlong Yuan, Wei Wu, Enhan Zhao, Zhengren Wang, Xueliang Zhao, Huishuai Zhang, Dongyan Zhao

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.AI
作者: Danlong Yuan, Wei Wu, Enhan Zhao, Zhengren Wang, Xueliang Zhao, Huishuai Zhang, Dongyan Zhao
文章类型: NEWS
语言: en
发布日期: 2026-06-02

原文

摘要

arXiv:2602.11210v5 Announce Type: replace-cross Abstract: Reinforcement learning (RL) has become a key paradigm for training software engineering (SWE) agents, but existing pipelines typically rely on per-task containers for isolation. At scale, pre-built container images incur substantial storage overhead, slow environment setup, and require container-management privileges. We propose SWE-MiniSandbox, a lightweight, container-free method that enables scalable RL training of SWE agents without sacrificing isolation. Instead of relying on per-instance containers, SWE-MiniSandbox executes each task in an isolated workspace backed by kernel-level mechanisms, substantially reducing system overhead. It leverages lightweight environment pre-caching techniques to eliminate the need for bulky container images.

SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (3)