SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents 文章

ArXiv CS.AI2026-06-02NEWSen作者: Danlong Yuan, Wei Wu, Enhan Zhao, Zhengren Wang, Xueliang Zhao, Huishuai Zhang, Dongyan Zhao

SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents · 相关技术