SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents | Event

PRODUCT_LAUNCH | 2026-06-02 | Impact: MEDIUM

arXiv:2602.11210v5. Announce Type: replace-cross. Abstract: Reinforcement learning (RL) has become a key paradigm for training software engineering (SWE) agents, but existing pipelines typically rely on per-task containers for isolation. At scale, pre-built container images incur substantial storage overhead, slow down environment setup, and require container-management privileges. We propose SWE-MiniSandbo