SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale (Event)

PRODUCT_LAUNCH · 2026-06-02 · Impact: MEDIUM

arXiv:2602.23866v2. Announce Type: replace-cross. Abstract: Software engineering agents (SWE) are improving rapidly, with recent gains largely driven by reinforcement learning (RL). However, RL training is constrained by the scarcity of large-scale task collections with reproducible execution environments and reliable test suites. Although a growing number of benchmarks have emerged, datasets suitable for training remain limited in sc
