TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework
PRODUCT_LAUNCH | 2026-06-05 | Impact: MEDIUM
arXiv:2606.05570v1. Announce Type: new.
Abstract: Repository-level coding benchmarks face a trade-off between task difficulty and evaluation reliability: tasks that challenge frontier models often involve large codebases with incomplete test coverage, while human review does not scale. We introduce TensorBench, a benchmark of 199 feature-addition and refactoring tasks on an open-source compiler-based tensor framework tha
Related coverage (1): TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework (ArXiv CS.CL, 2026-06-05)