Verifiable Benchmarking of Long-Horizon Spatial Biology 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Verifiable Benchmarking of Long-Horizon Spatial Biology arXiv:2605.28065v1 Announce Type: new Abstract: AI agents are increasingly useful for biological data analysis, but existing benchmarks mostly test broad biological knowledge, executable workflows, or localized analysis steps rather than end-to-end scientific reasoning over spatial measurements. We introduce SpatialBench-Long, a benchmark for long-horizon spatial biology in which agents must recover biological claims from raw or near-raw d