Verifiable Benchmarking of Long-Horizon Spatial Biology 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
Verifiable Benchmarking of Long-Horizon Spatial Biology arXiv:2605.28065v1 Announce Type: new Abstract: AI agents are increasingly useful for biological data analysis, but existing benchmarks mostly test broad biological knowledge, executable workflows, or localized analysis steps rather than end-to-end scientific reasoning over spatial measurements. We introduce SpatialBench-Long, a benchmark for long-horizon spatial biology in which agents must recover biological claims from raw or near-raw d
相关人物
暂无数据
相关产品查看全部 (10)
相关报道查看全部 (1)
Verifiable Benchmarking of Long-Horizon Spatial Biology
ArXiv CS.AI2026-05-28