AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials arXiv:2510.04704v4 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown promising potential in scientific research, enabling tasks ranging from knowledge retrieval to property prediction. Existing science benchmarks mainly focus on perceptual or knowledge-based tasks, largely ignoring the modelling tasks, a fundamental starting point for any real scientific researc

AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials · 相关公司

W
World LabsRESEARCH_INSTITUTE
A
arXivNONPROFIT
T
TERINONPROFIT
A
ACTIONNONPROFIT
C
CATIRESEARCH_INSTITUTE
E
EATNONPROFIT
S
StartNONPROFIT
A
ACTNONPROFIT
S
SearchNONPROFIT
F
FINDNONPROFIT