SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy arXiv:2602.22971v2 Announce Type: replace Abstract: As LLMs achieved breakthroughs in general reasoning, their proficiency in specialized scientific domains reveals pronounced gaps in existing benchmarks due to data contamination, insufficient complexity, and prohibitive human labor costs. Here we present SPM-Bench, an original, PhD-level multimodal benchmark specifically designed for scanning probe microscopy (SPM). We