Can LLMs Reason Structurally? Benchmarking via the Lens of Data Structures 事件
BREAKTHROUGH2026-06-02影响: HIGH
Can LLMs Reason Structurally? Benchmarking via the Lens of Data Structures arXiv:2505.24069v4 Announce Type: replace-cross Abstract: Large language models (LLMs) are deployed on increasingly complex tasks that require multi-step decision-making. Understanding their algorithmic reasoning abilities is therefore crucial. However, we lack a diagnostic benchmark for evaluating these capabilities. We propose to use data structures as a principled lens: as fundamental building blocks of algorithms, th