X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes 事件
BREAKTHROUGH2026-06-03影响: HIGH
X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes arXiv:2603.05290v2 Announce Type: replace Abstract: Large language models (LLMs) achieve promising performance, yet their ability to reason remains poorly understood. Existing evaluations largely emphasize task-level accuracy, often conflating pattern matching with reasoning capability. We present X-RAY, an explainable reasoning analysis system that maps the LLM reasoning capability using calibrated, formally verified
相关产品查看全部 (10)
相关报道查看全部 (1)
X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes
ArXiv CS.AI2026-06-03