OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

Event: PRODUCT_LAUNCH | 2026-05-27 | Impact: MEDIUM

arXiv:2605.26322v1 | Announce Type: new

Abstract: Theory of Mind (ToM), the ability to infer others' knowledge, intentions, and emotions, is commonly evaluated in large language models (LLMs) using end-point question answering, where performance is judged solely by the final answer to a social reasoning query. This paradigm obscures whether the model actually constructs the underlying mental-state representations required f