OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling arXiv:2605.26322v1 Announce Type: new Abstract: Theory of Mind (ToM), the ability to infer others' knowledge, intentions, and emotions, is commonly evaluated in large language models (LLMs) using end-point question answering, where performance is judged solely by the final answer to a social reasoning query. This paradigm obscures whether the model actually constructs the underlying mental-state representations required f
相关公司查看全部 (10)
相关产品查看全部 (10)
相关报道查看全部 (1)
OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling
ArXiv CS.AI2026-05-27