DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories arXiv:2604.20443v2 Announce Type: replace Abstract: We introduce DialToM, an annotated Theory of Mind (ToM) benchmark built from naturalistic human-human dialogues using a multiple-choice evaluation framework. Concurrent with recent work showing a gap between explicit mental-state inference and applied ToM in synthetic settings~\cite{gu2024simpletom}, we establish a stricter \emph{State-Driven Diagnostic Prob