MemoryDocDataSet: A Benchmark for Joint Conversational Memory and Long Document Reasoning 事件
PRODUCT_LAUNCH2026-06-04影响: MEDIUM
MemoryDocDataSet: A Benchmark for Joint Conversational Memory and Long Document Reasoning arXiv:2606.04442v1 Announce Type: new Abstract: AI systems increasingly need to combine two demanding capabilities: navigating multi-session conversation history and performing deep reading comprehension within long documents. Yet no existing benchmark evaluates both simultaneously. We introduce MemoryDocDataSet, a synthetic benchmark of 50 micro-worlds and 1,000 QA pairs in which each instance comprises 3