Structured Belief State and the First Precision-Aware Benchmark for LLM Memory Retrieval 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Structured Belief State and the First Precision-Aware Benchmark for LLM Memory Retrieval arXiv:2605.11325v2 Announce Type: replace-cross Abstract: Every major benchmark for LLM memory systems, LoCoMo foremost, measures whether a model answered correctly, not whether the memory system retrieved correctly. A system returning its entire belief store achieves recall of 1.0 and passes answer-quality evaluation. This is the difference between a unit test and an integration test: retrieval quality mus