Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison 事件
PRODUCT_LAUNCH2026-05-29影响: MEDIUM
Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison arXiv:2605.30087v1 Announce Type: new Abstract: Emerging personal AI agents are moving toward persistent, multi-source memory. This creates an evaluation problem: systems must decide how to use conflicting or incomplete evidence; they cannot just retrieve facts from one clean history. Existing benchmarks rarely show whether an error came from the evidence given to a method or from the method's