Trivium: Temporal Regret as a First-Class Objective for Causal-Memory Controllers

arXiv cs.AI | 2026-06-04 | Author: Edward Y. Chang

Abstract

arXiv:2606.04421v1

Many current agentic systems and LLM pipelines correct mistakes by optimizing outcome reward. This addresses only the what of failure: when an outcome diverges from prediction, the why and when of the mismatch are not systematically logged, reviewed, or corrected, so the same error can recur episode after episode. We argue that this is a structural problem, not merely a model-capacity one. We propose long-horizon temporal regret as a first-class objective alongside outcome regret and epistemic regret over the working causal model. Temporal regret captures when failure persists: how long a miscalibrated causal model is tolerated before correction. Epistemic regret captures why failure persists: residual uncertainty or error in the working causal model. Together, the three regrets give a falsifiable account of what, why, and when a long-lived agent can fail.
