Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning 文章

ArXiv CS.AI2026-06-04NEWSen作者: Viktor Vesel\'y, Aleksandar Todorov, Erwan Escudie, Matthia Sabatelli

Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning · 相关技术