Learning to Trust: Bayesian Adaptation to Varying Suggester Reliability in Sequential Decision Making 文章

ArXiv CS.AI2026-05-26NEWSen作者: Dylan M. Asmar, Mykel J. Kochenderfer

摘要

arXiv:2511.12378v2 Announce Type: replace Abstract: Autonomous agents operating in sequential decision-making tasks under uncertainty can benefit from external action suggestions, which provide valuable guidance but inherently vary in reliability. Existing methods for incorporating such advice typically assume static and known suggester quality parameters, limiting practical deployment. We introduce a framework that dynamically learns and adapts to varying suggester reliability in partially observable environments. First, we integrate suggester quality directly into the agent's belief representation, enabling agents to infer and adjust their reliance on suggestions through Bayesian inference over suggester types. Second, we introduce an explicit ``ask'' action allowing agents to strategically request suggestions at critical moments, balancing informational gains against acquisition costs.