A POMDP formulation of preference elicitation problems (Paper)

2002 · Citations: 275

Artificial Intelligence · Multi-Criteria Decision Making · Machine Learning and Algorithms · Bayesian Modeling and Causal Inference

Abstract

Preference elicitation is a key problem facing the deployment of intelligent systems that make or recommend decisions on the behalf of users. Since not all aspects of a utility function have the same impact on object-level decision quality, determining which information to extract from a user is itself a sequential decision problem, balancing the amount of elicitation effort and time with decision quality. We formulate this problem as a partially-observable Markov decision process (POMDP). Because of the continuous nature of the state and action spaces of this POMDP, standard techniques cannot be used to solve it. We describe methods that exploit the special structure of preference elicitation to deal with parameterized belief states over the continuous state space, and gradient techniques for optimizing parameterized actions. These methods can be used with a number of different belief state representations, including mixture models.

Author

Craig Boutilier
