Entropy and Inference, Revisited (paper)

2002, The MIT Press eBooks, cited by 219

Topics: Artificial Intelligence; Algorithms and Data Compression; Bayesian Methods and Mixture Models; Machine Learning and Algorithms

Abstract

We study properties of popular near-uniform (Dirichlet) priors for learning undersampled probability distributions on discrete nonmetric spaces and show that they lead to disastrous results. However, an Occam-style phase space argument expands the priors into their infinite mixture and resolves most of the observed problems. This leads to a surprisingly good estimator of entropies of discrete distributions.

Learning a probability distribution from examples is one of the basic problems in data analysis. Common practical approaches introduce a family of parametric models, leading to questions about model selection. In Bayesian inference, computing the total probability of the data arising from a model involves an integration over parameter space, and the resulting "phase space volume" automatically discriminates against models with larger numbers of parameters, hence the description of these volume terms as Occam factors [1, 2]. As we move from finite parameterizations to models that are described by smooth functions, the integrals over parameter space become functional integrals, and methods from quantum field theory allow us to do these integrals asymptotically; again the volume in model space consistent with the data is larger for models that are smoother and hence less complex [3]. Further, at least under some conditions the relevant degree of smoothness can be determined self-consistently from the data, so
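The sensitivity to a fixed Dirichlet prior that the abstract describes can be illustrated with a minimal sketch. It assumes a symmetric Dirichlet prior with parameter beta over K bins, for which the posterior-mean probability of bin i given counts n_i out of N samples is (n_i + beta) / (N + K beta). The sample data, the value of K, the beta values, and the function names are all hypothetical and chosen only for illustration; this is not the paper's mixture-based estimator, only the fixed-beta case it criticizes.

```python
import math
from collections import Counter


def dirichlet_posterior_mean(counts, K, beta):
    """Posterior-mean probabilities under a symmetric Dirichlet(beta) prior.

    counts maps an observed bin index to its count; K is the total number of bins.
    """
    N = sum(counts.values())
    denom = N + K * beta
    return [(counts.get(i, 0) + beta) / denom for i in range(K)]


def entropy_nats(probs):
    """Shannon entropy in nats, skipping zero-probability bins."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical undersampled data: 10 samples spread over K = 1000 possible bins.
K = 1000
samples = [3, 3, 17, 42, 42, 42, 99, 512, 512, 731]
counts = Counter(samples)

for beta in (0.001, 0.5, 1.0):
    q = dirichlet_posterior_mean(counts, K, beta)
    print(f"beta={beta}: entropy of posterior mean = {entropy_nats(q):.3f} nats, "
          f"log K = {math.log(K):.3f} nats")
```

With so few samples relative to the number of bins, the entropy of the estimated distribution moves with the choice of beta rather than with the data, which is one way to see why a single fixed near-uniform prior is problematic in the undersampled regime.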

Authors

William Bialek

Fariel Shafee

Ilya Nemenman
