Bayesian Regularization and Pruning Using a Laplace Prior 论文

1995Neural Computation引用 388

Neural Networks and ApplicationsStatistical Mechanics and EntropyGaussian Processes and Bayesian Inference

人工智能 Neural Networks and Applications Gaussian Processes and Bayesian Inference Statistical Mechanics and Entropy

作者

摘要

Standard techniques for improved generalization from neural networks include weight decay and pruning. Weight decay has a Bayesian interpretation with the decay function corresponding to a prior over weights. The method of transformation groups and maximum entropy suggests a Laplace rather than a gaussian prior. After training, the weights then arrange themselves into two classes: (1) those with a common sensitivity to the data error and (2) those failing to achieve this sensitivity and that therefore vanish. Since the critical value is determined adaptively during training, pruning—in the sense of setting weights to exact zeros—becomes an automatic consequence of regularization alone. The count of free parameters is also reduced automatically as weights are pruned. A comparison is made with results of MacKay using the evidence framework and a gaussian regularizer.

作者查看全部 (1)

Peter M. Williams

Bayesian Regularization and Pruning Using a Laplace Prior 论文

摘要

作者查看全部 (1)

相关技术查看全部 (2)

相关事件

相关文章