Boosting as a Regularized Path to a Maximum Margin Classifier 论文

2004引用 229
Sparse and Compressive Sensing TechniquesMachine Learning and AlgorithmsFace and Expression Recognition

摘要

In this paper we study boosting methods from a new perspective. We build on recent work by Efron et al. to show that boosting approximately (and in some cases exactly) minimizes its loss criterion with an l 1 constraint on the coefficient vector. This helps understand the success of boosting with early stopping as regularized fitting of the loss criterion. For the two most commonly used criteria (exponential and binomial log-likelihood), we further show that as the constraint is relaxed---or equivalently as the boosting iterations proceed---the solution converges (in the separable case) to an "l 1 -optimal" separating hyper-plane. We prove that this l 1 -optimal separating hyper-plane has the property of maximizing the minimal l 1 -margin of the training data, as defined in the boosting literature.