Discriminative Batch Mode Active Learning 论文

2007引用 233
Machine Learning and AlgorithmsMachine Learning and Data ClassificationImbalanced Data Classification Techniques

详细信息

发表日期
2007-12-03
发表年份
2007

关键词

Machine Learning and AlgorithmsMachine Learning and Data ClassificationImbalanced Data Classification Techniques

摘要

Active learning sequentially selects unlabeled instances to label with the goal of reducing the effort needed to learn a good classifier. Most previous studies in active learning have focused on selecting one unlabeled instance to label at a time while retraining in each iteration. Recently a few batch mode active learning approaches have been proposed that select a set of most informative unlabeled instances in each iteration under the guidance of heuristic scores. In this paper, we propose a discriminative batch mode active learning approach that formulates the instance selection task as a continuous optimization problem over auxiliary instance selection variables. The optimization is formulated to maximize the discriminative classification performance of the target classifier, while also taking the unlabeled data into account. Although the objective is not convex, we can manipulate a quasi-Newton method to obtain a good local solution. Our empirical studies on UCI datasets show that the proposed active learning is more effective than current state-of-the art batch mode active learning algorithms. 1