CLASSIFICATION OF IMBALANCED DATA: A REVIEW

2009, International Journal of Pattern Recognition and Artificial Intelligence, cited by 1681
Imbalanced Data Classification Techniques; Financial Distress and Bankruptcy Prediction; Electricity Theft Detection Techniques

Abstract

An imbalanced class distribution significantly degrades the performance attainable by most standard classifier learning algorithms, which assume a relatively balanced class distribution and equal misclassification costs. This paper reviews the classification of imbalanced data with respect to: the application domains; the nature of the problem; the learning difficulties with standard classifier learning algorithms; the learning objectives and evaluation measures; the reported research solutions; and the class imbalance problem in the presence of multiple classes.
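
To illustrate why the evaluation measure matters under class imbalance, the following minimal sketch (not taken from the paper) scores a trivial classifier that always predicts the majority class. The data set, the 99:1 class ratio, and the helper names `confusion_counts` and `evaluate` are assumptions made for this example only.

```python
import math


def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp, fp, tn, fn


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy alongside measures that account for the minority class."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    accuracy = (tp + tn) / len(y_true)
    recall = tp / (tp + fn) if (tp + fn) else 0.0        # minority-class hit rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0   # majority-class hit rate
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    g_mean = math.sqrt(recall * specificity)
    return {
        "accuracy": accuracy,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
        "g_mean": g_mean,
    }


# Hypothetical data: 990 majority-class (0) and 10 minority-class (1) examples.
y_true = [0] * 990 + [1] * 10
# A trivial classifier that ignores the minority class entirely.
y_pred = [0] * 1000

metrics = evaluate(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the classifier reaches high accuracy while detecting no minority-class example, so recall, F-measure, and G-mean all fall to zero. Measures of this kind are commonly reported in place of plain accuracy when classes are imbalanced.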