This paper proposes using active learning to address the problem of class imbalance in machine learning classification tasks. The key ideas are:
1) Active learning selects the most informative examples to label, which tend to be instances closest to the decision boundary. This helps provide a more balanced sample to the learner.
2) An online support vector machine (SVM) algorithm is used to allow efficient integration of newly labeled examples without retraining on the entire dataset.
3) Early stopping criteria based on support vectors are introduced to determine when enough examples have been labeled.
Empirical results on imbalanced datasets demonstrate that the active learning approach leads to improved classification performance compared to traditional supervised learning methods.