Toward Optimal Active Learning through Sampling Estimation of Error Reduction
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
One-class svms for document classification
The Journal of Machine Learning Research
Automatically Labeling Video Data Using Multi-class Active Learning
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Editorial: special issue on learning from imbalanced data sets
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Active learning using pre-clustering
ICML '04 Proceedings of the twenty-first international conference on Machine learning
ICML '06 Proceedings of the 23rd international conference on Machine learning
Confidence-Based Active Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-evidence, multi-criteria, lazy associative document classification
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
ML-KNN: A lazy learning approach to multi-label learning
Pattern Recognition
A bound on the label complexity of agnostic active learning
Proceedings of the 24th international conference on Machine learning
Active learning for class imbalance problem
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Cost-sensitive boosting for classification of imbalanced data
Pattern Recognition
Learning on the border: active learning in imbalanced data classification
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Hierarchical sampling for active learning
Proceedings of the 25th international conference on Machine learning
Active learning with direct query construction
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Lazy Bagging for Classifying Imbalanced Data
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Semisupervised SVM batch mode active learning with applications to image retrieval
ACM Transactions on Information Systems (TOIS)
Importance weighted active learning
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Analysis of Perceptron-Based Active Learning
The Journal of Machine Learning Research
IEEE Transactions on Knowledge and Data Engineering
A Multi-Strategy Approach to KNN and LARM on Small and Incrementally Induced Prediction Knowledge
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
SMOTE: synthetic minority over-sampling technique
Journal of Artificial Intelligence Research
SVMs modeling for highly imbalanced classification
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on human computing
Active learning in multimedia annotation and retrieval: A survey
ACM Transactions on Intelligent Systems and Technology (TIST)
Active Learning from Multiple Noisy Labelers with Varied Costs
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Active Learning with Human-Like Noisy Oracle
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Finding Local Anomalies in Very High Dimensional Space
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Rare Category Characterization
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Exploiting Local Data Uncertainty to Boost Global Outlier Detection
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Certainty-Enhanced Active Learning for Improving Imbalanced Data Classification
ICDMW '11 Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops
Interactive Video Indexing With Statistical Active Learning
IEEE Transactions on Multimedia
Hi-index | 0.01 |
Active learning is to learn an accurate classifier within as few queried labels as possible. For practical applications, we propose a Certainty-Based Active Learning (CBAL) algorithm to solve the imbalanced data classification problem in active learning. Without being affected by irrelevant samples which might overwhelm the minority class, the importance of each unlabeled sample is carefully measured within an explored neighborhood. For handling the agnostic case, IWAL-ERM is integrated into our approach without costs. Thus our CBAL is designed to determine the query probability within an explored neighborhood for each unlabeled sample. The potential neighborhood is incrementally explored, and there is no need to define the neighborhood size in advance. In our theoretical analysis, it is presented that CBAL has a polynomial label query improvement over passive learning. And the experimental results on synthetic and real-world datasets show that, CBAL has the ability of identifying informative samples and dealing with the imbalanced data classification problem in active learning.