Machine Learning
The Random Subspace Method for Constructing Decision Forests
IEEE Transactions on Pattern Analysis and Machine Intelligence
Machine Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Ensemble Methods in Machine Learning
MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
A study of the behavior of several methods for balancing machine learning training data
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Learning from imbalanced data sets with boosting and data generation: the DataBoost-IM approach
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Exploratory Under-Sampling for Class-Imbalance Learning
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Statistical Comparisons of Classifiers over Multiple Data Sets
The Journal of Machine Learning Research
Automatically countering imbalance and its empirical relationship to cost
Data Mining and Knowledge Discovery
Learning Decision Trees for Unbalanced Data
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
SMOTE: synthetic minority over-sampling technique
Journal of Artificial Intelligence Research
Learning when training data are costly: the effect of class distribution on tree induction
Journal of Artificial Intelligence Research
Roughly balanced bagging for imbalanced data
Statistical Analysis and Data Mining - Best of SDM'09
Exploiting diversity in ensembles: improving the performance on unbalanced datasets
MCS'07 Proceedings of the 7th international conference on Multiple classifier systems
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Ensembles of decision trees for imbalanced data
MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Using model trees and their ensembles for imbalanced data
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
An efficient ensemble method for classifying skewed data streams
ICIC'11 Proceedings of the 7th international conference on Intelligent Computing: bio-inspired computing and applications
A measure oriented training scheme for imbalanced classification problems
PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
Hi-index | 0.00 |
One of the more challenging problems faced by the data mining community is that of imbalanced datasets In imbalanced datasets one class (sometimes severely) outnumbers the other class, causing correct, and useful predictions to be difficult to achieve In order to combat this, many techniques have been proposed, especially centered around sampling methods In this paper we propose an ensemble framework that combines random subspaces with sampling to overcome the class imbalance problem We then experimentally verify this technique on a wide variety of datasets We conclude by analyzing the performance of the ensembles, and showing that, overall, our technique provides a significant improvement.