Both imbalanced data and class noise are problems that have received attention in data mining research; however, learning from imbalanced data with labeling errors has not been adequately addressed. We present systematic experimentation on imbalanced datasets with simulated class noise and evaluate the impact on various classification algorithms. Our results show that class noise is a significant detriment to learning from skewed data, but more importantly, we demonstrate that the class in which the noise is located is critical. This has significant repercussions for noise treatment procedures, which often handle noise equally in both classes. In addition, an examination of 11 classifiers demonstrates that the learners react very differently when confronted with class noise.
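To make the notion of class-specific simulated noise concrete, the following is a minimal sketch of how label noise might be injected into one chosen class of an imbalanced dataset. The abstract does not describe the actual noise-injection procedure, so the function name `inject_class_noise`, its parameters, the 95:5 toy class ratio, and the 40% noise rate are all illustrative assumptions rather than the authors' protocol.

```python
import random


def inject_class_noise(labels, source_class, target_class, rate, seed=0):
    """Return a copy of `labels` in which a fraction `rate` of the examples
    labeled `source_class` are relabeled as `target_class`.

    Hypothetical helper for illustration; not the procedure used in the paper.
    """
    rng = random.Random(seed)
    # Indices of all examples currently carrying the source label.
    candidates = [i for i, y in enumerate(labels) if y == source_class]
    n_flip = round(rate * len(candidates))
    flipped = set(rng.sample(candidates, n_flip))
    return [target_class if i in flipped else y for i, y in enumerate(labels)]


# Toy imbalanced label vector (assumed): 95 majority (0) and 5 minority (1).
clean = [0] * 95 + [1] * 5

# Noise located in the minority class: true positives mislabeled as negative.
minority_noise = inject_class_noise(clean, source_class=1, target_class=0, rate=0.4)

# Noise located in the majority class: true negatives mislabeled as positive.
majority_noise = inject_class_noise(clean, source_class=0, target_class=1, rate=0.4)

print(sum(minority_noise), sum(majority_noise))
```

At the same nominal rate, the two settings change the apparent minority class in opposite directions: the first removes positives that are already scarce, while the second adds mislabeled negatives to the positive class. This is one way to see why a treatment that handles noise equally in both classes may be inappropriate for skewed data.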