Instance-Based Learning Algorithms
Machine Learning
C4.5: programs for machine learning
C4.5: programs for machine learning
Reduction Techniques for Instance-BasedLearning Algorithms
Machine Learning
A study of the behavior of several methods for balancing machine learning training data
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Learning Greek verb complements: addressing the class imbalance
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Learning verb complements for modern greek: Balancing the noisy dataset
Natural Language Engineering
The effect of borderline examples on language learning
Journal of Experimental & Theoretical Artificial Intelligence
On metricity of two heterogeneous measures in the presence of missing values
Artificial Intelligence Review
Consistency Measure of Multiple Classifiers for Land Cover Classification by Remote Sensing Image
MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
Evolutionary undersampling for classification with imbalanced datasets: Proposals and taxonomy
Evolutionary Computation
Classification of Imbalanced Data Sets by Using the Hybrid Re-sampling Algorithm Based on Isomap
ISICA '09 Proceedings of the 4th International Symposium on Advances in Computation and Intelligence
SERA: selectively recursive approach towards nonstationary imbalanced stream data mining
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Using XCS to describe continuous-valued problem spaces
IWLCS'03-05 Proceedings of the 2003-2005 international conference on Learning classifier systems
An asymmetric classifier based on partial least squares
Pattern Recognition
Classification of imbalanced data by combining the complementary neural network and SMOTE algorithm
ICONIP'10 Proceedings of the 17th international conference on Neural information processing: models and applications - Volume Part II
Compact ensemble trees for imbalanced data
MCS'11 Proceedings of the 10th international conference on Multiple classifier systems
Expert Systems with Applications: An International Journal
Improved response modeling based on clustering, under-sampling, and ensemble
Expert Systems with Applications: An International Journal
A novel synthetic minority oversampling technique for imbalanced data set learning
ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
A proposal of evolutionary prototype selection for class imbalance problems
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Preprocessing unbalanced data using support vector machine
Decision Support Systems
WSEAS Transactions on Information Science and Applications
An efficient and simple under-sampling technique for imbalanced time series classification
Proceedings of the 21st ACM international conference on Information and knowledge management
Improving risk predictions by preprocessing imbalanced credit data
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part II
Evaluation of sampling methods for learning from imbalanced data
ICIC'13 Proceedings of the 9th international conference on Intelligent Computing Theories
Multimedia Tools and Applications
Hi-index | 0.00 |
We studied three methods to improve identification of difficult small classes by balancing imbalanced class distribution with data reduction. The new method, neighborhood cleaning rule (NCL), outperformed simple random and one-sided selection methods in experiments with ten data sets. All reduction methods improved identification of small classes (20-30%), but the differences were insignificant. However, significant differences in accuracies, true-positive rates and true-negative rates obtained with the 3-nearest neighbor method and C4.5 from the reduced data favored NCL. The results suggest that NCL is a useful method for improving the modeling of difficult small classes, and for building classifiers to identify these classes from the real-world data.