Neural networks and artificial intelligence
SIGCSE '89 Proceedings of the twentieth SIGCSE technical symposium on Computer science education
Selective Sampling Using the Query by Committee Algorithm
Machine Learning
One-class svms for document classification
The Journal of Machine Learning Research
Class imbalances versus small disjuncts
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
SMOTE: synthetic minority over-sampling technique
Journal of Artificial Intelligence Research
The foundations of cost-sensitive learning
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Expert Systems with Applications: An International Journal
Supervised classification with associative SOM
IWANN'03 Proceedings of the Artificial and natural neural networks 7th international conference on Computational methods in neural modeling - Volume 1
Support vector self-organizing learning for imbalanced medical data
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
An unsupervised self-organizing learning with support vector ranking for imbalanced datasets
Expert Systems with Applications: An International Journal
Using PCA to predict customer churn in telecommunication dataset
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Borderline over-sampling for imbalanced data classification
International Journal of Knowledge Engineering and Soft Data Paradigms
Distributed learning with data reduction
Transactions on computational collective intelligence IV
Computational Biology and Chemistry
A normal distribution-based over-sampling approach to imbalanced data classification
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
WSEAS Transactions on Information Science and Applications
Semi-supervised learning for imbalanced sentiment classification
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Sample cutting method for imbalanced text sentiment classification based on BRC
Knowledge-Based Systems
Computers and Electrical Engineering
Weighted Fuzzy-Possibilistic C-Means Over Large Data Sets
International Journal of Data Warehousing and Mining
Boosting weighted ELM for imbalanced learning
Neurocomputing
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - FUZZYSS'2011: 2nd International Fuzzy Systems Symposium
Hi-index | 12.05 |
For classification problem, the training data will significantly influence the classification accuracy. However, the data in real-world applications often are imbalanced class distribution, that is, most of the data are in majority class and little data are in minority class. In this case, if all the data are used to be the training data, the classifier tends to predict that most of the incoming data belongs to the majority class. Hence, it is important to select the suitable training data for classification in the imbalanced class distribution problem. In this paper, we propose cluster-based under-sampling approaches for selecting the representative data as training data to improve the classification accuracy for minority class and investigate the effect of under-sampling methods in the imbalanced class distribution environment. The experimental results show that our cluster-based under-sampling approaches outperform the other under-sampling techniques in the previous studies.