A study of the behavior of several methods for balancing machine learning training data
ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Neural Learning from Unbalanced Data
Applied Intelligence
KBA: Kernel Boundary Alignment Considering Imbalanced Data Distribution
IEEE Transactions on Knowledge and Data Engineering
Boosting for Learning Multiple Classes with Imbalanced Class Distribution
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Multi-class pattern classification using neural networks
Pattern Recognition
Forecasting of the daily meteorological pollution using wavelets and support vector machine
Engineering Applications of Artificial Intelligence
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
SVMs modeling for highly imbalanced classification
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on human computing
Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets
Soft Computing - A Fusion of Foundations, Methodologies and Applications
Engineering Applications of Artificial Intelligence
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Information Sciences: an International Journal
A Neuro-Fuzzy Inference System Through Integration of Fuzzy Logic and Extreme Learning Machines
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
IEEE Transactions on Information Theory
A comparison of methods for multiclass support vector machines
IEEE Transactions on Neural Networks
Hi-index | 0.01 |
Suspended particulate matters (PM"1"0) is considered as a harmful air pollutant. Many models attempt to predict numerical levels of PM"1"0 but a simple, clearly defined classification of PM"1"0 levels is more readily comprehensible to the general public rather than a numerical value. However, the PM"1"0 prediction model often suffers from data imbalance problem in the training dataset that results in failure to forecast the minority class of severe cases. In this study, a warning system using extreme learning machine (ELM), compared with support vector machine (SVM), was constructed to forecast the class of PM"1"0 level: Good, Moderate, and Severe. An imbalance strategy called prior duplication was also applied to improve the forecast of minority class. The experimental comparisons between ELM and SVM demonstrate that ELM produces superior accuracy relative to SVM in forecasting minority class (Severe) of PM"1"0 level with or without the imbalance strategy. Furthermore, our results show that the required training time and model size in the ELM model are much shorter and smaller than those of SVM respectively, leading to a more efficient and practical implementation of prediction model for large dataset. The performance superiority of ELM is also discussed in this paper.