The nature of statistical learning theory
The nature of statistical learning theory
Robust support vector machines for classification and computational issues
Optimization Methods & Software - Systems Analysis, Optimization and Data Mining in Biomedicine
Learning when training data are costly: the effect of class distribution on tree induction
Journal of Artificial Intelligence Research
Robust feature selection for SVMs under uncertain data
ICDM'13 Proceedings of the 13th international conference on Advances in Data Mining: applications and theoretical aspects
Hi-index | 0.00 |
In this paper, we have developed a robust Support Vector Machines (SVM) scheme of classifying imbalanced and noisy data using the principles of Robust Optimization. Uncertainty is prevalent in almost all datasets and has not been addressed efficiently by most data mining techniques, as these are based on deterministic mathematical tools. Imbalanced datasets exist while performing analysis of rare events, and for such datasets elements in the minority class become critical. Our method tries to address both issues lacking in traditional SVM classifications. At present, we provide solutions for linear classification of data having bounded uncertainties. This can be extended to non-linear classification schemes for any types of uncertainties that are convex. Our results in predicting the importance of the minority class are better than the traditional SVM soft-margin classification. Preliminary computational results are presented.