Posterior probability support vector machines for unbalanced data

  • Authors:
Qing Tao; Gao-Wei Wu; Fei-Yue Wang; Jue Wang

  • Affiliations:
Key Lab. of Complex Syst. & Intelligence Sci., Chinese Acad. of Sci., Beijing, China

  • Venue:
  • IEEE Transactions on Neural Networks
  • Year:
  • 2005

Abstract

This paper proposes a complete framework of posterior probability support vector machines (PPSVMs) for weighted training samples using modified concepts of risks, linear separability, margin, and optimal hyperplane. Within this framework, a new optimization problem for unbalanced classification problems is formulated and a new concept of support vectors is established. Furthermore, a soft PPSVM with an interpretable parameter ν is obtained, which is similar to the ν-SVM developed by Schölkopf et al., and an empirical method for determining the posterior probability is proposed as a new approach to determine ν. The main advantage of a PPSVM classifier lies in the fact that it is closer to the Bayes optimal classifier without requiring knowledge of the underlying distributions. To validate the proposed method, two synthetic classification examples are used to illustrate the logical correctness of PPSVMs and their relationship to regular SVMs and Bayesian methods. Several other classification experiments demonstrate that PPSVMs outperform regular SVMs in some cases. Compared with fuzzy support vector machines (FSVMs), the proposed PPSVM is a natural and analytical extension of regular SVMs grounded in statistical learning theory.
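The core idea described above, weighting each training sample by an estimated posterior probability before solving a ν-SVM style problem, can be illustrated with a short sketch. The snippet below is not the authors' formulation: the synthetic unbalanced data, the k-NN posterior estimate, and the use of scikit-learn's NuSVC with per-sample weights are all illustrative assumptions standing in for the paper's PPSVM optimization and its empirical posterior method.

```python
# A minimal sketch (assumptions, not the paper's PPSVM): approximate
# posterior-probability weighting by estimating per-sample posteriors
# with a k-NN vote and passing them as sample weights to a nu-SVM.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import NuSVC

rng = np.random.default_rng(0)

# Synthetic unbalanced two-class data (assumed Gaussian blobs):
# 200 positive samples vs. 40 negative samples.
X_pos = rng.normal(loc=[2.0, 2.0], scale=1.0, size=(200, 2))
X_neg = rng.normal(loc=[-2.0, -2.0], scale=1.0, size=(40, 2))
X = np.vstack([X_pos, X_neg])
y = np.array([1] * 200 + [-1] * 40)

# Empirical posterior estimate via k-NN voting, one simple stand-in
# for an "empirical method for determining the posterior probability".
knn = KNeighborsClassifier(n_neighbors=15).fit(X, y)
posterior = knn.predict_proba(X)  # columns ordered by classes_: [-1, +1]

# Weight each sample by the estimated posterior of its own label,
# so ambiguous points near the class boundary count for less.
weights = posterior[np.arange(len(y)), (y == 1).astype(int)]

# A nu-SVM (in the spirit of Scholkopf et al.) trained with the
# posterior estimates applied as per-sample weights.
clf = NuSVC(nu=0.2, kernel="rbf", gamma="scale")
clf.fit(X, y, sample_weight=weights)
print("training accuracy:", clf.score(X, y))
```

Down-weighting samples whose estimated posterior is low mimics the effect the abstract attributes to PPSVMs: the decision boundary is pulled toward the Bayes-optimal one on unbalanced or noisy data, rather than being dominated by the majority class as in an unweighted SVM.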