Fast support vector machines for continuous data

  • Authors:
  • Kurt A. Kramer;Lawrence O. Hall;Dmitry B. Goldgof;Andrew Remsen;Tong Luo

  • Affiliations:
  • Department of Computer Science and Engineering, University of South Florida, Tampa, FL;Department of Computer Science and Engineering, University of South Florida, Tampa, FL;Department of Computer Science and Engineering, University of South Florida, Tampa, FL;College of Marine Science, University of South Florida, St. Petersburg, FL;Searchspark, Palo Alto, CA

  • Venue:
  • IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on cybernetics and cognitive informatics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Support vector machines (SVMs) can be trained to be very accurate classifiers and have been used in many applications. However, the training time and, to a lesser extent, prediction time of SVMs on very large data sets can be very long. This paper presents a fast compression method to scale up SVMs to large data sets. A simple bit-reduction method is applied to reduce the cardinality of the data by weighting representative examples. We then develop SVMs trained on the weighted data. Experiments indicate that bit-reduction SVM produces a significant reduction in the time required for both training and prediction with minimum loss in accuracy. It is also shown to typically be more accurate than random sampling when the data are not overcompressed.