Optimization methods in massive data sets

  • Authors:
  • P. S. Bradley;O. L. Mangasarian;D. R. Musicant

  • Affiliations:
  • Microsoft Research, Redmond, WA;Computer Sciences Dept., University of Wisconsin, Madison, WI;Computer Sciences Dept., University of Wisconsin, Madison, WI

  • Venue:
  • Handbook of massive data sets
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe the role of generalized support vector machines in separating massive and complex data using arbitrary nonlinear kernels. Feature selection that improves generalization is implemented via an effective procedure that utilizes a polyhedral norm or a concave function minimization. Massive data is separated using a linear programming chunking algorithm as well as a successive overrelaxation algorithm, each of which is capable of processing data with millions of points.