Fast SVM Training Algorithm with Decomposition on Very Large Data Sets

  • Authors:
  • Jian-xiong Dong, Adam Krzyzak, Ching Y. Suen

  • Affiliations:
  • -, IEEE, IEEE

  • Venue:
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Year:
  • 2005

Abstract

Training a support vector machine on a very large data set with thousands of classes is a challenging problem. This paper proposes an efficient algorithm to solve it. The key idea is to introduce a parallel optimization step that quickly removes most of the non-support vectors: block diagonal matrices are used to approximate the original kernel matrix, so the original problem can be split into hundreds of subproblems that are solved more efficiently. In addition, effective strategies such as kernel caching and efficient computation of the kernel matrix are integrated to speed up training. Our analysis shows that the time complexity of the proposed algorithm grows linearly with the number of classes and the size of the data set. The experiments investigate many appealing properties of the algorithm, and the results show that it scales much better than LIBSVM, SVMlight, and SVMTorch while also achieving good generalization performance on several large databases.
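To make the decomposition idea concrete, below is a minimal sketch of the block-diagonal filtering step, written with scikit-learn's SVC purely for readability. This is not the paper's implementation: the authors use their own optimizer with kernel caching and parallel scheduling, while the function name, `n_blocks` parameter, and the single filtering pass here are simplifications introduced for illustration.

```python
import numpy as np
from sklearn.svm import SVC  # stand-in solver; the paper uses its own optimizer

def block_filter_train(X, y, n_blocks=10, **svm_params):
    """Hypothetical sketch of the parallel decomposition step:
    approximate the kernel matrix by its block-diagonal part, solve
    each block independently to discard likely non-support vectors,
    then retrain once on the surviving candidates."""
    n = len(y)
    idx = np.random.permutation(n)  # shuffle so each block mixes classes
    candidates = []
    for block in np.array_split(idx, n_blocks):
        # Assumes every block contains at least two classes.
        sub = SVC(**svm_params).fit(X[block], y[block])
        # Support vectors of a subproblem are kept as candidates
        # for the full problem; the rest are dropped.
        candidates.extend(block[sub.support_])
    candidates = np.asarray(candidates)
    # Final pass over the (much smaller) candidate set.
    return SVC(**svm_params).fit(X[candidates], y[candidates])

# Usage (hypothetical data and parameters):
# model = block_filter_train(X_train, y_train, n_blocks=100, kernel="rbf", C=10.0)
```

Because each subproblem touches only its own block of the kernel matrix, the blocks can be solved independently (and in parallel), which is what yields the roughly linear scaling in data set size claimed in the abstract.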