A Linear Classification Method in a Very High Dimensional Space Using Distributed Representation

Authors:
Takao Kobayashi;Ikuko Shimizu
Affiliations:
Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Japan 184-8588;Department of Computer and Information Sciences, Tokyo University of Agriculture and Technology, Japan 184-8588
Venue:
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Year:
2009

Citing 3
Cited 0

Bagging predictors

Machine Learning
Similarity estimation techniques from rounding algorithms

STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Pattern Recognition by Distributed Coding: Test and Analysis of the Power Space Similarity Method

IWFHR '04 Proceedings of the Ninth International Workshop on Frontiers in Handwriting Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

We have proposed a fast learning and classification method by using distributed representation of vectors. In this paper, first, we shows that our method provides faster and better performance than 1-NN method by introducing a definition of a similarity concerned with LSH scheme. Next we compare our method with the Naive Bayes with respect to the number of dimensions of features. While the Naive Bayes requires a considerably large dimensional feature space, our method achieves higher performance even where the number of dimensions of a feature space of our method is much smaller than that of Naive Bayes. We explain our method by formalizing as a linear classifier in a very high dimensional space and show it is a special case of Naive Bayes model. Experimental results show that our method provides superior classification rates with small time complexity of learning and classification and is applicable to large data set.