Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Instance-Based Learning Algorithms
Machine Learning
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory
The nature of statistical learning theory
Training algorithms for linear text classifiers
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Reduction Techniques for Instance-BasedLearning Algorithms
Machine Learning
Learning to construct knowledge bases from the World Wide Web
Artificial Intelligence - Special issue on Intelligent internet systems
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
A Study of Approaches to Hypertext Categorization
Journal of Intelligent Information Systems
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Selecting Typical Instances in Instance-Based Learning
ML '92 Proceedings of the Ninth International Workshop on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Leave-One-Out Support Vector Machines
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Using unlabeled data to improve text classification
Using unlabeled data to improve text classification
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Improving text categorization using the importance of sentences
Information Processing and Management: an International Journal
Automatic text categorization by unsupervised learning
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Dictionary-based text categorization of chemical web pages
Information Processing and Management: an International Journal
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Information Processing and Management: an International Journal
Information and Software Technology
Hi-index | 0.00 |
This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training documents on each feature. The voting for a classification is processed on the basis of individual feature projections. The final classification of test documents is determined by a majority voting from the individual classifications of each feature. Our empirical results show that the proposed approach, text categorization using feature projections (TCFP), outperforms k-NN, Rocchio, and Naive Bayes. Most of all, TCFP is a faster classifier, up to one hundred times faster than k-NN in the Newsgroups data set. It is also robust from noisy data. Since the TCFP algorithm is very simple, its implementation and training process can be done very easily. For these reasons, TCFP can be a useful classifier in text categorization tasks, which need fast execution speed, robustness, and high performance.