A statistical learning learning model of text classification for support vector machines
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Numerical Recipes in C++: the art of scientific computing
Numerical Recipes in C++: the art of scientific computing
Multispace KL for Pattern Representation and Classification
IEEE Transactions on Pattern Analysis and Machine Intelligence
Centroid-Based Document Classification: Analysis and Experimental Results
PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
What Is the Nearest Neighbor in High Dimensional Spaces?
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Automated Variable Weighting in k-Means Type Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
A Weighted Nearest Mean Classifier for Sparse Subspaces
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
IEEE Transactions on Pattern Analysis and Machine Intelligence
Locally adaptive metrics for clustering high dimensional data
Data Mining and Knowledge Discovery
An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data
IEEE Transactions on Knowledge and Data Engineering
An improved centroid classifier for text categorization
Expert Systems with Applications: An International Journal
A Probability Model for Projective Clustering on High Dimensional Data
ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
A real-coded genetic algorithm for constructive induction
CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Nearest neighbor pattern classification
IEEE Transactions on Information Theory
On the evolutionary optimization of k-NN by label-dependent feature weighting
Pattern Recognition Letters
Automated feature weighting in naive bayes for high-dimensional data classification
Proceedings of the 21st ACM international conference on Information and knowledge management
Projected-prototype based classifier for text categorization
Knowledge-Based Systems
Hi-index | 0.10 |
Text categorization presents unique challenges to traditional classification methods due to the large number of features inherent in the datasets from real-world applications of text categorization, and a great deal of training samples. In high-dimensional document data, the classes are typically categorized only by subsets of features, which are typically different for the classes of different topics. This paper presents a simple but effective classifier for text categorization using class-dependent projection based method. By projecting onto a set of individual subspaces, the samples belonging to different document classes are separated such that they are easily to be classified. This is achieved by developing a new supervised feature weighting algorithm to learn the optimized subspaces for all the document classes. The experiments carried out on common benchmarking corpuses showed that the proposed method achieved both higher classification accuracy and lower computational costs than some distinguishing classifiers in text categorization, especially for datasets including document categories with overlapping topics.