The nature of statistical learning theory
The nature of statistical learning theory
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Modern Information Retrieval
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
In Defense of One-Vs-All Classification
The Journal of Machine Learning Research
Fast Recognition of Multi-View Faces with Feature Selection
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Document-Base Extraction for Single-Label Text Classification
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
A novel probabilistic feature selection method for text classification
Knowledge-Based Systems
Hi-index | 0.00 |
In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of each feature as events, calculate the probability of occurrence of each feature by the theorem on the total probability and utilize the values as a ranking criterion. Experiments on Reuters-2000 collection show that the proposed method can yield better performance than information gain and ï戮驴-square, which are two well-known feature selection methods.