Word association norms, mutual information, and lexicography
Computational Linguistics
Elements of information theory
Elements of information theory
Recent trends in automatic information retrieval
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Corpus-based stemming using cooccurrence of word variants
ACM Transactions on Information Systems (TOIS)
Similarity-Based Models of Word Cooccurrence Probabilities
Machine Learning - Special issue on natural language learning
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
High-performing feature selection for text classification
Proceedings of the eleventh international conference on Information and knowledge management
Automatic Text Categorization and Its Application to Text Retrieval
IEEE Transactions on Knowledge and Data Engineering
Vector space model of information retrieval: a reevaluation
SIGIR '84 Proceedings of the 7th annual international ACM SIGIR conference on Research and development in information retrieval
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Cluster ensembles: a knowledge reuse framework for combining partitionings
Eighteenth national conference on Artificial intelligence
Survey of Text Mining
Information Theory, Inference & Learning Algorithms
Information Theory, Inference & Learning Algorithms
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Feature selection with conditional mutual information maximin in text categorization
Proceedings of the thirteenth ACM international conference on Information and knowledge management
A simple feature selection method for text classification
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Nonnegative Matrix Factorization (NMF) Based Supervised Feature Selection and Adaptation
IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
A novel chinese text feature selection method based on probability latent semantic analysis
ISNN'10 Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part II
Hi-index | 0.00 |
Feature selection method for text classification based on information gain ranking, improved by removing redundant terms using mutual information measure and inclusion index, is proposed. We report an experiment to study the impact of term redundancy on the performance of text classifier. The result shows that term redundancy behaves very similar to noise and may degrade the classifier performance. The proposed method is tested on an SVM text classifier. Feature reduction by this method remarkably outperforms information gain based feature selection.