An example-based mapping method for text categorization and retrieval
ACM Transactions on Information Systems (TOIS)
Pivoted document length normalization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Classifying web documents in a hierarchy of categories: a comprehensive study
Journal of Intelligent Information Systems
Text classification: a recent overview
ICCOMP'05 Proceedings of the 9th WSEAS International Conference on Computers
Expert Systems with Applications: An International Journal
Using the self organizing map for clustering of text documents
Expert Systems with Applications: An International Journal
A hybrid classification method of k nearest neighbor, Bayesian methods and genetic algorithm
Expert Systems with Applications: An International Journal
Automatically computed document dependent weighting factor facility for Naïve Bayes classification
Expert Systems with Applications: An International Journal
What have fruits to do with technology?: the case of Orange, Blackberry and Apple
Proceedings of the International Conference on Web Intelligence, Mining and Semantics
K nearest neighbor reinforced expectation maximization method
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
Techniques for improving the performance of naive bayes for text classification
CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Expert Systems with Applications: An International Journal
CatStream: categorising tweets for user profiling and stream filtering
Proceedings of the 2013 international conference on Intelligent user interfaces
Projected-prototype based classifier for text categorization
Knowledge-Based Systems
Hi-index | 0.01 |
Though naive Bayes text classifiers are widely used because of its simplicity, the techniques for improving performances of these classifiers have been rarely studied. In this paper, we propose and evaluate some general and effective techniques for improving performance of the naive Bayes text classifier. We suggest document model based parameter estimation and document length normalization to alleviate the problems in the traditional multinomial approach for text classification. In addition, Mutual-Information-weighted naive Bayes text classifier is proposed to increase the effect of highly informative words. Our techniques are evaluated on the Reuters21578 and 20 Newsgroups collections, and significant improvements are obtained over the existing multinomial naive Bayes approach.