Word association norms, mutual information, and lexicography
Computational Linguistics
Elements of information theory
Elements of information theory
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss
Machine Learning - Special issue on learning with probabilistic representations
ACM SIGKDD Explorations Newsletter
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
Hi-index | 0.00 |
This paper presents a Bayes document classifier using phrases as features.Th e phrases are extracted using a grammar that iteratively applies the rules to the sequence of words in the document. This grammar is generated from a training set using statistical word association. We report an improvement in the classification over the "bag of words" representation.