BoosTexter: A Boosting-based Systemfor Text Categorization
Machine Learning - Special issue on information retrieval
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
The role of domain information in Word Sense Disambiguation
Natural Language Engineering
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Extracting multiword expressions with a semantic tagger
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Hownet And the Computation of Meaning
Hownet And the Computation of Meaning
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Semisupervised Learning for Computational Linguistics
Semisupervised Learning for Computational Linguistics
Construction of domain dictionary for fundamental vocabulary
ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Text classification by labeling words
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Topic classification of blog posts using distant supervision
Proceedings of the Workshop on Semantic Analysis in Social Media
Hi-index | 0.00 |
This paper presents an approach to text categorization that i) uses no machine learning and ii) reacts on-the-fly to unknown words. These features are important for categorizing Blog articles, which are updated on a daily basis and filled with newly coined words. We categorize 600 Blog articles into 12 domains. As a result, our categorization method achieved an accuracy of 94.0% (564/600).