A Comparison of Several Ensemble Methods for Text Categorization

Authors:
Yan-Shi Dong;Ke-Song Han
Affiliations:
Shanghai Jiao Tong University;Motorola Labs, China Research Center
Venue:
SCC '04 Proceedings of the 2004 IEEE International Conference on Services Computing
Year:
2004

Citing 0
Cited 8

Effective spam filtering: A single-class learning and ensemble approach

Decision Support Systems
Hierarchical Text Categorization Through a Vertical Composition of Classifiers

AI*IA '07 Proceedings of the 10th Congress of the Italian Association for Artificial Intelligence on AI*IA 2007: Artificial Intelligence and Human-Oriented Computing
Selective Ensemble Algorithms of Support Vector Machines Based on Constraint Projection

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Efficient Text Classification Using Best Feature Selection and Combination of Methods

Proceedings of the Symposium on Human Interface 2009 on ConferenceUniversal Access in Human-Computer Interaction. Part I: Held as Part of HCI International 2009
Ensemble of feature sets and classification algorithms for sentiment classification

Information Sciences: an International Journal
Classifiers selection in ensembles using genetic algorithms for bankruptcy prediction

Expert Systems with Applications: An International Journal
Application of bagging, boosting and stacking to intrusion detection

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Fuzzy cognitive map ensemble learning paradigm to solve classification problems: Application to autism identification

Applied Soft Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Text categorization (TC), as an important domain of machine learning, has many unique traits, such as huge number of features, serious redundant features, dataset imbalance, etc. In this paper the various ensemble methods of naïve Bayes classifiers and SVM classifiers are experimentally compared on the TC tasks. Besides, a new type of classifiers, moderated asymmetric naïve Bayes classifiers, is proposed. Its advantages over the conventional naïve Bayes classifiers in performance and computational efficiency are demonstrated.