The nature of statistical learning theory
The nature of statistical learning theory
Hierarchical classification of Web content
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Hierarchical Text Classification and Evaluation
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Learning hierarchical multi-category text classification models
ICML '05 Proceedings of the 22nd international conference on Machine learning
Hierarchical classification: combining Bayes with SVM
ICML '06 Proceedings of the 23rd international conference on Machine learning
Classifying imbalanced data using a bagging ensemble variation (BEV)
ACM-SE 45 Proceedings of the 45th annual southeast regional conference
LIBSVM: A library for support vector machines
ACM Transactions on Intelligent Systems and Technology (TIST)
Hi-index | 0.00 |
In this paper, a novel method for web page hierarchical classification is addressed. In our approach, SVM is used as the basic algorithm to separate any two sub-categories under the same parent node. In order to alleviate the ill shift of SVM classifier caused by imbalanced training data, we try to combine the original SVM classifier with BEV algorithm to create classifier called VOTEM. Then, a web document is assigned to a sub-category based on voting from all category-to-category classifiers. This hierarchical classification algorithm starts its work from the top of the hierarchical tree downward recursively until it triggers a stop condition or reaches the leaf nodes. And our experiment reveals that proposed algorithm obtains better results.