Web page classification based on k-nearest neighbor approach
IRAL '00 Proceedings of the fifth international workshop on on Information retrieval with Asian languages
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Web-page classification through summarization
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Hi-index | 0.00 |
There are several difficulties in integrating traditional classification approaches in a search engine. This paper presents an Entity-Based Web Page Classification Algorithm, which can be embedded in search engine easily. In the algorithm, we build up an Entity System to classify web pages immediately before indexing jobs. It is an assistant system used in text feature selection and can be updated incrementally. Experimental results show its efficiency, compared to the traditional ones and has a good performance.