Entity-Based Classification of Web Page in Search Engine

  • Authors:
  • Yicen Liu;Mingrong Liu;Liang Xiang;Qing Yang

  • Affiliations:
  • Institute of Automation, Chinese Academy of Sciences, Beijing, China 100190;Institute of Automation, Chinese Academy of Sciences, Beijing, China 100190;Institute of Automation, Chinese Academy of Sciences, Beijing, China 100190;Institute of Automation, Chinese Academy of Sciences, Beijing, China 100190

  • Venue:
  • ICADL 08 Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

There are several difficulties in integrating traditional classification approaches in a search engine. This paper presents an Entity-Based Web Page Classification Algorithm, which can be embedded in search engine easily. In the algorithm, we build up an Entity System to classify web pages immediately before indexing jobs. It is an assistant system used in text feature selection and can be updated incrementally. Experimental results show its efficiency, compared to the traditional ones and has a good performance.