Categorizing and ranking search engine's results by semantic similarity

  • Authors:
  • Tianyong Hao;Zhi Lu;Shitong Wang;Tiansong Zou;Shenhua GU;Liu Wenyin

  • Affiliations:
  • City University of Hong Kong, Hong Kong, China;City University of Hong Kong, Hong Kong, China;City University of Hong Kong, Hong Kong, China;City University of Hong Kong, Hong Kong, China;City University of Hong Kong, Hong Kong, China;City University of Hong Kong, Hong Kong, China

  • Venue:
  • Proceedings of the 2nd international conference on Ubiquitous information management and communication
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

An automatic method for text categorizing and ranking search engine's results by semantic similarity is proposed in this paper. We first obtain nouns and verbs from snippets obtained from search engine using Name Entity Recognition and part-of speech. A semantic similarity algorithm based on WordNet is proposed to calculate the similarity of each snippet to each of the pre-defined categories. A balanced similarity ranking method combined with Google's rank and timeliness of the pages is proposed to rank these snippets. Preliminary experiments with 500 labeled questions from TREC03 show that 72.7% are correctly categorized.