Extending Link-based Algorithms for Similar Web Pages with Neighborhood Structure

  • Authors:
  • Zhenjiang Lin;Michael R. Lyu;Irwin King

  • Venue:
  • WI '07 Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2007

Abstract

The problem of finding pages similar to a given web page arises in many web applications, such as search engines. In this paper, we focus on link-based similarity measures, which compute web page similarity solely from the hyperlinks of the Web. We first propose a simple model called the Extended Neighborhood Structure (ENS), which defines a bi-directional (in-link and out-link) and multi-hop neighborhood structure. Based on the ENS model, several existing similarity measures are extended. Preliminary experimental results show that the accuracy of the extended algorithms is significantly improved.
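
To make the idea of a bi-directional, multi-hop neighborhood concrete, the following is a minimal sketch under stated assumptions, not the authors' formulation. The abstract does not give the ENS definition or the extended similarity measures, so the function names (`multi_hop_neighbors`, `reverse_graph`, `extended_neighborhood`), the `max_hops` parameter, and the toy graph are all hypothetical. The sketch only collects the pages reachable from a given page within a fixed number of hops, following out-links and in-links separately.

```python
from collections import deque


def multi_hop_neighbors(graph, start, max_hops):
    """Breadth-first search: map each page reachable from `start`
    within `max_hops` edges of `graph` to its hop distance."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if dist[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for nxt in graph.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    del dist[start]  # the page itself is not its own neighbor
    return dist


def reverse_graph(out_links):
    """Build the in-link graph by reversing every out-link."""
    in_links = {}
    for src, targets in out_links.items():
        for dst in targets:
            in_links.setdefault(dst, set()).add(src)
    return in_links


def extended_neighborhood(out_links, page, max_hops=2):
    """Hypothetical helper: out-link and in-link neighbors of `page`
    up to `max_hops` hops away (an assumption, not the paper's ENS)."""
    in_links = reverse_graph(out_links)
    return {
        "out": multi_hop_neighbors(out_links, page, max_hops),
        "in": multi_hop_neighbors(in_links, page, max_hops),
    }


# Toy link graph (made up for illustration): page -> pages it links to.
OUT_LINKS = {
    "a": {"b", "c"},
    "b": {"d"},
    "c": {"d"},
    "d": {"e"},
    "x": {"a"},
    "y": {"x"},
}

ens_a = extended_neighborhood(OUT_LINKS, "a", max_hops=2)
```

In this toy graph, page "e" is three out-link hops from "a" and is therefore excluded when `max_hops=2`. How a similarity measure would weight neighbors at different distances or directions is left open here, since the abstract does not specify it.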