Using Bloom filters to speed up HITS-like ranking algorithms

  • Authors:
  • Sreenivas Gollapudi;Marc Najork;Rina Panigrahy

  • Affiliations:
  • Microsoft Research, Mountain View, CA;Microsoft Research, Mountain View, CA;Microsoft Research, Mountain View, CA

  • Venue:
  • WAW'07 Proceedings of the 5th international conference on Algorithms and models for the web-graph
  • Year:
  • 2007

Abstract

This paper describes a technique for reducing the query-time cost of HITS-like ranking algorithms. The basic idea is to compute for each node in the web graph a summary of its immediate neighborhood (which is a query-independent operation and thus can be done off-line), and to approximate the neighborhood graph of a result set at query time by combining the summaries of the result set nodes. This approximation of the query-specific neighborhood graph can then be used to perform query-dependent link-based ranking algorithms such as HITS and SALSA. We have evaluated our technique on a large web graph and a substantial set of queries with partially judged results, and found that its effectiveness (retrieval performance) is comparable to that of the original SALSA algorithm, while its efficiency (query-time speed) is substantially higher.
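
The abstract does not spell out how the summaries are built or combined, so the following Python sketch is only one possible reading of the idea, not the authors' implementation. It assumes each node's summary is a Bloom filter over its out-neighbors, and that an edge u -> v is placed in the approximate graph whenever v tests positive in u's filter. The names BloomFilter, build_summaries, approximate_edges and hits, as well as the filter size and hash count, are hypothetical choices made for illustration.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter; the bit array is stored in a Python int."""

    def __init__(self, num_bits=256, num_hashes=3):
        self.num_bits = num_bits      # illustrative size, not from the paper
        self.num_hashes = num_hashes  # illustrative count, not from the paper
        self.bits = 0

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def build_summaries(out_links):
    """Off-line step (assumed): one filter of out-neighbors per node."""
    summaries = {}
    for node, targets in out_links.items():
        bf = BloomFilter()
        for target in targets:
            bf.add(target)
        summaries[node] = bf
    return summaries


def approximate_edges(result_set, summaries):
    """Query-time step (assumed): u -> v if v tests positive in u's filter."""
    return {
        (u, v)
        for u in result_set
        for v in result_set
        if u != v and u in summaries and v in summaries[u]
    }


def hits(nodes, edges, iterations=20):
    """Plain HITS power iteration on the approximate graph."""
    auth = {n: 1.0 for n in nodes}
    hub = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        auth = {n: sum(hub[u] for u, v in edges if v == n) for n in nodes}
        norm = sum(a * a for a in auth.values()) ** 0.5 or 1.0
        auth = {n: a / norm for n, a in auth.items()}
        hub = {n: sum(auth[v] for u, v in edges if u == n) for n in nodes}
        norm = sum(h * h for h in hub.values()) ** 0.5 or 1.0
        hub = {n: h / norm for n, h in hub.items()}
    return auth, hub


# Toy graph and result set, invented for this example.
out_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
result_set = ["a", "b", "c", "d"]
summaries = build_summaries(out_links)
edges = approximate_edges(result_set, summaries)
auth, hub = hits(result_set, edges)
```

Because a Bloom filter can report false positives but never false negatives, the approximate edge set in this sketch may contain spurious edges but always includes every true edge between result set nodes. That is one way such an approximation could trade a little accuracy for avoiding query-time lookups in the full web graph.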