A large time-aware web graph

  • Authors:
  • Paolo Boldi;Massimo Santini;Sebastiano Vigna

  • Affiliations:
  • Università di Milano, Italy;Università di Milano, Italy;Università di Milano, Italy

  • Venue:
  • ACM SIGIR Forum
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe the techniques developed to gather and distribute in a highly compressed, yet accessible, form a series of twelve snapshot of the .uk web domain. Ad hoc compression techniques made it possible to store the twelve snapshots using just 1:9 bits per link, with constant-time access to temporal information. Our collection makes it possible to study the temporal evolution link-based scores (e.g., PageRank), the growth of online communities, and in general time-dependent phenomena related to the link structure.