WebGraph: things you thought you could not do with Java™

  • Authors:
  • Paolo Boldi;Sebastiano Vigna

  • Affiliations:
  • Università degli Studi di Milano;Università degli Studi di Milano

  • Venue:
  • Proceedings of the 3rd international symposium on Principles and practice of programming in Java
  • Year:
  • 2004

Quantified Score

Hi-index 0.01

Visualization

Abstract

Studying web graphs is often difficult due to their large size. The WebGraph framework is a suite of codes, algorithms and tools that make it easy to manipulate large web graphs, and to store them in a limited space, by exploiting the inner redundancies of the web. WebGraph is based on sophisticated bitwise compression techniques, and functional-style lazy constructions. Common wisdom would say that the most unlikely language to implement such a framework is Java. We are going to tell you the real story.