Simulating the Webgraph: A Comparative Analysis of Models

  • Authors:
  • Debora Donato; Luigi Laura; Stefano Leonardi; Stefano Millozzi

  • Venue:
  • Computing in Science and Engineering
  • Year:
  • 2004

Abstract

The Webgraph is the directed graph induced by the World Wide Web's hyperlinked structure: its nodes are static HTML pages, and its edges are the hyperlinks between pages. Since the early '90s, the Web has grown exponentially, a trend we expect will continue. Today's Webgraph has several billion edges, but in spite of its size, it exhibits a well-defined structure characterized by several properties. In the past few years, several research papers have reported these properties and proposed various random graph models [1]. We simulated several of these models and compared them against a 300-million-node sample of the Webgraph provided by the Stanford WebBase project (http://www-diglib.stanford.edu/~testbed/doc2/WebBase/). All the software we developed to perform this comparison is free to download from the European Research Project COSIN Web site (www.cosin.org).
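
To make the graph definition above concrete, here is a minimal Python sketch of a Webgraph built from a page-to-links mapping, with per-page in-degree and out-degree counts. It is an illustration only, not the authors' COSIN software: the toy `links` data and the helper names `build_webgraph` and `degree_counts` are assumptions introduced for this example.

```python
from collections import Counter

# Hypothetical toy data: page -> pages it links to.
# This is NOT drawn from the WebBase sample; it only illustrates the structure.
links = {
    "a.html": ["b.html", "c.html"],
    "b.html": ["c.html"],
    "c.html": ["a.html"],
    "d.html": ["c.html"],
}


def build_webgraph(links):
    """Return (nodes, edges): pages as nodes, hyperlinks as directed edges."""
    nodes = set(links)
    edges = set()
    for source, targets in links.items():
        for target in targets:
            nodes.add(target)             # a linked page is a node even if it has no outlinks
            edges.add((source, target))   # directed edge: source links to target
    return nodes, edges


def degree_counts(edges):
    """Count incoming and outgoing hyperlinks per page."""
    in_deg = Counter(target for _, target in edges)
    out_deg = Counter(source for source, _ in edges)
    return in_deg, out_deg


nodes, edges = build_webgraph(links)
in_deg, out_deg = degree_counts(edges)
print(len(nodes), "nodes,", len(edges), "edges")
```

A set of edge pairs keeps the sketch short. A sample with hundreds of millions of nodes would presumably need a far more compact, disk-backed representation.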