Design of a crawler with bounded bandwidth

  • Authors:
  • Michelangelo Diligenti;Marco Maggini;Filippo Maria Pucci;Franco Scarselli

  • Affiliations:
  • Università di Siena Via Roma, Siena, Italy;Università di Siena Via Roma, Siena, Italy;Università di Siena Via Roma, Siena, Italy;Università di Siena Via Roma, Siena, Italy

  • Venue:
  • Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an algorithm to bound the bandwidth of a Web crawler. The crawler collects statistics on the transfer rate of each server to predict the expected bandwidth use for future downloads. The prediction allows us to activate the optimal number of fetcher threads in order to exploit the assigned bandwidth. The experimental results show the effectiveness of the proposed technique.