The Viúva Negra crawler: an experience report

  • Authors:
  • Daniel Gomes;Mário J. Silva

  • Affiliations:
  • Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, Edificio C6, piso 3, Campo Grande, 1749-016 Lisboa, Portugal;Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa, Edificio C6, piso 3, Campo Grande, 1749-016 Lisboa, Portugal

  • Venue:
  • Software—Practice & Experience
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper documents hazardous situations on the Web that crawlers must address. This knowledge was accumulated while developing and operating the Viúva Negra (VN) crawler to feed a search engine and a Web archive for the Portuguese Web for four years. The design, implementation and evaluation of the VN crawler are also presented as a case study of a Web crawler design. The case study tested provides crawling techniques that may be useful for the further development of crawlers. Copyright © 2007 John Wiley & Sons, Ltd.