Exploiting the deep web with DynaBot: matching, probing, and ranking

  • Authors:
  • Daniel Rocco; James Caverlee; Ling Liu; Terence Critchlow

  • Affiliations:
  • University of West Georgia, Carrollton, GA; Georgia Inst. of Technology, Atlanta, GA; Georgia Inst. of Technology, Atlanta, GA; Lawrence Livermore Nat'l Lab, Livermore, CA

  • Venue:
  • WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
  • Year:
  • 2005

Abstract

We present the design of DynaBot, a guided Deep Web discovery system. DynaBot's modular architecture supports focused crawling of the Deep Web, with an emphasis on matching, probing, and ranking discovered sources using two key components: service class descriptions and source-biased analysis. We describe the overall architecture of DynaBot and discuss how these components support effective exploitation of the massive amount of data available on the Deep Web.