On building a search interface discovery system

  • Authors:
  • Denis Shestakov

  • Affiliations:
  • Department of Media Technology, Aalto University, Espoo, Finland

  • Venue:
  • RED'09 Proceedings of the 2nd international conference on Resource discovery
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A huge portion of the Web known as the deep Web is accessible via search interfaces to myriads of databases on the Web. While relatively good approaches for querying the contents of web databases have been recently proposed, one cannot fully utilize them having most search interfaces unlocated. Thus, the automatic recognition of search interfaces to online databases is crucial for any application accessing the deep Web. This paper describes the architecture of the I-Crawler, a system for finding and classifying search interfaces. The I-Crawler is intentionally designed to be used in the deep web characterization surveys and for constructing directories of deep web resources.