Query-by-structure approach for the web

  • Authors:
  • Michael Johnson;Farshad Fotouhi;Sorin Draghici

  • Affiliations:
  • Madonna University;Wayne State University;Wayne State University

  • Venue:
  • Data mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This chapter presents three systems that incorporate document structure information into a search of the Web. These systems extend existing Web searches by allowing the user to request documents containing not only specific search words, but also to specify that documents be of a certain type. In addition to being able to search a local database (DB), all three systems are capable of dynamically querying the Web. Each system applies a query-by-structure approach that captures and utilizes structure information as well as content during a query of the Web. Two of the systems also employ neural networks (NNs) to organize the information based on relevancy of both the content and structure. These systems utilize a supervised Hamming NN and an unsupervised competitive NN, respectively. Initial testing of these systems has shown promising results when compared to straight keyword searches.