An effective query relaxation solution for the deep web

  • Authors:
  • Ye Ma;Derong Shen;Yue Kou;Wei Liu

  • Affiliations:
  • Department of Computer, Northeastern University, Shenyang, China;Department of Computer, Northeastern University, Shenyang, China;Department of Computer, Northeastern University, Shenyang, China;Department of Computer, Northeastern University, Shenyang, China

  • Venue:
  • APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The information on the deep web is much more abundant than the surface web, so it is important to make the best use of it. However, in the process of query, it is difficult to avoid the so-called failed queries that make no result. Instead of notifying the user that there is no result, it is more cooperative to modify the raw query to return non-empty result set. Inspired by the observations on the deep web, this paper presents a query relaxation solution. Firstly, it applies the technique of query probing to obtain data samples from the underlying deep web databases. Based on these data samples, the important degree of attributes are obtained by employing approximate functional dependence. Secondly, the databases matching the query better are chosen and divided into some groups in terms of their schemas. Then the groups are organized into a directed acyclic graph called database relationship graph (DRG) to implement query relaxation. Finally, it returns some results satisfying the query better. We have conducted experiments to demonstrate the feasibility and the efficiency of the solution.