Organizing Structured Deep Web by Clustering Query Interfaces Link Graph

  • Authors:
  • Pengpeng Zhao; Li Huang; Wei Fang; Zhiming Cui

  • Affiliations:
  • Jiangsu Key Laboratory of Computer Information Processing Technology, Soochow University, Suzhou, China 215006 (all authors)

  • Venue:
  • ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
  • Year:
  • 2008

Abstract

Many pages on the Internet are generated dynamically from back-end databases and cannot be reached by traditional search engines; these pages are called the Deep Web. Deep Web sources are structured and provide structured query interfaces and results. Organizing structured Deep Web sources by domain lets users browse these valuable resources and is a critical step toward large-scale Deep Web information integration. We propose a new strategy that automatically and accurately classifies Deep Web sources based on the form link graph, which can be easily constructed from web forms, and we apply a fuzzy partition technique, which proves to be better suited to the characteristics of the Deep Web. Experiments on real Deep Web data show that our approach provides an effective and scalable solution for organizing Deep Web sources.
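
The abstract does not say which fuzzy partition algorithm is used or how the form link graph is turned into features. As a rough illustration of the general idea only, the sketch below runs fuzzy c-means (one standard fuzzy partition method, chosen here as an assumption) over made-up two-dimensional feature vectors. The function name `fuzzy_c_means`, the parameters, and the toy data are all hypothetical and are not taken from the paper.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Illustrative fuzzy c-means; returns (centers, memberships).

    points: list of equal-length feature vectors (hypothetical form features).
    m: fuzzifier (> 1); larger values give softer memberships.
    """
    rng = random.Random(seed)
    dim = len(points[0])

    # Random initial memberships; each row is normalized to sum to 1.
    u = []
    for _ in points:
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Cluster centers are membership-weighted means of the points.
        centers = []
        for k in range(n_clusters):
            weights = [u[i][k] ** m for i in range(len(points))]
            wsum = sum(weights)
            centers.append(
                [sum(w * p[d] for w, p in zip(weights, points)) / wsum
                 for d in range(dim)]
            )

        # Memberships are proportional to distance ** (-2 / (m - 1)).
        new_u = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if any(d == 0.0 for d in dists):
                # A point sitting exactly on a center belongs fully to it.
                hits = [1.0 if d == 0.0 else 0.0 for d in dists]
                new_u.append([h / sum(hits) for h in hits])
                continue
            inv = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(inv)
            new_u.append([v / total for v in inv])

        shift = max(abs(a - b) for ra, rb in zip(u, new_u) for a, b in zip(ra, rb))
        u = new_u
        if shift < tol:
            break
    return centers, u


# Toy data: two tight groups plus one point lying between them.
forms = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [0.5, 0.5]]
centers, memberships = fuzzy_c_means(forms, n_clusters=2)
for vec, row in zip(forms, memberships):
    print(vec, [round(v, 3) for v in row])
```

Unlike a hard partition, each source receives a degree of membership in every cluster, so a source that spans two domains (like the in-between point in the toy data) is not forced into a single one.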