Query interface schema extraction is an important problem in Deep Web data acquisition and integration. Obtaining the schema of a query interface first requires correctly associating the interface's control elements with their labels. Because a query interface in an HTML page can be parsed into a well-structured DOM, we propose an algorithm that associates elements and labels of Deep Web query interfaces based on the hierarchical DOM. For a given control element, the algorithm selects the most closely related label using nearest-neighbor distance together with two additional heuristic rules. Experimental results on real query interfaces show that the proposed algorithm is highly effective.
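To make the nearest-neighbor idea concrete, the following is a minimal sketch and not the paper's algorithm. It assumes that "distance" means the number of edges between two nodes in the DOM tree, that candidate labels are plain text nodes, and that ties go to the earliest label in document order. The two additional heuristic rules are not described in the abstract and are therefore not modeled. All names (`Node`, `TreeBuilder`, `tree_distance`, `associate_labels`) are hypothetical.

```python
from html.parser import HTMLParser

CONTROL_TAGS = {"input", "select", "textarea"}
VOID_TAGS = {"input", "br", "img", "hr", "meta", "link"}
# Assumption: text inside these elements is never a field label.
NON_LABEL_PARENTS = {"option", "select", "textarea", "script", "style"}
# Assumption: these input types do not need a label.
SKIPPED_INPUT_TYPES = {"hidden", "submit", "button", "reset"}


class Node:
    """A very small DOM node: tag, attributes, text, parent, children."""

    def __init__(self, tag, parent=None, attrs=None, text=""):
        self.tag = tag
        self.parent = parent
        self.attrs = dict(attrs or [])
        self.text = text
        self.children = []
        if parent is not None:
            parent.children.append(self)


class TreeBuilder(HTMLParser):
    """Builds a Node tree from HTML using only the standard library."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current, attrs)
        if tag not in VOID_TAGS:  # void elements never have children
            self.current = node

    def handle_endtag(self, tag):
        # Climb to the nearest open element with this tag, if there is one.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        text = data.strip()
        if text and self.current.tag not in NON_LABEL_PARENTS:
            Node("#text", self.current, text=text)


def ancestors(node):
    """Return the path from node up to the root, node first."""
    path = []
    while node is not None:
        path.append(node)
        node = node.parent
    return path


def tree_distance(a, b):
    """Number of edges between a and b via their lowest common ancestor."""
    steps_from_a = {id(n): i for i, n in enumerate(ancestors(a))}
    for j, n in enumerate(ancestors(b)):
        if id(n) in steps_from_a:
            return steps_from_a[id(n)] + j
    raise ValueError("nodes are not in the same tree")


def walk(node):
    """Yield nodes in document order."""
    yield node
    for child in node.children:
        yield from walk(child)


def associate_labels(html):
    """Map each control's name to the text of its nearest label candidate."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    nodes = list(walk(builder.root))
    labels = [n for n in nodes if n.tag == "#text"]
    controls = [
        n for n in nodes
        if n.tag in CONTROL_TAGS
        and n.attrs.get("type", "").lower() not in SKIPPED_INPUT_TYPES
    ]
    result = {}
    for control in controls:
        name = control.attrs.get("name", control.tag)
        nearest = min(labels, key=lambda t: tree_distance(control, t), default=None)
        result[name] = nearest.text if nearest else None
    return result


FORM = """
<form><table>
  <tr><td>Title</td><td><input name="title"></td></tr>
  <tr><td>Author</td><td><input name="author"></td></tr>
</table></form>
"""
print(associate_labels(FORM))
```

In this example each input sits in the same table row as its label, so the path through the shared `tr` is shorter than any path through `table` to a label in another row. Real interfaces often place labels in positions where pure tree distance is ambiguous, which is presumably where additional heuristics become necessary.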