Deep Web Information Retrieval Process: A Technical Survey
International Journal of Information Technology and Web Engineering
Hi-index | 0.00 |
Due to the autonomy of web databases, a major challenge for query translation in a Deep Web Data Integration System is the lack of cost models at the global level. In this paper, we propose a Multiple-regression Cost Model (MrCoM) based on statistical analysis for global range queries that involve numeric range attributes. Using the MrCoM, the query translation strategy for new global range queries can be inferred. We also propose a Pre-processing-based Stepwise Algorithm (PSA) for selecting significant independent variables into the MrCoM. Experimental results demonstrate that the fitness of the MrCoM is good and the accuracy of the query strategy selection is high.