Automatic integration of Web search interfaces with WISE-Integrator
The VLDB Journal — The International Journal on Very Large Data Bases
WISE-cluster: clustering e-commerce search engines automatically
Proceedings of the 6th annual ACM international workshop on Web information and data management
Combining classifiers to identify online databases
Proceedings of the 16th international conference on World Wide Web
Data integration with uncertainty
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
TODWEB: training-less ontology based deep web source classification
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
Architecture specification of rule-based deep web crawler with indexer
International Journal of Knowledge and Web Intelligence
Hi-index | 0.00 |
There are a lot of pages on internet that are generated dynamically by the back-end database and the traditional searching engines can't reach these pages, which are called Deep Web. These sources are structured and provide structured query interfaces and results. Organizing structured Deep Web sources by their domain can let users browse these valuable resources and is one of the critical steps toward the large-scale Deep Web information integration. We propose a new strategy that automatically and accurately classifies Deep Web sources based on the form link graph, which can be easily constructed from web forms, and apply Fuzzy partition technique which is proved to be better suited for the features of Deep Web. Experiments using real Deep Web data show that our approach provides an effective and scalable solution for organizing Deep Web sources.