Proceedings of the 27th International Conference on Very Large Data Bases
Statistical schema matching across web query interfaces
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Light-weight domain-based form assistant: querying web databases on the fly
VLDB '05 Proceedings of the 31st international conference on Very large data bases
VLDB '05 Proceedings of the 31st international conference on Very large data bases
WebIQ: Learning from the Web to Match Deep-Web Query Interfaces
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Merging Source Query Interfaces onWeb Databases
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
An adaptive crawler for locating hidden-Web entry points
Proceedings of the 16th international conference on World Wide Web
Context-aware wrapping: synchronized data extraction
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Querying capability modeling and construction of deep web sources
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
EasyQuerier: a keyword based interface for web database integration system
DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Understanding deep web search interfaces: a survey
ACM SIGMOD Record
Deep web integration with VisQI
Proceedings of the VLDB Endowment
Clustering structured web sources: a schema-based, model-differentiation approach
EDBT'04 Proceedings of the 2004 international conference on Current Trends in Database Technology
Accelerating Structured Web Crawling without Losing Data
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Hi-index | 0.01 |
The problem of extracting data that resides in the deep Web has become the center of many research efforts in the recent few years. The challenges in this research area are spanning from online databases discovery and forms extraction from query interfaces, to receiving structured queries from the user, submitting them automatically and retrieving accurate results back to the user. Therefore, the main task is to build an integrated system that connects this variety of missions. In this paper we give an overview of this area of research. We start by surveying previous deep Web systems. After that we define the basic components of a typical deep Web integrated system. Finally, we highlight the current challenges along with possible future research directions.