A framework for supporting data integration using the materialized and virtual approaches
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Integrating heterogeneous databases: lazy or eager?
ACM Computing Surveys (CSUR) - Special issue: position statements on strategic directions in computing research
Communications of the ACM - ACM at sixty: a look back in time
Mining search engine query logs via suggestion sampling
Proceedings of the VLDB Endowment
Towards rich query interpretation: walking back and forth for mining query templates
Proceedings of the 19th international conference on World wide web
Understanding the semantic structure of noun phrase queries
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Dataspaces: a new abstraction for information management
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Unsupervised extraction of template structure in web search queries
Proceedings of the 21st international conference on World Wide Web
Principles of Data Integration
Principles of Data Integration
Proactive natural language search engine: tapping into structured data on the web
Proceedings of the 16th International Conference on Extending Database Technology
Searching the deep web using proactive phrase queries
Proceedings of the 22nd international conference on World Wide Web companion
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
The massive and diverse data sources on the Deep Web presents a serious data integration challenge. Existing virtual integration approaches suffer from slow query response, while surfacing approaches demand hefty storage space and incur huge costs in maintaining data freshness. We propose a novel hybrid integration approach that strikes a balance between the virtual and surfacing approaches. The key idea is to capture user needs in query templates and focus the integration efforts on the templates. However, realizing this approach requires innovations in template-driven query planning, query parsing, and template discovery. We elaborate on these challenges and propose our solutions.