Faced with growing knowledge management needs, enterprises increasingly recognize the importance of seamlessly integrating critical business information distributed across structured and unstructured data sources. In existing information integration solutions, the application must formulate the SQL logic to retrieve the needed structured data and, separately, identify a set of keywords to retrieve the related unstructured data. This paper proposes a novel approach in which the application specifies its information needs using only a SQL query on the structured data, and this query is automatically "translated" into a set of keywords that can be used to retrieve relevant unstructured data. We describe the techniques used for obtaining these keywords from (i) the query result and (ii) additional related information in the underlying database. We further show that these techniques achieve high accuracy with reasonable overheads.
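As a rough illustration of step (i), obtaining keywords from the query result, the sketch below runs a SQL query and ranks the terms that appear in its text-valued cells. It is not the paper's technique: the plain term-frequency ranking is an assumption made for illustration, and the `complaints` table, its rows, and the `keywords_from_query` function are hypothetical names invented here.

```python
import re
import sqlite3
from collections import Counter


def keywords_from_query(conn, sql, top_k=5):
    """Run `sql` and return the most frequent terms in its text-valued cells.

    Assumption: raw term frequency stands in for whatever scoring
    a real system would use to select keywords.
    """
    counts = Counter()
    for row in conn.execute(sql):
        for value in row:
            if isinstance(value, str):
                # Lowercase and split into alphanumeric tokens
                counts.update(re.findall(r"[a-z0-9]+", value.lower()))
    # Highest count first; ties broken alphabetically for stable output
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_k]]


# Hypothetical structured data held in an in-memory database
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE complaints (customer TEXT, product TEXT, region TEXT)")
conn.executemany(
    "INSERT INTO complaints VALUES (?, ?, ?)",
    [
        ("Acme", "router", "east"),
        ("Acme", "modem", "east"),
        ("Globex", "router", "west"),
    ],
)

keywords = keywords_from_query(
    conn, "SELECT customer, product FROM complaints WHERE region = 'east'"
)
print(keywords)
```

The returned keywords could then be passed to any text search engine to retrieve related unstructured documents. Step (ii), drawing on additional related information in the underlying database, is not covered by this sketch.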