Semantic integration of semistructured and structured data sources
ACM SIGMOD Record
Using Schema Matching to Simplify Heterogeneous Data Translation
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
DBXplorer: A System for Keyword-Based Search over Relational Databases
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Keyword Searching and Browsing in Databases using BANKS
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Effective keyword search in relational databases
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
WIDM '06 Proceedings of the 8th annual ACM international workshop on Web information and data management
Discover: keyword search in relational databases
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Metadata management for federated databases
Proceedings of the ACM first workshop on CyberInfrastructure: information management in eScience
Referential integrity quality metrics
Decision Support Systems
Information retrieval from digital libraries in SQL
Proceedings of the 10th ACM workshop on Web information and data management
Enhanced Business Intelligence using EROCS
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Models for association rules based on clustering and correlation
Intelligent Data Analysis
DBDOC: querying and browsing databases and interrelated documents
Proceedings of the First International Workshop on Keyword Search on Structured Data
Integrating and querying web databases and documents
Proceedings of the 20th ACM international conference on Information and knowledge management
Integrating and querying source code of programs working on a database
KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
Querying external source code files of programs connecting to a relational database
Proceedings of the 5th Ph.D. workshop on Information and knowledge
Hi-index | 0.00 |
Given the continuous growth of databases and the abundance of diverse files in modern IT environments, there is a pressing need to integrate keyword search on heterogeneous information sources. A particular case in which such integration is needed occurs when a collection of documents (e.g. word processing documents, spreadsheets, text files and so on) is derived directly from a central database, and both repositories are independently updated. Finding hidden relationships between documents and databases is difficult, given the loose connection between them. This problem is especially complicated when database integration techniques must be extended to handle semi-structured data (i.e. documents). Our research focuses on exploiting a relational database system for integrating and exploring complex interrelationships between a database and a collection of potentially related documents. We focus on the discovery and ranking of keyword links (relationships) at different granularity levels between a database schema and a collection of documents. We adapt, extend, and combine information retrieval techniques into the DBMS. As such, we provide algorithms for efficient exploration of discovered relationships among a collection of documents and a DBMS. We experimentally show that our system can discover, query and rank complex relationships discovered between a database and surrounding documents.