Keyword search across databases and documents

Authors:
Carlos Garcia-Alvarado;Carlos Ordonez
Affiliations:
University of Houston, Houston, TX;University of Houston, Houston, TX
Venue:
Proceedings of the 2nd International Workshop on Keyword Search on Structured Data
Year:
2010

Abstract

Given the continuous growth of databases and the abundance of diverse files in modern IT environments, there is a pressing need to integrate keyword search across heterogeneous information sources. One case in which such integration is needed arises when a collection of documents (e.g., word processing documents, spreadsheets, and text files) is derived directly from a central database and both repositories are updated independently. Finding hidden relationships between the documents and the database is difficult because the connection between them is loose. The problem is especially complicated when database integration techniques must be extended to handle semi-structured data (i.e., documents). Our research focuses on exploiting a relational database system to integrate and explore complex interrelationships between a database and a collection of potentially related documents. We focus on discovering and ranking keyword links (relationships) at different granularity levels between a database schema and a collection of documents. We adapt, extend, and combine information retrieval techniques and incorporate them into the DBMS. On this basis, we provide algorithms for efficiently exploring the discovered relationships between a collection of documents and a DBMS. We show experimentally that our system can discover, query, and rank complex relationships between a database and its surrounding documents.
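
To make the idea of keyword links at different granularity levels concrete, here is a minimal Python sketch. It is not the authors' algorithm, which the abstract describes only as running inside the DBMS. It assumes that a link exists whenever a table or column name appears verbatim as a token in a document, and that links are ranked by raw term frequency. The schema, the documents, and the function names (`tokenize`, `discover_links`, `rank_links`) are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical schema: table name -> column names (not taken from the paper).
SCHEMA = {
    "customer": ["name", "city"],
    "orders": ["order_date", "total"],
}

# Hypothetical document collection: document id -> text.
DOCUMENTS = {
    "report.txt": "Customer totals by city. Each customer has a city.",
    "notes.txt": "Orders are grouped by order_date.",
}


def tokenize(text):
    """Lowercase the text and split it into word tokens (underscores kept)."""
    return re.findall(r"[a-z0-9_]+", text.lower())


def discover_links(schema, documents):
    """Return (granularity, schema_element, doc_id, frequency) tuples.

    Assumption: a link exists when a table or column name occurs verbatim
    as a token in a document; frequency is the number of occurrences.
    """
    links = []
    for doc_id, text in documents.items():
        counts = Counter(tokenize(text))
        for table, columns in schema.items():
            if counts[table]:
                links.append(("table", table, doc_id, counts[table]))
            for column in columns:
                if counts[column]:
                    links.append(
                        ("column", f"{table}.{column}", doc_id, counts[column])
                    )
    return links


def rank_links(links):
    """Order links by descending frequency, then by element and document."""
    return sorted(links, key=lambda link: (-link[3], link[1], link[2]))


ranked = rank_links(discover_links(SCHEMA, DOCUMENTS))
for granularity, element, doc_id, frequency in ranked:
    print(granularity, element, doc_id, frequency)
```

Exact token matching keeps the sketch short but misses inflected forms: "totals" in `report.txt` does not link to the `total` column. A fuller treatment would likely need stemming or a weighting scheme such as TF-IDF, neither of which is claimed here to be what the paper uses.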