BioFlow: A Web-Based Declarative Workflow Language for Life Sciences
SERVICES '08 Proceedings of the 2008 IEEE Congress on Services - Part I
OntoMatch: a monotonically improving schema matching system for autonomous data integration
IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration
FastWrap: an efficient wrapper for tabular data extraction from the web
IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration
Transactions on large-scale data- and knowledge-centered systems III
Hi-index | 0.00 |
Formulating and executing queries over distributed, autonomous and heterogeneous resources is an important research area. The advent of the Internet and the Web and their inherent ubiquity have brought forth opportunities to query these information sources in an automated and independent manner. In the domain of information extraction, automatic wrapper generation has been well studied but the efficacy of the current wrappers are limited by the fact that automatic annotation of column names to the extracted tabular data is yet to be perfected. In this paper, we propose a novel annotation system that can assign meaningful column names to the extracted tables for subsequent queries. We enhance our prototype wrapper system FastWrap with this annotator to support fast and autonomous on-the-fly data integration and ad hoc declarative querying.