Exploiting Parallelism to Accelerate Keyword Search on Deep-Web Sources

Authors:
Tantan Liu; Fan Wang; Gagan Agrawal
Affiliations:
Department of Computer Science and Engineering, Ohio State University, Columbus 43210 (all authors)
Venue:
DILS '09 Proceedings of the 6th International Workshop on Data Integration in the Life Sciences
Year:
2009

Abstract

Biological data is increasingly shared over the deep web, and many biological queries can be answered only by successively searching a number of distinct websites. This paper introduces a system that exploits parallelism to accelerate search over multiple deep-web data sources. An interactive, two-stage multi-threading system is developed to achieve task parallelization, thread parallelization, and pipelined parallelization. We demonstrate the effectiveness of the system on a number of queries involving SNP datasets and show that most of them can be accelerated significantly by exploiting these three forms of parallelism.
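
The three forms of parallelism named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation, and the abstract does not define the terms; the sketch assumes that task parallelism means running independent sub-queries concurrently, that thread parallelism means issuing many inputs to one source from several threads, and that pipelined parallelism means forwarding results to the next source as soon as they arrive. The source functions (`snp_source`, `annotation_source`, `expression_source`), their outputs, and the gene names are hypothetical placeholders standing in for real deep-web form queries.

```python
from concurrent.futures import ThreadPoolExecutor
from queue import Queue
from threading import Thread

# Hypothetical stand-ins for deep-web sources (assumptions, not from the paper).
# A real system would submit web forms and parse the returned pages.
def snp_source(gene):
    """Pretend lookup: gene name -> list of SNP ids."""
    return [f"{gene}-snp{i}" for i in range(2)]

def annotation_source(snp_id):
    """Pretend lookup: SNP id -> annotation string."""
    return f"annotation({snp_id})"

def expression_source(gene):
    """Pretend lookup that does not depend on the SNP results."""
    return f"expression({gene})"

_DONE = object()  # sentinel marking the end of the stream

def snp_stage(genes, out_queue, workers=4):
    # Thread parallelism (assumed meaning): many inputs against one source,
    # spread over a pool of worker threads.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for snps in pool.map(snp_source, genes):
            for snp in snps:
                # Pipelined parallelism (assumed meaning): hand each result
                # downstream immediately instead of waiting for the full set.
                out_queue.put(snp)
    out_queue.put(_DONE)

def annotation_stage(in_queue, results):
    # Consumes SNP ids while the upstream stage is still producing them.
    while True:
        item = in_queue.get()
        if item is _DONE:
            break
        results.append(annotation_source(item))

def run_query(genes):
    queue = Queue()
    annotations = []
    producer = Thread(target=snp_stage, args=(genes, queue))
    consumer = Thread(target=annotation_stage, args=(queue, annotations))
    with ThreadPoolExecutor(max_workers=1) as pool:
        # Task parallelism (assumed meaning): an independent sub-query runs
        # alongside the SNP -> annotation pipeline.
        expression_future = pool.submit(
            lambda: [expression_source(g) for g in genes]
        )
        producer.start()
        consumer.start()
        producer.join()
        consumer.join()
        expression = expression_future.result()
    return sorted(annotations), expression
```

Calling `run_query(["BRCA1", "TP53"])` returns the annotations for every SNP of both genes together with the independent expression results. With real sources the benefit would come from overlapping network waits, which the in-memory placeholders here do not exhibit.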