Exploiting Parallelism to Accelerate Keyword Search on Deep-Web Sources

  • Authors:
  • Tantan Liu;Fan Wang;Gagan Agrawal

  • Affiliations:
  • Department of Computer Science and Engineering, Ohio State University, Columbus 43210;Department of Computer Science and Engineering, Ohio State University, Columbus 43210;Department of Computer Science and Engineering, Ohio State University, Columbus 43210

  • Venue:
  • DILS '09 Proceedings of the 6th International Workshop on Data Integration in the Life Sciences
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Increasingly, biological data is being shared over the deep web. Many biological queries can only be answered by successively searching a number of distinct web-sites. This paper introduces a system that exploits parallelization for accelerating search over multiple deep web data sources. An interactive, two-stage multi-threading system is developed to achieve task parallelization, thread parallelization, and pipelined parallelization. We show the effectiveness of our system by considering a number of queries involving SNP datasets. We show that most of the queries can be accelerated significantly by exploiting these three forms of parallelism.