Active Integration of Databases in Grids for Scalable Distributed Query Processing

  • Authors:
  • Alexander Wöhrer;Peter Brezany

  • Affiliations:
  • Institute of Scientific Computing, University of Vienna, Vienna, Austria A-1090;Institute of Scientific Computing, University of Vienna, Vienna, Austria A-1090

  • Venue:
  • OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

A focus of Grid computing are data intensive applications. Additionally, database management systems (DBMS) are gaining on importance in many scientific disciplines for publication of research results. The employment of Service-oriented-Architecture (SoA) raises the question of how DBMSs and their built-in technologies can be best utilized in such environments. A common way is to pull out all required data for a certain task from a source and process it service side far away from the original source. This approach is characterized by a passive usage of the DBMS as a pure data provider which implies significant overheads. The research effort described in this paper allows an active usage of a DBMS by relocating distributed query processing functionality inside it. Our novel solution utilizes the existing database technology, puts just the interface code at the service level while the data processing code resides at the database level and uses a push mechanism for the result data. The advantages are less overheads and data movement as well as increased data locality. Our proof of concept implementation is evaluated by comparing our distributed query processing prototype working inside popular relational DBMS (Oracle 10g and PostGreSQL 8.3) with a traditional installation of the OGSA-DQP middleware developed for distributed query processing on the Grid.