Processing and optimization of complex queries in schema-based p2p-networks

Authors:
Hadhami Dhraief;Alfons Kemper;Wolfgang Nejdl;Christian Wiesner
Affiliations:
Information Systems Institute, University of Hannover, Germany;Computer Science Department, Technical University of Munich, Germany;L3S Research Center, University of Hannover, Germany;Computer Science Department, University of Passau, Germany
Venue:
DBISP2P'04 Proceedings of the Second international conference on Databases, Information Systems, and Peer-to-Peer Computing
Year:
2004

Citing 18
Cited 0

Query optimization for parallel execution

SIGMOD '92 Proceedings of the 1992 ACM SIGMOD international conference on Management of data
Data model and query evaluation in global information systems

Journal of Intelligent Information Systems - Special issue: networked information discovery and retrieval
Chord: A scalable peer-to-peer lookup service for internet applications

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
A scalable content-addressable network

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
EDUTELLA: a P2P networking infrastructure based on RDF

Proceedings of the 11th international conference on World Wide Web
Garlic: a new flavor of federated query processing for DB2

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Building Dynamic Market Places Using HyperQueries

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
Hyperqueries: Dynamic Distributed Query Processing on the Internet

Proceedings of the 27th International Conference on Very Large Data Bases
LEO - DB2's LEarning Optimizer

Proceedings of the 27th International Conference on Very Large Data Bases
Heuristic and randomized optimization for the join ordering problem

The VLDB Journal — The International Journal on Very Large Data Bases
ObjectGlobe: Ubiquitous query processing on the Internet

The VLDB Journal — The International Journal on Very Large Data Bases
Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks

WWW '03 Proceedings of the 12th international conference on World Wide Web
Piazza: data management infrastructure for semantic web applications

WWW '03 Proceedings of the 12th international conference on World Wide Web
Improving Search in Peer-to-Peer Networks

ICDCS '02 Proceedings of the 22 nd International Conference on Distributed Computing Systems (ICDCS'02)
Dynamic Extensible Query Processing in Super-Peer Based P2P Systems

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Index structures and algorithms for querying distributed RDF repositories

Proceedings of the 13th international conference on World Wide Web
The history of histograms (abridged)

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
HyperCuP: hypercubes, ontologies, and efficient search on peer-to-peer networks

AP2PC'02 Proceedings of the 1st international conference on Agents and peer-to-peer computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Peer-to-Peer infrastructures are emerging as one of the important data management infrastructures in the World Wide Web. So far, however, most work has focused on simple P2P networks which tackle efficient query distribution to a large set of peers but assume that each query can be answered completely at each peer. For queries which need data from more than one peer to be executed this is clearly insufficient. Unfortunately, though quite a few database techniques can be re-used in the P2P context, P2P data management infrastructures pose additional challenges caused by the dynamic nature of these networks. In P2P networks, we can assume neither global knowledge about data distribution, nor the suitableness of static topologies and static query plans for these networks. Unlike in traditional distributed database systems, we cannot assume complete information schema and allocation schema instances but rather work with distributed schema information which can only direct query processing tasks from one node to one or more neighboring nodes. In this paper we first describe briefly our super-peer based topology and schema-aware distributed routing indices extended with suitable statistics and describe how this information is extracted and updated. Second we show how these indices facilitate the distribution and dynamic expansion of query plans. Third we propose a set of transformation rules to optimize query plans and discuss different optimization strategies in detail, enabling efficient distributed query processing in a schema-based P2P network.