A research agenda for query processing in large-scale peer data management systems

Authors:
Katja Hose;Armin Roth;André Zeitz;Kai-Uwe Sattler;Felix Naumann
Affiliations:
Technische Universität Ilmenau, FG Datenbanken und Informationssysteme, D-98684 Ilmenau, Germany;Hasso-Plattner-Institut für Softwaresystemtechnik (HPI), D-14482 Potsdam, Germany;Universität Rostock, Universitätsrechenzentrum/Lehrstuhl Datenbank- und Informationssysteme, D-18051 Rostock, Germany;Technische Universität Ilmenau, FG Datenbanken und Informationssysteme, D-98684 Ilmenau, Germany;Hasso-Plattner-Institut für Softwaresystemtechnik (HPI), D-14482 Potsdam, Germany
Venue:
Information Systems
Year:
2008

Abstract

Peer Data Management Systems (PDMS) are a novel, useful, but challenging paradigm for distributed data management and query processing. Conventional integrated information systems have a hierarchical structure with an integration component that manages a global schema and distributes queries against this schema to the underlying data sources. PDMS are a natural extension of this architecture: they allow each participating system (peer) to act both as a data source and as an integrator. Peers are interconnected by schema mappings, which guide the rewriting of queries between the heterogeneous schemas, and thus form a P2P (peer-to-peer)-like network. Despite several years of research, the development of efficient PDMS still holds many challenges. In this article we first survey the state of the art on peer data management: we classify PDMS by characteristics concerning their system model, their semantics, their query planning schemes, and their maintenance. Then we systematically examine open research directions in each of those areas. In particular, we observe that research results from both the domain of P2P systems and that of conventional distributed data management can have an impact on the development of PDMS.