Efficient query reformulation in peer data management systems

Authors:
Igor Tatarinov;Alon Halevy
Affiliations:
University of Washington, Seattle, WA;University of Washington, Seattle, WA
Venue:
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Year:
2004

Abstract

Peer data management systems (PDMS) offer a flexible architecture for decentralized data sharing. In a PDMS, every peer is associated with a schema that represents the peer's domain of interest, and semantic relationships between peers are provided locally between pairs (or small sets) of peers. By traversing semantic paths of mappings, a query over one peer can obtain relevant data from any reachable peer in the network. Semantic paths are traversed by reformulating queries at a peer into queries on its neighbors.

Naively following semantic paths is highly inefficient in practice. We describe several techniques for optimizing the reformulation process in a PDMS and validate their effectiveness using real-life data sets. In particular, we develop techniques for pruning paths in the reformulation process and for minimizing the reformulated queries as they are created. In addition, we consider the effect of the strategy we use to search through the space of reformulations. Finally, we show that pre-computing semantic paths in a PDMS can greatly improve the efficiency of the reformulation process. Together, all of these techniques form a basis for scalable query reformulation in PDMS.

To enable our optimizations, we developed practical algorithms, of independent interest, for checking containment and minimization of XML queries, and for composing XML mappings.
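
The idea of reformulating a query along semantic paths, and of pruning paths during that process, can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes a query is just a set of attribute names, that each mapping is a simple attribute renaming between two peers, and that search is breadth-first. The peer names, the `MAPPINGS` table, and the `reformulate` function are all hypothetical and chosen only for illustration.

```python
from collections import deque

# Hypothetical toy network (an assumption, not from the paper):
# MAPPINGS[(src, dst)] renames attributes of peer `src` to attributes of peer `dst`.
MAPPINGS = {
    ("A", "B"): {"title": "name", "year": "yr"},
    ("B", "C"): {"name": "label", "yr": "date"},
    ("A", "C"): {"title": "label", "year": "date"},
    ("C", "D"): {"label": "heading"},  # "date" has no counterpart at D
}


def reformulate(start_peer, query_attrs):
    """Return {peer: set of reformulated queries} for peers reachable from start_peer.

    A query is modeled as a frozenset of attribute names (a strong simplification).
    """
    start = frozenset(query_attrs)
    results = {start_peer: {start}}
    frontier = deque([(start_peer, start)])  # breadth-first search strategy

    while frontier:
        peer, query = frontier.popleft()
        for (src, dst), rename in MAPPINGS.items():
            if src != peer:
                continue
            # Pruning rule 1: abandon the path if any attribute cannot be translated.
            if not all(attr in rename for attr in query):
                continue
            new_query = frozenset(rename[attr] for attr in query)
            # Pruning rule 2: skip a reformulation already produced for this peer,
            # for example one reached earlier through a different path.
            if dst in results and new_query in results[dst]:
                continue
            results.setdefault(dst, set()).add(new_query)
            frontier.append((dst, new_query))
    return results


if __name__ == "__main__":
    for peer, queries in sorted(reformulate("A", ["title", "year"]).items()):
        print(peer, [sorted(q) for q in queries])
```

In this sketch the path A, B, C yields the same query at C as the direct mapping from A to C, so the second arrival is discarded, and peer D is never reached because one attribute has no translation. Real XML queries and mappings would require containment checks rather than set equality to detect such redundancy.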