In this paper, we address two problems in peer-to-peer networks: adaptively mapping schemas between different peers, and using those mappings to search for relevant data residing at different peers. We begin by classifying the shared schema of each peer into a taxonomy of relation categories and attribute categories. We then derive adaptive schema mappings by selectively probing the shared schema with query probes generated from the classification rules. To improve the accuracy of the schema mappings, we introduce the notions of a confusion matrix and prior knowledge. Finally, we present a query reformulation strategy for retrieving and integrating data from all relevant peers. We have implemented the proposed schema mapping and query processing methods in real settings with real datasets. The experimental results show that our method is effective in practice.