The R*-tree: an efficient and robust access method for points and rectangles
SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Query reformulation for dynamic information integration
Journal of Intelligent Information Systems - Special issue on intelligent integration of information
Infomaster: an information integration system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Semantic integration of semistructured and structured data sources
ACM SIGMOD Record
GlOSS: text-source discovery over the Internet
ACM Transactions on Database Systems (TODS)
Ontological Approach for Information Discovery in Internet Databases
Distributed and Parallel Databases
Scaling Access to Heterogeneous Data Sources with DISCO
IEEE Transactions on Knowledge and Data Engineering
Chord: a scalable peer-to-peer lookup protocol for internet applications
IEEE/ACM Transactions on Networking (TON)
Querying Heterogeneous Information Sources Using Source Descriptions
VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
ObjectGlobe: Ubiquitous query processing on the Internet
The VLDB Journal — The International Journal on Very Large Data Bases
The description logic handbook: theory, implementation, and applications
The description logic handbook: theory, implementation, and applications
Scalable Semantic Brokering over Dynamic Heterogeneous Data Sources in InfoSleuth"
IEEE Transactions on Knowledge and Data Engineering
The hyperion project: from data integration to data coordination
ACM SIGMOD Record
The Piazza Peer Data Management System
IEEE Transactions on Knowledge and Data Engineering
On location models for ubiquitous computing
Personal and Ubiquitous Computing
Using Constraints to Describe Source Contents in Data Integration Systems
IEEE Intelligent Systems
Ontology-based data source localization in a structured peer-to-peer environment
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Making the World Wide Space happen: New challenges for the Nexus context platform
PERCOM '09 Proceedings of the 2009 IEEE International Conference on Pervasive Computing and Communications
Structural and Role-Oriented Web Service Discovery with Taxonomies in OWL-S
IEEE Transactions on Knowledge and Data Engineering
Quete: ontology-based query system for distributed sources
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Hi-index | 0.00 |
Scaling heterogeneous information systems (HIS) to thousands of sources poses particular challenges to source discovery. It requires a powerful formalism for describing the contents of the sources in a concise manner and for formulating compatible queries as well as a suitable structure for indexing and retrieving the source descriptions efficiently. We propose an extended logic-based description formalism for large-scale HIS with structured sources and a shared ontology. The formalism refines existing approaches that describe the sources by constraints on the attribute value ranges in several ways: It allows for complex, nested descriptions based on defined classes. It supports alternative descriptions to express that a source may be discovered by different combinations of constraints. Finally, it allows to adjust between positive matching, similar to keyword-based discovery, and negative matching, as used in existing logic-based approaches. We further propose the SDC-Tree for indexing such source descriptions. To allow for efficient discovery, the SDC-Tree features multidimensional indexing capabilities for the different attributes and the IS-A hierarchy of the shared ontology, but also incorporates the existence or absence of constraints. For this purpose, it supports three different types of node split operations which exploit the expressiveness of the description formalism. Therefore, we also propose a generic split algorithm which can be used with arbitrary ontologies.