Processing heterogeneous collections in XML information retrieval
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
This article presents our work within the INEX 2004 Heterogeneous Track. We focused on taming the structural diversity of the INEX heterogeneous bibliographic corpus. We demonstrate how semantic models and associated inference techniques can be used to solve the problems raised by structural diversity within a given XML corpus. The first step automatically extracts a set of concepts from each class of INEX heterogeneous documents. A unified set of concepts is then computed, which synthesizes the interesting concepts from the whole corpus. Individual corpora are connected to the unified set of concepts via conceptual mappings. This approach is implemented as an application of the KadoP platform for peer-to-peer warehousing of XML documents. While this work addresses the structural aspects of XML information retrieval, the extensibility of the KadoP system makes it an interesting test platform into which components developed by several INEX participants could be plugged, exploiting the opportunities of peer-to-peer data and service distribution.
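The three steps described above (per-collection concept extraction, computation of a unified concept set, and conceptual mappings) can be illustrated with a minimal sketch. It is not the KadoP implementation: the abstract does not say how concepts are extracted or unified, so the sketch assumes that element tag names stand in for concepts and that a small hand-written synonym table stands in for the semantic model. The names `SYNONYMS`, `extract_concepts`, `normalize`, `unify`, and the two sample collections are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical synonym table standing in for a semantic model (assumption).
SYNONYMS = {"writer": "author", "ti": "title"}


def extract_concepts(xml_docs):
    """Step 1: collect the element tag names used in one collection.

    Assumption: tag names are treated as the collection's concepts.
    """
    concepts = set()
    for doc in xml_docs:
        root = ET.fromstring(doc)
        for elem in root.iter():
            concepts.add(elem.tag)
    return concepts


def normalize(tag):
    """Map a collection-specific tag to a unified concept name."""
    name = tag.lower()
    return SYNONYMS.get(name, name)


def unify(collections):
    """Steps 2 and 3: build the unified concept set and per-collection mappings.

    Returns (unified, mappings), where mappings[collection][tag] is the
    unified concept that the collection's tag is mapped to.
    """
    unified = set()
    mappings = {}
    for name, docs in collections.items():
        mapping = {tag: normalize(tag) for tag in extract_concepts(docs)}
        mappings[name] = mapping
        unified.update(mapping.values())
    return unified, mappings


# Two made-up bibliographic collections with different structures.
collections = {
    "collection_a": ["<article><author>A. Smith</author><title>T1</title></article>"],
    "collection_b": ["<entry><writer>B. Jones</writer><ti>T2</ti></entry>"],
}
unified, mappings = unify(collections)
```

In this sketch a query phrased against the unified concept `author` could be rewritten to `author` for `collection_a` and to `writer` for `collection_b` by inverting each mapping. A real system would presumably also take element paths and context into account, not only tag names.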