Entity identification in database integration
Information Sciences: an International Journal
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
Information Systems
Containment and equivalence for a fragment of XPath
Journal of the ACM (JACM)
On the Intersection of XPath Expressions
IDEAS '05 Proceedings of the 9th International Database Engineering & Application Symposium
Deciding XPath containment with MSO
Data & Knowledge Engineering
ACM Computing Surveys (CSUR)
A model for XML instance level integration
SBBD '08 Proceedings of the 23rd Brazilian symposium on Databases
Declarative XML data cleaning with XClean
CAiSE'07 Proceedings of the 19th international conference on Advanced information systems engineering
Using ontologies for XML data cleaning
OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems
XML data integration with identification
DBPL'05 Proceedings of the 10th international conference on Database Programming Languages
Hi-index | 0.00 |
Ensuring high quality data when collecting and integrating information from heterogeneous sources into a data warehouse is a challenging problem. In this paper, we propose a model for XML data fusion, which allows the integrator to define data cleaning rules for solving value conflicts that may have been detected during the integration process. These rules resemble decisions that are made by users when data are manually curated and, once defined, conflicts detected in subsequent integration processes that are within the context of existing rules can be automatically solved without user intervention. We also introduce a notion of fusion policy validation that prevents conflicting resolution rules to be defined. To validate our proposal, we developed XFusion, a rulebased cleaning tool that stores curated data in a integrated repository.