The Clio project: managing heterogeneity
ACM SIGMOD Record
Authoring and annotation of web pages in CREAM
Proceedings of the 11th international conference on World Wide Web
Superimposed Schematics: Introducing E-R Structure for In-Situ Information Selections
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
Putting integrated information in context: superimposing conceptual models with SPARCE
APCCM '04 Proceedings of the first Asian-Pacific conference on Conceptual modelling - Volume 31
Personal information management with SEMEX
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Supporting insight-based information exploration in intelligence analysis
Communications of the ACM - Supporting exploratory search
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Definition and Formalization of Entity Resolution Functions for Everyday Information Integration
Semantics in Data and Knowledge Bases
Hi-index | 0.00 |
Integration of large databases by expert teams is only a small part of the data integration activities that take place. Users without data integration expertise very often gather, organize, reconcile, and use diverse information as a normal part of their jobs. Often, they do this by copying data into a text file or spreadsheet. In doing so, they make significant data integration decisions. They often express a mental model, or schema, over their data. They organize data to describe real-world entities. They reconcile redundancy and disagreements in their data. Such integration is both ubiquitous and not generally supported by experts and tools available for large integration efforts. We seek to capture and make explicit the user's mental model, and the attribute and entity correspondences they express, during these activities. This paper contributes the definition of a set of functions that support this type of data integration, a conceptual model to support these functions, and an associated simple tool that supports data integration by end-users in an entity-centric way, with an extensible schema, that makes the user's job easier.