Complexity of answering queries using materialized views
PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Remarks on the algebra of non first normal form relations
PODS '82 Proceedings of the 1st ACM SIGACT-SIGMOD symposium on Principles of database systems
Operations and the Properties on Non-First-Normal-Form Relational Databases
VLDB '83 Proceedings of the 9th International Conference on Very Large Data Bases
WISE '03 Proceedings of the Fourth International Conference on Web Information Systems Engineering
A survey on tree edit distance and related problems
Theoretical Computer Science
Data exchange: semantics and query answering
Theoretical Computer Science - Database theory
Principles of dataspace systems
Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Provenance management in curated databases
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
ORCHESTRA: facilitating collaborative data sharing
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Update exchange with mappings and provenance
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Databases with uncertainty and lineage
The VLDB Journal — The International Journal on Very Large Data Bases
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Using schema transformation pathways for data lineage tracing
BNCOD'05 Proceedings of the 22nd British National conference on Databases: enterprise, Skills and Innovation
A provenance model for manually curated data
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
A conceptual model and predicate language for data selection and projection based on provenance
TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
Print: a provenance model to support integration processes
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Enabling revisitation of fine-grained clinical information
Proceedings of the 1st ACM International Health Informatics Symposium
Trustworthy information: concepts and mechanisms
WAIM'10 Proceedings of the 11th international conference on Web-age information management
The Foundations for Provenance on the Web
Foundations and Trends in Web Science
W3P: Building an OPM based provenance model for the Web
Future Generation Computer Systems
Hi-index | 0.00 |
Some tasks in a dataspace (a loose collection of heterogeneous data sources) require integration of fine-grained data from diverse sources. This work is often done by end users knowledgeable about the domain, who copy-and-paste data into a spreadsheet or other existing application. Inspired by this kind of work, in this paper we define a data curation setting characterized by data that are explicitly selected, copied, and then pasted into a target dataset where they can be confirmed or replaced. Rows and columns in the target may also be combined, for example, when redundant. Each of these actions is an integration decision, often of high quality, that when taken together comprise the provenance of a data value in the target. In this paper, we define a conceptual model for data and provenance for these user actions, and we show how questions about data provenance can be answered. We note that our model can be used in automated data curation as well as in a setting with the manual activity we emphasize in our examples.