Meaningful change detection in structured data
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
On the Resemblance and Containment of Documents
SEQUENCES '97 Proceedings of the Compression and Complexity of Sequences 1997
Workflow mining: a survey of issues and approaches
Data & Knowledge Engineering
A survey on tree edit distance and related problems
Theoretical Computer Science
PASSing the provenance challenge
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Automatic capture and reconstruction of computational provenance
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Provenance for Computational Tasks: A Survey
Computing in Science and Engineering
Lire: lucene image retrieval: an extensible java CBIR library
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Provenance as data mining: combining file system metadata with content analysis
TAPP'09 First workshop on on Theory and practice of provenance
Provenance in Databases: Why, How, and Where
Foundations and Trends in Databases
A survey of graph edit distance
Pattern Analysis & Applications
The Foundations for Provenance on the Web
Foundations and Trends in Web Science
Information provenance in social media
SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering
ICSC '11 Proceedings of the 2011 IEEE Fifth International Conference on Semantic Computing
A survey of automated web service composition methods
SWSWPC'04 Proceedings of the First international conference on Semantic Web Services and Web Process Composition
Learning data transformation rules through examples: preliminary results
Proceedings of the Ninth International Workshop on Information Integration on the Web
Automatic discovery of high-level provenance using semantic similarity
IPAW'12 Proceedings of the 4th international conference on Provenance and Annotation of Data and Processes
Hi-index | 0.00 |
Provenance is an increasingly important aspect of data management that is often underestimated and neglected by practitioners. In our work, we target the problem of reconstructing provenance of files in a shared folder setting, assuming that only standard filesystem metadata are available. We propose a content-based approach that is able to reconstruct provenance automatically, leveraging several similarity measures and edit distance algorithms, adapting and integrating them into a multi-signal pipeline. We discuss our research methodology and show some promising preliminary results.