Lineage tracing for general data warehouse transformations
The VLDB Journal — The International Journal on Very Large Data Bases
Overcoming the Traceability Benefit Problem
RE '05 Proceedings of the 13th IEEE International Conference on Requirements Engineering
Recovering and using use-case-diagram-to-source-code traceability links
Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
An end-to-end industrial software traceability tool
Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Recording and using provenance in a protein compressibility experiment
HPDC '05 Proceedings of the High Performance Distributed Computing, 2005. HPDC-14. Proceedings. 14th IEEE International Symposium
PASSing the provenance challenge
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Automatic capture and efficient storage of e-Science experiment provenance
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Challenges for semi-automatic trace recovery in the automotive domain
TEFSE '09 Proceedings of the 2009 ICSE Workshop on Traceability in Emerging Forms of Software Engineering
The life and times of files and information: a study of desktop provenance
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Probe-it!: visualization support for provenance
ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Layering in provenance systems
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
The Open Provenance Model core specification (v1.1)
Future Generation Computer Systems
Supporting professional spreadsheet users by generating leveled dataflow diagrams
Proceedings of the 33rd International Conference on Software Engineering
Towards a model of provenance and user views in scientific workflows
DILS'06 Proceedings of the Third international conference on Data Integration in the Life Sciences
Provenance explorer – customized provenance views using semantic inferencing
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Managing rapidly-evolving scientific workflows
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Provenance collection support in the kepler scientific workflow system
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Issues in automatic provenance collection
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Hi-index | 0.00 |
One of the most important tasks in eScience is capturing the provenance of data. While scientists frequently use off-the-shelf analysis tools to process and manipulate data, current provenance techniques such as those based on scientific workflows are typically not able to trace internal data manipulations that occur within these tools. In this paper, we focus on one such off-the-shelf tool, MS Excel, which is used by many scientists; specifically, we propose InSituTrac, an automated in situ provenance approach for spreadsheet data in Excel. Our framework captures data provenance unobtrusively in the background, allows for user annotations, provides undo/redo functionality at various levels of granularity, presents the captured provenance in an accessible format, and visualizes captured provenance to support analysis of the provenance log. We highlight several motivating use case scenarios which show how provenance queries can be answered by our approach. Finally, case studies with an atmospheric science research group and a fisheries research group suggest that the automated provenance approach is both efficient and useful to scientists.