Provenance in the context of workflows, both for the data they derive and for their specification, is essential for result reproducibility, sharing, and knowledge reuse in the scientific community. Several workshops have been held on the topic, and it has been the focus of many research projects and prototype systems. This tutorial provides an overview of research issues in provenance for scientific workflows, with a focus on recent literature and technology in this area. It is aimed at a general database research audience and at people who work with scientific data and workflows. We will (1) provide a general overview of scientific workflows; (2) describe research on provenance for scientific workflows and show in detail how provenance is supported in existing systems; (3) discuss emerging applications that are enabled by provenance; and (4) outline open problems and new directions for database-related research.