OPQL: A First OPM-Level Query Language for Scientific Workflow Provenance
SCC '11 Proceedings of the 2011 IEEE International Conference on Services Computing
Provenance has become increasingly important in scientific workflows for understanding, verifying, and reproducing the results of scientific data analysis. Most existing systems keep provenance data in provenance stores with proprietary provenance data models and process queries directly against the physical storage using query languages such as SQL, SPARQL, and XQuery, which are closely coupled to the underlying storage strategies. Querying provenance at such a low level leads to poor usability: a user needs to know the underlying schema to formulate queries; if the schema changes, queries must be reformulated; and queries formulated for one system will not run on another system. In this paper, we present OPQL, a provenance query language that enables querying provenance directly at the graph level. An OPQL query takes a provenance graph as input and produces another provenance graph as output, so OPQL queries are not tightly coupled to the underlying provenance storage strategies. Our main contributions are: (i) we design OPQL, including six types of graph patterns, a provenance graph algebra, and the OPQL syntax and semantics, to support querying provenance at the graph level; (ii) we implement OPQL as a Web service via our OPMProv system, so users can invoke the Web service to execute OPQL queries from a provenance browser called OPMProVis, which displays the query result as a provenance graph. An experimental study evaluates the feasibility and performance of OPMProv for OPQL provenance querying.
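To make the graph-in, graph-out idea concrete, the following is a minimal Python sketch of a query that takes a provenance graph and returns another provenance graph. It is not OPQL syntax and not one of the paper's six graph patterns; the names `ProvGraph`, `Edge`, `lineage`, and `build_example` are hypothetical, as is the sample workflow. Only the edge labels `used` and `wasGeneratedBy` are taken from the Open Provenance Model, and edges are assumed to point from an effect to its cause.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Edge:
    """A causal dependency pointing from an effect to its cause (assumed direction)."""
    source: str
    label: str
    target: str


@dataclass
class ProvGraph:
    """Hypothetical in-memory provenance graph: node id -> kind, plus labeled edges."""
    nodes: dict = field(default_factory=dict)
    edges: set = field(default_factory=set)

    def add_edge(self, source, label, target):
        self.edges.add(Edge(source, label, target))


def lineage(graph, start):
    """Return the subgraph of everything `start` transitively depends on.

    Input and output are both ProvGraph values, so queries can be composed
    without any knowledge of how the graph is physically stored.
    """
    if start not in graph.nodes:
        return ProvGraph()
    seen = {start}
    frontier = [start]
    kept = set()
    while frontier:
        current = frontier.pop()
        for edge in graph.edges:
            if edge.source == current:
                kept.add(edge)
                if edge.target not in seen:
                    seen.add(edge.target)
                    frontier.append(edge.target)
    kept_nodes = {n: graph.nodes[n] for n in seen if n in graph.nodes}
    return ProvGraph(kept_nodes, kept)


def build_example():
    """Build a small, made-up workflow run: raw -> align -> aligned -> plot -> figure."""
    g = ProvGraph()
    g.nodes.update({
        "raw": "artifact", "align": "process", "aligned": "artifact",
        "plot": "process", "figure": "artifact", "unrelated": "artifact",
    })
    g.add_edge("align", "used", "raw")
    g.add_edge("aligned", "wasGeneratedBy", "align")
    g.add_edge("plot", "used", "aligned")
    g.add_edge("figure", "wasGeneratedBy", "plot")
    return g


if __name__ == "__main__":
    result = lineage(build_example(), "aligned")
    print(sorted(result.nodes))
```

Because the result is itself a provenance graph, one query can be applied to the output of another, for example `lineage(lineage(g, "figure"), "aligned")`, which is the kind of closure that a graph algebra relies on.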