Abstract: Using Provenance to Visualize Data from Large-Scale Experiments

Authors:
Felipe Horta;Jonas Dias;Kary A. C. S. Ocana;Daniel de Oliveira;Eduardo Ogasawara;Marta Mattoso
Affiliations:
-;-;-;-;-;-
Venue:
SCC '12 Proceedings of the 2012 SC Companion: High Performance Computing, Networking Storage and Analysis
Year:
2012

Citing 0
Cited 1

User-steering of HPC workflows: state-of-the-art and future directions

Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

Large-scale scientific computations are often organized as a composition of many computational tasks linked through data flow. The data that flows along this many- task computing often moves from a desktop to a high- performance environment and to a visualization environment. Keeping track of this data flow is a challenge to provenance support in high-performance Scientific Workflow Management Systems. After the completion of a computational scientific experiment, a scientist has to manually select and analyze its staged-out data, for instance, by checking inputs and outputs along computational tasks that were part of the experiment. In this paper, we present a provenance management system that describes the production and consumption relationships between data artifacts, such as files, and the computational tasks that compose the experiment. We propose a query interface that allows for scientists to browse provenance data and select the output they want to visualize using browsers or a high-resolution tiled display.