A mechanism for environment integration
ACM Transactions on Programming Languages and Systems (TOPLAS)
Toward a theoretical understanding of why and when decision tree pruning algorithms fail
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Little-JIL/Juliette: a process definition language and interpreter
Proceedings of the 22nd international conference on Software engineering
Tioga-2: A Direct Manipulation Database Visualization Environment
ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Chimera: AVirtual Data System for Representing, Querying, and Automating Data Derivation
SSDBM '02 Proceedings of the 14th International Conference on Scientific and Statistical Database Management
The semantics and execution of a synchronous block-diagram language
Science of Computer Programming
Workflow management in GriPhyN
Grid resource management
Modeling of sensor nets in Ptolemy II
Proceedings of the 3rd international symposium on Information processing in sensor networks
Kepler: An Extensible System for Design and Execution of Scientific Workflows
SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
Flow analysis for verifying properties of concurrent software systems
ACM Transactions on Software Engineering and Methodology (TOSEM)
ASKALON: a tool set for cluster and Grid computing: Research Articles
Concurrency and Computation: Practice & Experience - Grid Performance
An approach for pipelining nested collections in scientific workflows
ACM SIGMOD Record
A survey of data provenance in e-science
ACM SIGMOD Record
Panoply of Utilities in Taverna
E-SCIENCE '05 Proceedings of the First International Conference on e-Science and Grid Computing
The virtual data grid: a new model and architecture for data-intensive collaboration
SSDBM '03 Proceedings of the 15th International Conference on Scientific and Statistical Database Management
Mirroring Resources or Mapping Requests: Implementing WS-RF for Grid Workflows
CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
VisTrails: visualization meets data management
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Scientific workflow management and the Kepler system: Research Articles
Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
ASKALON: A Grid Application Development and Computing Environment
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Special Issue: The First Provenance Challenge
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
FLAVERS: a finite state verification technique for software systems
IBM Systems Journal
The JOpera visual composition language
Journal of Visual Languages and Computing
Process technology to facilitate the conduct of science
SPW'05 Proceedings of the 2005 international conference on Unifying the Software Process Spectrum
Provenance collection support in the kepler scientific workflow system
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Hierarchical finite state machines with multiple concurrency models
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Efficient provenance storage over nested data collections
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
A Pattern for Modeling Rework in Software Development Processes
ICSP '09 Proceedings of the International Conference on Software Process: Trustworthy Software Development Processes
Structural Considerations in Defining Executable Process Models
ICSP '09 Proceedings of the International Conference on Software Process: Trustworthy Software Development Processes
Journal of Parallel and Distributed Computing
Process model based incremental project planning
ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part II
A data dependency based strategy for intermediate data storage in scientific cloud workflow systems
Concurrency and Computation: Practice & Experience
DEX: Digital evidence provenance supporting reproducibility and comparison
Digital Investigation: The International Journal of Digital Forensics & Incident Response
Hi-index | 0.00 |
This paper describes our experiences in exploring the applicability of software engineering approaches to scientific data management problems. Specifically, this paper describes how process definition languages can be used to expedite production of scientific datasets as well as to generate documentation of their provenance. Our approach uses a process definition language that incorporates powerful semantics to encode scientific processes in the form of a Process Definition Graph (PDG). The paper describes how execution of the PDG-defined process can generate Dataset Derivation Graphs (DDGs), metadata that document how the scientific process developed each of its product datasets. The paper uses an example to show that scientific processes may be complex and to illustrate why some of the more powerful semantic features of the process definition language are useful in supporting clarity and conciseness in representing such processes. This work is similar in goals to work generally referred to as Scientific Workflow. The paper demonstrates the contribution that software engineering can make to this domain.