This paper examines XML-based data management support for scientific data analysis workflows. We describe a software infrastructure that aims to address metadata management, data storage, and the execution of data analysis workflows on distributed storage and compute platforms. The system couples a distributed, filter-stream based dataflow engine with a distributed XML-based data and metadata management system. We present experimental results from a biomedical image analysis use case in which digitized microscopy images are processed for feature segmentation.
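The coupling of a filter-stream dataflow with XML metadata can be illustrated with a minimal, single-process sketch. It is not the system described in the paper: the XML schema, the filter names (`read_tiles`, `segment`, `to_xml`), and the `run_pipeline` helper are all assumptions made for illustration, and the segmentation step is a placeholder that only tags each buffer.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Iterable, Iterator, List

# Hypothetical XML metadata describing image tiles (schema is an assumption).
METADATA = """
<image id="slide-1">
  <tile id="t0" x="0" y="0"/>
  <tile id="t1" x="256" y="0"/>
</image>
"""

Filter = Callable[[Iterable[dict]], Iterator[dict]]


def read_tiles(xml_text: str) -> Iterator[dict]:
    """Source filter: emit one data buffer per <tile> element."""
    root = ET.fromstring(xml_text)
    for tile in root.findall("tile"):
        yield {
            "image": root.get("id"),
            "tile": tile.get("id"),
            "x": int(tile.get("x")),
            "y": int(tile.get("y")),
        }


def segment(stream: Iterable[dict]) -> Iterator[dict]:
    """Placeholder for feature segmentation: marks each buffer as processed."""
    for buf in stream:
        yield {**buf, "segmented": True}


def to_xml(stream: Iterable[dict]) -> Iterator[str]:
    """Final filter: serialize each result buffer as an XML element string."""
    for buf in stream:
        elem = ET.Element("result", {k: str(v) for k, v in buf.items()})
        yield ET.tostring(elem, encoding="unicode")


def run_pipeline(source: Iterable[dict], filters: List[Callable]) -> list:
    """Chain filters so each one consumes the stream produced by the previous one."""
    stream = source
    for f in filters:
        stream = f(stream)
    return list(stream)


results = run_pipeline(read_tiles(METADATA), [segment, to_xml])
```

Each filter here is a generator that consumes one stream and produces another, so buffers flow through the chain one at a time. In a distributed setting the streams would cross process or machine boundaries rather than being in-memory iterators.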