Component software: beyond object-oriented programming
Component software: beyond object-oriented programming
Javaserver Pages
Distributed and Parallel Databases
Kepler: An Extensible System for Design and Execution of Scientific Workflows
SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
Delivering web service coordination capability to users
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Parameter sweeps for exploring GP parameters
GECCO '05 Proceedings of the 7th annual workshop on Genetic and evolutionary computation
VisTrails: visualization meets data management
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Interpreting the data: Parallel analysis with Sawzall
Scientific Programming - Dynamic Grids and Worldwide Computing
Dryad: distributed data-parallel programs from sequential building blocks
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Proceedings of the 5th IEEE workshop on Challenges of large applications in distributed environments
Introducing secure provenance: problems and challenges
Proceedings of the 2007 ACM workshop on Storage security and survivability
MapReduce: simplified data processing on large clusters
Communications of the ACM - 50th anniversary issue: 1958 - 2008
Falkon: a Fast and Light-weight tasK executiON framework
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
Efficient scheduling of scientific workflows in a high performance computing cluster
CLADE '08 Proceedings of the 6th international workshop on Challenges of large applications in distributed environments
Provenance for Computational Tasks: A Survey
Computing in Science and Engineering
Scientific Workflow Systems for 21st Century, New Bottle or New Wine?
SERVICES '08 Proceedings of the 2008 IEEE Congress on Services - Part I
ESCIENCE '08 Proceedings of the 2008 Fourth IEEE International Conference on eScience
Workflows and e-Science: An overview of workflow system features and capabilities
Future Generation Computer Systems
A Strategy for Provenance Gathering in Distributed Scientific Workflows
SERVICES '09 Proceedings of the 2009 Congress on Services - I
An opportunistic algorithm for scheduling workflows on grids
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
Parallelism in bioinformatics workflows
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Data parallelism in bioinformatics workflows using Hydra
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
A P2P approach to many tasks computing for scientific workflows
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Towards jungle computing with Ibis/Constellation
Proceedings of the 2011 workshop on Dynamic distributed data-intensive applications, programming abstractions, and systems
Supporting dynamic parameter sweep in adaptive and user-steered workflow
Proceedings of the 6th workshop on Workflows in support of large-scale science
Design and implementation of "many parallel task" hybrid subsurface model
Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
Poster: scientific data parallelism using P2P technique
Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion
An adaptive parallel execution strategy for cloud-based scientific workflows
Concurrency and Computation: Practice & Experience
A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
Journal of Grid Computing
A framework for readapting and running bioinformatics applications in the cloud
Proceedings of the 2012 ACM Research in Applied Computation Symposium
Using domain-specific data to enhance scientific workflow steering queries
IPAW'12 Proceedings of the 4th international conference on Provenance and Annotation of Data and Processes
Evaluating parameter sweep workflows in high performance computing
Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
Performance evaluation of parallel strategies in public clouds: A study with phylogenomic workflows
Future Generation Computer Systems
User-steering of HPC workflows: state-of-the-art and future directions
Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
Hi-index | 0.00 |
One of the main advantages of using a scientific workflow management system (SWfMS) to orchestrate data flows among scientific activities is to control and register the whole workflow execution. The execution of activities within a workflow with high performance computing (HPC) presents challenges in SWfMS execution control. Current solutions leave the scheduling to the HPC queue system. Since the workflow execution engine does not run on remote clusters, SWfMS are not aware of the parallel strategy of the workflow execution. Consequently, remote execution control and provenance registry of the parallel activities is very limited from the SWfMS side. This work presents a set of components to be included on the workflow specification of any SWMfS to control parallelization of activities as MTC. In addition, these components can gather provenance data during remote workflow execution. Through these MTC components, the parallelization strategy can be registered and reused, and provenance data can be uniformly queried. We have evaluated our approach by performing parameter sweep parallelization in solving the incompressible 3D Navier-Stokes equations. Experimental results show the performance gains with the additional benefits of distributed provenance support.