Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Identity Boxing: A New Technique for Consistent Global Identity
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
NFSv4 replication for grid storage middleware
Proceedings of the 4th international workshop on Middleware for grid computing
Explicit control a batch-aware distributed file system
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
Deploying virtual machines as sandboxes for the grid
WORLDS'05 Proceedings of the 2nd conference on Real, Large Distributed Systems - Volume 2
Data driven workflow planning in cluster management systems
Proceedings of the 16th international symposium on High performance distributed computing
Towards realistic file-system benchmarks with CodeMRI
ACM SIGMETRICS Performance Evaluation Review
A cost-effective distributed file service with QoS guarantees
Proceedings of the ACM/IFIP/USENIX 2007 International Conference on Middleware
Proceedings of the second international workshop on Data-aware distributed computing
Node-capability-aware replica management for peer-to-peer grids
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
A data locality aware online scheduling approach for I/O-intensive jobs with file sharing
JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
On grid performance evaluation using synthetic workloads
JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
An opportunistic algorithm for scheduling workflows on grids
VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
A cost-effective distributed file service with QoS guarantees
MIDDLEWARE2007 Proceedings of the 8th ACM/IFIP/USENIX international conference on Middleware
Scheduling of tasks with batch-shared I/O on heterogeneous systems
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Gvu: a view-oriented framework for data management in grid environments
HiPC'06 Proceedings of the 13th international conference on High Performance Computing
Faults in large distributed systems and what we can do about them
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
The characteristics and performance of groups of jobs in grids
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Evaluating parameter sweep workflows in high performance computing
Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
Hi-index | 0.00 |
We present a study of six batch-pipelined scientific workloads that are candidates for execution on computational grids. Whereas other studies focus on the behavior of single applications, this study characterizes workloads composed of pipelines of sequential processes that use file storage for communication and also share significant data across a batch. This study includes measurements of the memory, CPU, and I/O requirements of individual components as well as analyses of I/O sharing within complete batches. We conclude with a discussion of the ramifications of these workloads for end-to-end scalability and overall system design.