Lessons from characterizating the input/output behavior of parallel scientific applications
Performance Evaluation - Special issue on tools for performance evaluation
A case study in application I/O on Linux clusters
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Database Allocation Strategies for Parallel BLAST Evaluation on Clusters
Distributed and Parallel Databases
Indexing and Retrieval for Genomic Databases
IEEE Transactions on Knowledge and Data Engineering
TurboBLAST(r): A Parallel Implementation of BLAST Built on the TurboHub
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Characterizing parallel file-access patterns on a large-scale multiprocessor
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
Workload Characterization of Input/Output Intensive Parallel Applications
Proceedings of the 9th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
A Method to Find Uniq e Sequences on Distrib ted Genomic Databases
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
Clusterfile: A Flexible Physical Layout Parallel File System
CLUSTER '01 Proceedings of the 3rd IEEE International Conference on Cluster Computing
Performance Evaluation of Parallel File Systems for PC Clusters and ASCI Red
CLUSTER '01 Proceedings of the 3rd IEEE International Conference on Cluster Computing
Kernel-Level Caching for Optimizing I/O by Exploiting Inter-Application Data Sharing
CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
Research Directions in Parallel I/O for Clusters
CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
I/O Analysis and Optimization for an AMR Cosmology Application
CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
MPI-IO on a Parallel File System for Cluster of Workstations
IWCC '99 Proceedings of the 1st IEEE Computer Society International Workshop on Cluster Computing
The Cluster File System: Integration of High Performance Communication and I/O in Clusters
CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
Protein Explorer: A Petaflops Special-Purpose Computer System for Molecular Dynamics Simulations
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
SNAPI '03 Proceedings of the international workshop on Storage network architecture and parallel I/Os
PVFS: a parallel file system for linux clusters
ALS'00 Proceedings of the 4th annual Linux Showcase & Conference - Volume 4
CEFT: A cost-effective, fault-tolerant parallel virtual file system
Journal of Parallel and Distributed Computing
Dynamic load balancing for I/O-intensive applications on clusters
ACM Transactions on Storage (TOS)
Hi-index | 0.00 |
In this work, we investigate parallel I/O efficiencies in parallelised BLAST, the most popular tool for searching similarity in biological databases and implement two variations by incorporating the PVFS and CEFT-PVFS parallel I/O facilities. Our goal is to study the performance gain from parallel I/O under the constraints of different numbers of commodity storage devices in a Linux cluster. We also evaluate two read performance optimisation techniques employed in CEFT-PVFS: (1) doubling the degree of parallelism is shown to have comparable read performance with respect to PVFS when both systems have the same number of servers; (2) skipping hot-spot nodes can reduce the performance penalty when I/O workloads are highly imbalanced. The I/O resource contention between multiple applications, running in the same cluster, can degrade the performance of the original parallel BLAST and the PVFS version up to 10- and 21-fold, respectively; whereas, the one based on CEFT-PVFS, which has the ability to skip hot-spot nodes, suffered only a two-fold performance degradation.