Improving Performance of Multiple Sequence Alignment Analysis in Multi-Client Environments

Authors:
Umit Catalyurek;Eric Stahlberg;Renato Ferreira;Tahsin M. Kurç;Joel H. Saltz
Affiliations:
-;-;-;-;-
Venue:
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Year:
2002

Citing 9
Cited 4

Simultaneous optimization and evaluation of multiple dimensional queries

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Efficient and extensible algorithms for multi query optimization

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Distributed processing of very large datasets with DataCutter

Parallel Computing - Clusters and computational grids for scientific computing
Efficient execution of multiple query workloads in data analysis applications

Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Common Subexpression Processing in Multiple-Query Processing

IEEE Transactions on Knowledge and Data Engineering
Parallel Computation in Biological Sequence Analysis

IEEE Transactions on Parallel and Distributed Systems
Scheduling Multiple Data Visualization Query Workloads on a Shared Memory Machine

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Multiple Query Processing in Deductive Databases using Query Graphs

VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
Generalized Search Trees for Database Systems

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases

A component-based implementation of multiple sequence alignment

Proceedings of the 2003 ACM symposium on Applied computing
Pairwise Distance Matrix Computation for Multiple Sequence Alignment on the Cell Broadband Engine

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
MT-clustalW: multithreading multiple sequence alignment

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Parallel multiple sequence alignment with decentralized cache support

Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper is concerned with the efficient execution of multiple sequence alignment methods in a multiple client environment. Multiple sequence alignment (MSA) is a computationally expensive method, which is commonly used in computational and molecular biology. Large databases of protein and gene sequences are available to the scientific community. Oftentimes, these databases are accessed by multiple users to execute MSA queries. The data server has to handle multiple concurrent queries in such situations. We look at the effect of data caching on the performance of the data server. We describe an approach for caching intermediate results for reuse in subsequent or concurrent queries. We focus on progressive alignment-based strategies, in particular the CLUSTAL W algorithm. Our results for 350 sets of sequences show an average speedup of up to 2.5 is obtained by caching intermediate results. Our results also show that the cache-enabled CLUSTAL W program scales well on a SMP machine.