Protein Structure Indexing using Suffix Array (PSISA) is a new technique that retrieves similar proteins based on their structures. Indexing protein structures is one approach to searching for protein similarities. In this paper we develop the technique around a novel use of the suffix array. We start by converting a protein structure into a sequence: local feature vectors are extracted, their components are normalized, and the normalized vectors are converted into a sequence. The sequence is indexed with a suffix array, which is then used in the search process to retrieve proteins with similar structure. Proteins with high structural similarity are ranked according to their alignment score against the query protein. Experimental results on the Structural Classification of Proteins (SCOP) dataset show that our method outperforms existing similar methods in memory utilization, improving memory usage by more than 35%.
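The pipeline described above (feature vectors, normalization, conversion to a sequence, suffix-array indexing, lookup) can be illustrated with a minimal Python sketch. It is not the PSISA implementation: the abstract does not specify the feature vectors, the normalization scheme, or the alignment scoring, so the min-max normalization, the `bins` parameter, and the function names `normalize_and_quantize`, `build_suffix_array`, and `find_occurrences` are all assumptions made for illustration. The suffix array is built by naive sorting, which is fine for a small example but not for a real database, and ranking by alignment score is left out.

```python
def normalize_and_quantize(vectors, bins=4):
    """Turn feature vectors into a symbol sequence (assumed scheme).

    Each component is min-max normalized to [0, 1] and then mapped to one
    of `bins` integer levels; a symbol is the tuple of levels.
    """
    dims = len(vectors[0])
    lows = [min(v[d] for v in vectors) for d in range(dims)]
    highs = [max(v[d] for v in vectors) for d in range(dims)]
    symbols = []
    for v in vectors:
        sym = []
        for d in range(dims):
            span = highs[d] - lows[d]
            x = (v[d] - lows[d]) / span if span > 0 else 0.0
            # x == 1.0 would land in bin `bins`, so clamp to the last bin.
            sym.append(min(int(x * bins), bins - 1))
        symbols.append(tuple(sym))
    return symbols


def build_suffix_array(seq):
    """Return the start positions of all suffixes in lexicographic order.

    Naive construction by sorting whole suffixes; simple, not efficient.
    """
    return sorted(range(len(seq)), key=lambda i: seq[i:])


def find_occurrences(seq, sa, pattern):
    """Return sorted start positions where `pattern` occurs in `seq`.

    Two binary searches over the suffix array locate the block of
    suffixes whose first len(pattern) symbols equal the pattern.
    """
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose prefix is >= pattern
        mid = (lo + hi) // 2
        if seq[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    hi = len(sa)
    while lo < hi:  # first suffix whose prefix is > pattern
        mid = (lo + hi) // 2
        if seq[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])
```

In a sketch like this, a query structure would be quantized the same way as the indexed structures, its symbol subsequences looked up with `find_occurrences`, and the matching proteins then ranked by a separate alignment score. The appeal of a suffix array over a suffix tree is that it stores only one integer per suffix, which is consistent with the memory savings the abstract reports.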