Bioinformatics: Sequence, Structure and Databanks : A Practical Approach
Bioinformatics: Sequence, Structure and Databanks : A Practical Approach
Proceedings of the 5th International Conference on Intelligent Systems for Molecular Biology
Towards Index-based Similarity Search for Protein Structure Databases
CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
Protein structure alignment using geometrical features
Proceedings of the thirteenth ACM international conference on Information and knowledge management
A novel text modeling approach for structural comparison and alignment of biomolecules
WSEAS Transactions on Computers
Hi-index | 0.00 |
A novel data mining approach for similarity search and knowledge discovery in protein structure databases is proposed. PADS (Protein structure Alignment by Directional shape Signatures) incorporates the three dimensional coordinates of the main atoms of each amino acid and extracts a geometrical shape signature along with the direction of each amino acid. As a result, each protein structure is presented by a series of multidimensional feature vectors representing local geometry, shape, direction, and biological properties of its amino acid molecules. Furthermore, a distance matrix is calculated and is incorporated into a local alignment dynamic programming algorithm to find the similar portions of two given protein structures followed by a sequence alignment step for more efficient filtration. The optimal superimposition of the detected similar regions is used to assess the quality of the results. The proposed algorithm is fast and accurate and hence could be used for analysis and knowledge discovery in large protein structures. The method has been compared with the results from CE, DALI, and CTSS using a representative sample of PDB structures. Several new structures not detected by other methods are detected.