Fast SCOP Classification of Structural Class and Fold Using Secondary Structure Mining in Distance Matrix

  • Authors:
  • Jian-Yu Shi;Yan-Ning Zhang

  • Affiliations:
  • Faculty of Life Sciences, Northwestern Polytechnical University, and College of Computer Science, Northwestern Polytechnical University, Xi'An, China 710072;College of Computer Science, Northwestern Polytechnical University, Xi'An, China 710072

  • Venue:
  • PRIB '09 Proceedings of the 4th IAPR International Conference on Pattern Recognition in Bioinformatics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

It is an urgent need to understand the structure-function relationship in proteomic era. One of the important techniques to meet this demand is to analyze and represent the spatial structure of domain which is the functional unit of the whole protein, and perform fast domain classification. In this paper, we introduce a novel method of rapid domain classification. Instead of analyzing directly protein sequence or 3-D tertiary structure, the presented method maps firstly tertiary structure of protein domain into 2-D C***-C*** distance matrix. Then, two distance functions for alpha helix and beta strand are modeled by considering their geometrical properties respectively. After that, the distance functions are further applied to mine secondary structure elements in such distance matrix with the way similar to image processing. Furthermore, composition feature and arrangement feature of secondary structure elements are presented to characterize domain structure for classification of structural class and fold in Structural Classification of Proteins (SCOP) database. Finally, the results compared with other methods show that the presented method can perform effectively and efficiently automatic classification of domain with the benefit of low dimension and meaningful features, but also no need of complicated classifier system.