Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
Journal of Computational and Applied Mathematics
Algorithms on strings, trees, and sequences: computer science and computational biology
Algorithms on strings, trees, and sequences: computer science and computational biology
Classifying proteins by family using the product of correlated p-values
RECOMB '99 Proceedings of the third annual international conference on Computational molecular biology
RECOMB '00 Proceedings of the fourth annual international conference on Computational molecular biology
Approximation of protein structure for fast similarity measures
RECOMB '03 Proceedings of the seventh annual international conference on Research in computational molecular biology
Accelerating Protein Classification Using Suffix Trees
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Proceedings of the 5th International Conference on Intelligent Systems for Molecular Biology
Cluster validation techniques for genome expression data
Signal Processing - Special issue: Genomic signal processing
Towards Automatic Clustering of Protein Sequences
CSB '02 Proceedings of the IEEE Computer Society Conference on Bioinformatics
IEEE Transactions on Pattern Analysis and Machine Intelligence
Some new indexes of cluster validity
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Clustering of protein substructures for discovery of a novel class of sequence-structure fragments
ITBAM'10 Proceedings of the First international conference on Information technology in bio- and medical informatics
Hi-index | 0.00 |
To study protein clustering is very important in diverse fields such as drug design and environmental industry. For a meaningful clustering, protein structure must be considered. But, protein structures are very complicated and have so much information such as angles, 3-dimensional coordinates. Thus, it is not easy to efficiently compute their relations. In this paper, we present a method to efficiently abstract and cluster protein structures using secondary structure element sequences. Since a secondary structure element sequence is an abstract representation of protein structure, it can be regarded as a useful descriptor to cluster a set of proteins at the abstraction level. Using secondary structure element sequences and their distances, we implemented an automatic protein clustering system and verify their efficiency by experimental results.