Protein Structure Classification Based on Conserved Hydrophobic Residues

Authors:
Pradeep Chowriappa;Sumeet Dua;Jinko Kanno;Hilary W. Thompson
Affiliations:
Louisiana Tech University, Ruston;Louisiana Tech University, Ruston;Louisiana Tech University, Ruston;Louisiana State University Health Sciences Center, New Orleans
Venue:
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Year:
2009

Citing 7
Cited 0

Random Forests

Machine Learning
Mining protein family specific residue packing patterns from protein structure graphs

RECOMB '04 Proceedings of the eighth annual international conference on Resaerch in computational molecular biology
Mismatch string kernels for discriminative protein classification

Bioinformatics
Mining coherent dense subgraphs across massive biological networks for functional discovery

Bioinformatics
Search for folding nuclei in native protein structures

Bioinformatics
'Protein Peeling': an approach for splitting a 3D protein structure into compact fragments

Bioinformatics
LFM-Pro: a tool for detecting significant local structural sites in proteins

Bioinformatics

Quantified Score

Hi-index	0.01

Visualization

Abstract

Protein folding is frequently guided by local residue interactions that form clusters in the protein core. The interactions between residue clusters serve as potential nucleation sites in the folding process. Evidence postulates that the residue interactions are governed by the hydrophobic propensities that the residues possess. An array of hydrophobicity scales has been developed to determine the hydrophobic propensities of residues under different environmental conditions. In this work, we propose a graph-theory-based data mining framework to extract and isolate protein structural features that sustain invariance in evolutionary-related proteins, through the integrated analysis of five well-known hydrophobicity scales over the 3D structure of proteins. We hypothesize that proteins of the same homology contain conserved hydrophobic residues and exhibit analogous residue interaction patterns in the folded state. The results obtained demonstrate that discriminatory residue interaction patterns shared among proteins of the same family can be employed for both the structural and the functional annotation of proteins. We obtained on the average 90 percent accuracy in protein classification with a significantly small feature vector compared to previous results in the area. This work presents an elaborate study, as well as validation evidence, to illustrate the efficacy of the method and the correctness of results reported.