A study of residue correlation within protein sequences and its application to sequence classification

  • Authors:
  • Chris Hemmerich;Sun Kim

  • Affiliations:
  • Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN, USA;School of Informatics, Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN, USA

  • Venue:
  • EURASIP Journal on Bioinformatics and Systems Biology
  • Year:
  • 2007

Abstract

We investigate methods of estimating residue correlation within protein sequences. We begin by using the mutual information (MI) of adjacent residues, and improve our methodology by defining the mutual information vector (MIV) to estimate long-range correlations between nonadjacent residues. We also consider correlation based on residue hydropathy rather than protein-specific interactions. Finally, in family classification experiments, the modeling power of MIV was shown to be significantly better than that of the classic MI method, reaching the level where proteins can be classified without alignment information.
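
The two quantities named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the exact estimator, so the code assumes a plain plug-in estimate of MI in bits, with marginal probabilities taken from the observed residue pairs, and it assumes that the MIV simply collects one MI value per gap size. The function names and the `max_gap` default are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def mutual_information(seq, gap=0):
    """Plug-in MI estimate (in bits) between residues separated by `gap`
    intervening positions; gap=0 means adjacent residues.
    Assumption: marginals are estimated from the observed pairs."""
    step = gap + 1
    pairs = [(seq[i], seq[i + step]) for i in range(len(seq) - step)]
    if not pairs:
        return 0.0  # sequence too short to form any pair at this gap
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)   # counts of the first residue
    right = Counter(b for _, b in pairs)  # counts of the second residue
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written with raw counts
        mi += p_ab * log2(p_ab * n * n / (left[a] * right[b]))
    return mi


def mutual_information_vector(seq, max_gap=5):
    """One MI value per gap in 0..max_gap (max_gap is an illustrative default)."""
    return [mutual_information(seq, gap) for gap in range(max_gap + 1)]
```

Because the functions only count symbols, the same sketch would apply to a sequence recoded into a reduced alphabet, for example hydropathy classes, provided a residue-to-class mapping is supplied; no such mapping is given in the abstract. Vectors computed this way have a fixed length regardless of sequence length, which is what would allow them to be compared without an alignment.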