A similarity network approach for the analysis and comparison of protein sequence/structure sets

  • Authors:
  • Ioannis Valavanis;George Spyrou;Konstantina Nikita

  • Affiliations:
  • School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., Zografos, 157 80 Athens, Greece;Biomedical Informatics Unit, Biomedical Research Foundation, Academy of Athens, 4 Soranou Efessiou Str., 115 27 Athens, Greece;School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., Zografos, 157 80 Athens, Greece

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A set of proteins is a complex system whose elements are interrelated on the concept of sequence- and structure-based similarity. Here, we applied a similarity network-based methodology for the representation and analysis of protein sequences and structures sets using a non-redundant set of 311 proteins and three different information criteria based on sequence-derived features, sequence local alignment and structural alignment. A wide set of measurements, like network degree, clustering coefficient, characteristic path length and vertex centrality were utilized to characterize the networks' topology. Protein similarity networks were found medium or highly interconnected and the existence of both clusters and random edges classified their fully connected versions as Small World Networks (SWNs). The SWN architecture was able to host the continuous similarity transition among proteins and model the protein information flow during evolution. Recently reported ancestral elements, like the @a/@b class and certain folds, were remarkably found to act as hubs in the networks. Additionally, the moderate information value of sequence-derived features when used for fold and class assignment was shown on a network basis. The methodology described here can be applied for the analysis of other complex systems which consist of interrelated elements and a certain information flow.