A fuzzy sets based generalization of contact maps for the overlap of protein structures

  • Authors:
  • David Pelta;Natalio Krasnogor;Carlos Bousono-Calzon;José L. Verdegay;J. Hirst;Edmund Burke

  • Affiliations:
  • Department of Computer Science and Artificial Intelligence E.T.S.I. Informática, Universidad de Granada, 18071 Granada, Spain;Automated Scheduling, Optimisation and Planning Research Group University of Nottingham, Nottingham NG8 1BB, UK;Universidad Carlos III de Madrid, Avenida de la Universidad 30, Leganes, Madrid, Spain;Department of Computer Science and Artificial Intelligence E.T.S.I. Informática, Universidad de Granada, 18071 Granada, Spain;School of Chemistry, University Park Campus University of Nottingham, Nottingham United Kingdom;Automated Scheduling, Optimisation and Planning Research Group University of Nottingham, Nottingham NG8 1BB, UK

  • Venue:
  • Fuzzy Sets and Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.21

Visualization

Abstract

The comparison of protein structures is an important problem in bioinformatics. As a protein biological role is derived from its three-dimensional native state, the comparison of a new protein structure (with unknown function) with other protein structures (with known biological activity) can shed light into the biological role of the former. Consequently, advances in the comparison (and clustering) of proteins according to their three-dimensional configurations might also have an impact on drug discovery and other biomedical research that relies on understanding the inter-relations between structure and function in proteins. The contributions described in this paper are: Firstly, we propose a generalization of the maximum contact map overlap problem (MAX-CMO) by means of fuzzy sets and systems. The MAX-CMO is a model for protein structure comparison. In our new model, namedgeneralized maximum fuzzy contact map overlap (GMAX-FCMO), a contact map is defined by means of one (or more) fuzzy thresholds and one (or more) membership functions. The advantages and limitations of our new model are discussed. Secondly, we show how a fuzzy sets based metaheuristic can be used to compute protein similarities based on the new model. Finally, we compute the protein structure similarity of real-world proteins and show how our new model correctly measures their (di)similarity.