Incorporating knowledge of secondary structures in a l-system-based encoding for protein folding

  • Authors:
  • Gabriela Ochoa;Gabi Escuela;Natalio Krasnogor

  • Affiliations:
  • Department of Computer Science, Universidad Simon Bolivar, Caracas, Venezuela;Department of Computer Science, Universidad Simon Bolivar, Caracas, Venezuela;School of Computer Science and I.T., University of Nottingham, Nottingham, UK

  • Venue:
  • EA'05 Proceedings of the 7th international conference on Artificial Evolution
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

An encoding scheme for protein folding on lattice models, inspired by parametric L-systems, was proposed. The encoding incorporates problem domain knowledge in the form of predesigned production rules that capture commonly known secondary structures: α-helices and β-sheets. The ability of this encoding to capture protein native conformations was tested using an evolutionary algorithm as the inference procedure for discovering L-systems. Results confirmed the suitability of the proposed representation. It appears that the occurrence of motifs and sub-structures is an important component in protein folding, and these sub-structures may be captured by a grammar-based encoding. This line of research suggests novel and compact encoding schemes for protein folding that may have practical implications in solving meaningful problems in biotechnology such as structure prediction and protein folding.