Bayesian network multi-classifiers for protein secondary structure prediction

  • Authors:
  • Vıctor Robles;Pedro Larrañaga;José M. Peña;Ernestina Menasalvas;Marıa S. Pérez;Vanessa Herves;Anita Wasilewska

  • Affiliations:
  • Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Science and Artificial Intelligence, University of the Basque Country, San Sebastián, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Architecture and Technology, Technical University of Madrid, Madrid, Spain;Department of Computer Science, University of Stony Brook, Stony Brook, NY, USA

  • Venue:
  • Artificial Intelligence in Medicine
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Successful secondary structure predictions provide a starting point for direct tertiary structure modelling, and also can significantly improve sequence analysis and sequence-structure threading for aiding in structure and function determination. Hence the improvement of predictive accuracy of the secondary structure prediction becomes essential for future development of the whole field of protein research. In this work we present several multi-classifiers that combine the predictions of the best current classifiers available on Internet. Our results prove that combining the predictions of a set of classifiers by creating composite classifiers is a fruitful one. We have created multi-classifiers that are more accurate than any of the component classifiers. The multi-classifiers are based on Bayesian networks. They are validated with 9 different datasets. Their predictive accuracy results outperform the best secondary structure predictors by 1.21% on average. Our main contributions are: (i) we improved the best know predictive accuracy by 1.21%, (ii) our best results have been obtained with a new semi nai@?ve Bayes approach named Pazzani-EDA and (iii) our multi-classifiers combine results of previously build classifiers predictions obtained through Internet, thanks to our development of a Java application.