Protein classification with multiple algorithms

  • Authors:
  • Sotiris Diplaris;Grigorios Tsoumakas;Pericles A. Mitkas;Ioannis Vlahavas

  • Affiliations:
  • Dept. of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece;Dept. of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece;Dept. of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece;Dept. of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece

  • Venue:
  • PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Nowadays, the number of protein sequences being stored in central protein databases from labs all over the world is constantly increasing. From these proteins only a fraction has been experimentally analyzed in order to detect their structure and hence their function in the corresponding organism. The reason is that experimental determination of structure is labor-intensive and quite time-consuming. Therefore there is the need for automated tools that can classify new proteins to structural families. This paper presents a comparative evaluation of several algorithms that learn such classification models from data concerning patterns of proteins with known structure. In addition, several approaches that combine multiple learning algorithms to increase the accuracy of predictions are evaluated. The results of the experiments provide insights that can help biologists and computer scientists design high-performance protein classification systems of high quality.