Ensemble of diversely trained support vector machines for protein fold recognition

  • Authors:
  • Abdollah Dehzangi;Abdul Sattar

  • Affiliations:
  • Institute for Integrated and Intelligent Systems (IIIS), Griffith University, Brisbane, Australia, National ICT Australia (NICTA), Brisbane, Australia;Institute for Integrated and Intelligent Systems (IIIS), Griffith University, Brisbane, Australia, National ICT Australia (NICTA), Brisbane, Australia

  • Venue:
  • ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Protein Fold Recognition (PFR) is defined as assigning a given protein to a fold based on its major secondary structure. PFR is considered as an important step toward protein structure prediction and drug design. However, it still remains as an unsolved problem for biological science and bioinformatics. In this study, we explore the impact of two novel feature extraction methods namely overlapped segmented distribution and overlapped segmented autocorrelation to provide more local discriminatory information for the PFR compared to previously proposed methods found in the literature. We study the impact of our proposed feature extraction methods using 15 promising physicochemical attributes of the amino acids. Afterwards, by proposing an ensemble Support Vector Machines (SVM) which are diversely trained using features extracted from different physicochemical-based attributes, we enhance the protein fold prediction accuracy for up to 5% better than similar studies found in the literature.