Feature analysis and classification of protein secondary structure data

  • Authors:
  • S. Y. M. Shi;P. N. Suganthan

  • Affiliations:
  • School of Electrical and Electronic Engineering, Nanyang Technological University, Republic of Singapore;School of Electrical and Electronic Engineering, Nanyang Technological University, Republic of Singapore

  • Venue:
  • ICANN/ICONIP'03 Proceedings of the 2003 joint international conference on Artificial neural networks and neural information processing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we investigate feature analysis for the prediction of the secondary structure of protein sequences using support vector machines (SVMs) and k-nearest neighbor algorithm (kNN). We apply feature selection and scaling techniques to obtain a number of distinct feature subsets with different features and each scaled differently. The feature selection and the scaling are performed using the mutual information (MI). We formulate the feature selection and scaling as combinatorial optimization problem and obtain solutions using a Hopfield-style algorithm. Our experimental results show that the feature subset selection improves the performance for both SVM and kNN while the feature scaling is consistently beneficial for kNN.