New techniques for extracting features from protein sequences

  • Authors:
  • J. T. L. Wang;Q. Ma;D. Shasha;C. H. Wu

  • Affiliations:
  • Department of Computer and Information Science, New Jersey Institute of Technology, University Heights, Newark, New Jersey;Novartis Pharmaceuticals Corporation, Summit, New Jersey;Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, New York;National Biomedical Research Foundation, Georgetown University Medical Center, 3900 Reservoir Road, NW, Washington, DC

  • Venue:
  • IBM Systems Journal - Deep computing for the life sciences
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose new techniques to extract features from protein sequences. We then use the features as inputs for a Bayesian neural network (BNN) and apply the BNN to classifying protein sequences obtained from the PIR (Protein Information Resource) database maintained at the National Biomedical Research Foundation. To evaluate the performance of the proposed approach, we compare it with other protein classifiers built based on sequence alignment and machine learning methods. Experimental results show the high precision of the proposed classifier and the complementarity of the bioinformatics tools studied in the paper.