Counter-Propagation Neural Networks for Molecular Sequence Classification: Supervised LVQ and Dynamic Node Allocation

  • Authors:
  • Cathy Wu;Hsi-Lien Chen;Sheng-Chih Chen

  • Affiliations:
  • Department of Epidemiology/Biomathematics, The University of Texas Health Center at Tyler, Tyler, TX 75710 E-mail: wu@uthct.edu;Department of Epidemiology/Biomathematics, The University of Texas Health Center at Tyler, Tyler, TX 75710 E-mail: wu@uthct.edu;Intelligent Computing Systems, Tyler, TX 75703

  • Venue:
  • Applied Intelligence
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

A modified counter-propagation (CP) algorithm with supervised learning vector quantizer (LVQ) and dynamic node allocation has been developedfor rapid classification of molecular sequences. The molecular sequences were encoded into neural input vectors using an n-gram hashing method for word extraction and a singular value decomposition (SVD) method for vector compression. The neural networks used were three-layered, forward-only CP networks that performed nearest neighbor classification. Several factors affecting the CP performance were evaluated, including weight initialization, Kohonen layer dimensioning, winner selection and weight update mechanisms. The performance of the modified CP network was compared with the back-propagation (BP) neural network and the k-nearest neighbor method. The major advantages of the CP network are its training and classification speed and its capability to extract statistical properties of the input data. The combined BP and CP networks can classify nucleic acid or protein sequences with a close to 100% accuracy at a rate of about one order of magnitude faster than other currently available methods.