Counter-Propagation Neural Networks for Molecular Sequence Classification: Supervised LVQ and Dynamic Node Allocation

Authors:
Cathy Wu;Hsi-Lien Chen;Sheng-Chih Chen
Affiliations:
Department of Epidemiology/Biomathematics, The University of Texas Health Center at Tyler, Tyler, TX 75710 E-mail: wu@uthct.edu;Department of Epidemiology/Biomathematics, The University of Texas Health Center at Tyler, Tyler, TX 75710 E-mail: wu@uthct.edu;Intelligent Computing Systems, Tyler, TX 75703
Venue:
Applied Intelligence
Year:
1997

Citing 8
Cited 1

Self-organization and associative memory: 3rd edition

Self-organization and associative memory: 3rd edition
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Neurocomputing

Neurocomputing
Neural network architectures: an introduction

Neural network architectures: an introduction
Artificial neural systems: foundations, paradigms, applications, and implementations

Artificial neural systems: foundations, paradigms, applications, and implementations
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations
Optimal brain damage

Advances in neural information processing systems 2
Neural Networks for Full-Scale Protein Sequence Classification: Sequence Encoding with Singular Value Decomposition

Machine Learning - Special issue on applications in molecular biology

Flexible case-based retrieval for comparative genomics

Applied Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

A modified counter-propagation (CP) algorithm with supervised learning vector quantizer (LVQ) and dynamic node allocation has been developedfor rapid classification of molecular sequences. The molecular sequences were encoded into neural input vectors using an n-gram hashing method for word extraction and a singular value decomposition (SVD) method for vector compression. The neural networks used were three-layered, forward-only CP networks that performed nearest neighbor classification. Several factors affecting the CP performance were evaluated, including weight initialization, Kohonen layer dimensioning, winner selection and weight update mechanisms. The performance of the modified CP network was compared with the back-propagation (BP) neural network and the k-nearest neighbor method. The major advantages of the CP network are its training and classification speed and its capability to extract statistical properties of the input data. The combined BP and CP networks can classify nucleic acid or protein sequences with a close to 100% accuracy at a rate of about one order of magnitude faster than other currently available methods.