Advanced classification approach for neuronal phoneme recognition system based on efficient constructive training algorithm

Authors:
Mondher Frikha;Sabeur Masmoudi;Ahmed Ben Hamida;Mohamed Chtourou
Affiliations:
Advanced Technologies for Medical and Signals, Research Unity, ATMS, ENIS, Sfax University, Sfax, Tunisia;Advanced Technologies for Medical and Signals, Research Unity, ATMS, ENIS, Sfax University, Sfax, Tunisia;Advanced Technologies for Medical and Signals, Research Unity, ATMS, ENIS, Sfax University, Sfax, Tunisia;Control & Energies Managment (CEM) Lab, Research Laboratory, ENIS, Sfax University, Sfax, Tunisia
Venue:
International Journal of Speech Technology
Year:
2013

Citing 16
Cited 0

Multilayer feedforward networks are universal approximators

Neural Networks
Review of neural networks for speech recognition

Neural Computation
Speech recognition using neural networks

Speech recognition using neural networks
Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering

Foundations of Neural Networks, Fuzzy Systems, and Knowledge Engineering
Connected Letter Recognition with a Multi-State Time Delay Neural Network

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Research on Chinese Digit Speech Recognition Based on Multi-Weighted Neural Network

PACIIA '08 Proceedings of the 2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application - Volume 01
Enhanced phone posteriors for improving speech recognition systems

IEEE Transactions on Audio, Speech, and Language Processing
SWAT: a spiking neural network training algorithm for classification problems

IEEE Transactions on Neural Networks
Efficient MLP constructive training algorithm using a neuron recruiting approach for isolated word recognition system

International Journal of Speech Technology
Combining formant frequency based on variable order LPC coding with acoustic features for TIMIT phone recognition

International Journal of Speech Technology
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator

IEEE Transactions on Audio, Speech, and Language Processing
Noise-Robust Automatic Speech Recognition Using a Predictive Echo State Network

IEEE Transactions on Audio, Speech, and Language Processing
Spatio-Temporal Feature-Extraction Techniques for Isolated Gesture Recognition in Arabic Sign Language

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features

IEEE Transactions on Audio, Speech, and Language Processing
Deep and Wide: Multiple Layers in Automatic Speech Recognition

IEEE Transactions on Audio, Speech, and Language Processing
Sparse Multilayer Perceptron for Phoneme Recognition

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces a neural network optimization procedure allowing the generation of multilayer perceptron (MLP) network topologies with few connections, low complexity and high classification performance for phoneme's recognition. An efficient constructive algorithm with incremental training using a new proposed Frame by Frame Neural Networks (FFNN) classification approach for automatic phoneme recognition is thus proposed. It is based on a novel recruiting hidden neuron's procedure for a single hidden-layer. After an initializing phase started with initial small number of hidden neurons, this algorithm allows the Neural Networks (NNs) to adjust automatically its parameters during the training phase. The modular FFNN classification method is then constructed and tested to recognize 5 broad phonetic classes extracted from the TIMIT database. In order to take into account the speech variability related to the coarticulation effect, a Context Window of Three Successive Frame's (CWTSF) analysis is applied. Although, an important reduction of the computational training time is observed, this technique penalized the overall Phone Recognition Rate (PRR) and increased the complexity of the recognition system. To alleviate these limitations, two feature dimensionality reduction techniques respectively based on Principal Component Analysis (PCA) and Self Organizing Maps (SOM) are investigated. It is observed an important improvement in the performance of the recognition system when the PCA technique is applied.Optimal neuronal phone recognition architecture is finally derived according to the following criteria: best PRR, minimum computational training time and complexity of the BPNN architecture.