Review of neural networks for speech recognition
Neural Computation
Hidden Markov models for speech recognition
Technometrics
A text-independent speaker recognition system based on vowel spotting
Speech Communication
State of the art in continuous speech recognition
Voice communication between humans and machines
Connectionist Speech Recognition: A Hybrid Approach
Connectionist Speech Recognition: A Hybrid Approach
IEEE Transactions on Pattern Analysis and Machine Intelligence
Context-Dependent Hybrid HME/HMM Speech Recognition using Polyphone Clustering Decision Trees
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 3 - Volume 3
Combining ANNs To Improve Phone Recognition
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 4 - Volume 4
Hidden Neural Networks: A Framework for HMM/NN Hybrids
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 4 - Volume 4
Phonetic words decoding software in the problem of Russian speech recognition
Automation and Remote Control
Hi-index | 0.00 |
In this paper we present a hybrid ANN/HMM syllable recognition system based on vowel spotting. Using an advanced multilevel vowel-spotting module we track all vowel phonemes in speech signals from where we model the speech segments located between two successive vowels which are defined as syllables. In order to achieve minimum vowel losses and accurate detection, we focus on taking special care of the vowel spotter which is based on three different techniques: discrete hidden Markov models (DHMMs), multilayer perceptrons and heuristic rules.To set up the models of the syllable segments, hybrid DHMMs with multiple codebooks are used. The usual DHMM probability parameters are replaced by combined neural network outputs. For this purpose, we use both context dependent and context independent neural networks.The syllable recognition system was tested with the TIMIT and NTIMIT databases and the results obtained showed 75.09% and 59.30% average syllable recognition accuracy, respectively. It has to be noted that to achieve the above results no grammars or syllable-based lexicons were used.