A multilayer perceptron postprocessor to hidden Markov modeling for speech recognition
ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Hi-index | 0.00 |
A Chinese speech recognizer with unlimited vocabulary is described. The system has two major components: the acoustic recognition component, which includes an HMM (hidden Markov model)-based phone recognizer, a NN (neural network)-based initial refiner, and a NN-based tone classifier; and the lexical and homonym processor, which is based on a knowledge database extracted from large amounts of texts. This real-time recognizer is implemented on a PC-386 enhanced by only one digital signal processing board on which a TMS-320c25 chip operates as the CPU. On average, it takes only 0.19 s to recognize a one-syllable word. The recognition accuracy for syllables, tones, and words is 92.5%, 99.6%, and 97.5%, respectively.