Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision

Authors:
D. O'Shaughnessy;H. Tolba
Affiliations:
INRS-Telecommun., Quebec Univ., Verdun, Que., Canada;-
Venue:
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Year:
1999

Citing 0
Cited 3

Automatic speech recognition and speech variability: A review

Speech Communication
Incorporating the voicing information into HMM-based automatic speech recognition in noisy environments

Speech Communication
Improvement of speaker identification by combining prosodic features with acoustic features

SINOBIOMETRICS'04 Proceedings of the 5th Chinese conference on Advances in Biometric Person Authentication

Quantified Score

Hi-index	0.00

Visualization

Abstract

We show that the concept of voiced-unvoiced (VU) classification of speech sounds can be incorporated not only in speech analysis or speech enhancement processes, but also can be useful for recognition processes. That is, the incorporation of such a classification in a continuous speech recognition (CSR) system not only improves its performance in low SNR environments, but also limits the time and the necessary memory to carry out the process of the recognition. The proposed V-U classification of the speech sounds has two principal functions: (1) it allows the enhancement of the voiced and unvoiced parts of speech separately; (2) it limits the Viterbi (1967) search space, and consequently the process of recognition can be carried out in real time without degrading the performance of the system. We prove via experiments that such a system outperforms the baseline HTK when a V-U decision is included in both front- and far-end of the HTK-based recognizer.