Design of Speech Recognition Engine

Authors:
Ludek Müller;Josef Psutka;Lubos Smídl
Affiliations:
-;-;-
Venue:
TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue
Year:
2000

Citing 1
Cited 3

Statistical methods for speech recognition

Statistical methods for speech recognition

The Influence of a Filter Shape in Telephone-Based Recognition Module Using PLP Parameterization

TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Automatic switchboard operator

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Web text data mining for building large scale language modelling corpus

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper concerns a speaker independent recognition engine of Czech continuous speech designed for Czech telephone applications and describes the recognition module as an important component of a telephone dialogue system being designed and constructed at the Department of Cybernetics, the University of West Bohemia. The recognition is based on a statistical approach. The left-to-right three-state HMMs with an output probability density function expressed as multivariate Gaussian mixture are used to model triphones as basic units in acoustic modelling and stochastic regular grammars are implemented to reduce a task perplexity. A real time recognition process is supported by a very computation cost reduction approach estimating log-likelihood scores of Gaussian mixtures and also by a beam pruning used during Viterbi decoding. The present paper concerns the main part of the engine - a speaker independent recognition engine for continuous Czech speech.