Gender-dependent acoustic models fusion developed for automatic subtitling of parliament meetings broadcasted by the Czech TV

Authors:
Jan Vaněk;Josef V. Psutka
Affiliations:
Department of Cybernetics, West Bohemia University, Pilsen, Czech Republic;Department of Cybernetics, West Bohemia University, Pilsen, Czech Republic
Venue:
TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Year:
2010

Citing 2
Cited 2

Discriminative Training of Gender-Dependent Acoustic Models

TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
Automatic online subtitling of the czech parliament meetings

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue

Automatic topic identification for large scale language modeling data filtering

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

Gender-dependent (male/female) acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model. This paper deals with a problem how to use these gender-based acoustic models in a real-time LVCSR (Large Vocabulary Continuous Speech Recognition) system that is for more than one year used by the Czech TV for automatic subtitling of Parliament meetings that are broadcasted on the channel ČT24. Frequent changes of speakers and the direct connection of the LVCSR system to the TV audio stream require switching/fusion of models automatically and as soon as possible. The paper presents various techniques based on using the output probabilities for quick selection of a better model or their combinations. The best proposed method achieved over 11% relative WER reduction in comparision with the GI model.