Using gradient descent optimization for acoustics training from heterogeneous data

Authors:
Martin Karafiát;Igor Szöke;Jan Černocký
Affiliations:
Brno University of Technology, Faculty of Information Technology, Department of Computer Graphics and Multimedia, Speech@FIT, Brno, Czech Republic;Brno University of Technology, Faculty of Information Technology, Department of Computer Graphics and Multimedia, Speech@FIT, Brno, Czech Republic;Brno University of Technology, Faculty of Information Technology, Department of Computer Graphics and Multimedia, Speech@FIT, Brno, Czech Republic
Venue:
TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Year:
2010

Citing 1
Cited 0

Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition

Investigation of silicon auditory models and generalization of linear discriminant analysis for improved speech recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we study the use of heterogeneous data for training of acoustic models. In initial experiments, a significant drop of accuracy has been observed on in-domain test set if the data was added without any regularization. A solution is proposed by getting control over the training data by optimization of the weights of different data-sets. The final models shows good performance on all various tests linked to various speaking styles. Furthermore, we used this approach to increase the performance over just the main test set. We obtained 0.3% absolute improvement on basic system and 0.4% on HLDA system although the size of the heterogeneous data set was quite small.