Cross-task portability of a broadcast news speech recognition system

Authors:
N. Bertoldi;F. Brugnara;M. Cettolo;M. Federico;D. Giuliani
Affiliations:
ITC-irst Centro per la Ricerca Scientifica e Tecnologica, Via Sommarive 18, I-38050, Povo, Italy;ITC-irst Centro per la Ricerca Scientifica e Tecnologica, Via Sommarive 18, I-38050, Povo, Italy;ITC-irst Centro per la Ricerca Scientifica e Tecnologica, Via Sommarive 18, I-38050, Povo, Italy;ITC-irst Centro per la Ricerca Scientifica e Tecnologica, Via Sommarive 18, I-38050, Povo, Italy;ITC-irst Centro per la Ricerca Scientifica e Tecnologica, Via Sommarive 18, I-38050, Povo, Italy
Venue:
Speech Communication
Year:
2002

Citing 2
Cited 1

Self-organized language modeling for speech recognition

Readings in speech recognition
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)

Genericity and portability for task-independent speech recognition

Computer Speech and Language

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper reports on experiments of porting the ITC-irst Italian broadcast news recognition system to two spontaneous dialogue domains. Porting was investigated by applying state-of-the-art adaptation methods on acoustic and language models, and by evaluating the trade-off between performance and required amount of task specific annotated data. The use of different levels of supervision for acoustic model adaptation was also studied. By employing 2 h of manually annotated speech, word error rates of 26.0% and 28.4% were achieved by the adapted systems. These results are to be compared with the performance of two domain specific baseline systems, 22.6% and 21.2%, respectively, which were developed on much more training data. Finally, a robust method is presented that allows to tune the insertion of spontaneous speech phenomena by the speech decoder.