Speech transcription and spoken document retrieval in finnish

Authors:
Mikko Kurimo;Ville Turunen;Inger Ekman
Affiliations:
Neural Networks Research Centre, Helsinki University of Technology, Espoo, Finland;Neural Networks Research Centre, Helsinki University of Technology, Espoo, Finland;Department of Information Studies, University of Tampere, Finland
Venue:
MLMI'04 Proceedings of the First international conference on Machine Learning for Multimodal Interaction
Year:
2004

Citing 3
Cited 0

Managing gigabytes (2nd ed.): compressing and indexing documents and images

Managing gigabytes (2nd ed.): compressing and indexing documents and images
Indexing and retrieval of broadcast news

Speech Communication - Special issue on accessing information in spoken audio
Unsupervised discovery of morphemes

MPL '02 Proceedings of the ACL-02 workshop on Morphological and phonological learning - Volume 6

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a baseline spoken document retrieval system in Finnish that is based on unlimited vocabulary continuous speech recognition. Due to its agglutinative structure, Finnish speech can not be adequately transcribed using the standard large vocabulary continuous speech recognition approaches. The definition of a sufficient lexicon and the training of the statistical language models are difficult, because the words appear transformed by many inflections and compounds. In this work we apply the recently developed language model that enables n-gram models of morpheme-like subword units discovered in an unsupervised manner. In addition to word-based indexing, we also propose an indexing based on the subword units provided directly by our speech recognizer, and a combination of the both. In an initial evaluation of newsreading in Finnish, we obtained a fairly low recognition error rate and average document retrieval precisions close to what can be obtained from human reference transcripts.