Introduction to Modern Information Retrieval
Subword-based approaches for spoken document retrieval
In this paper, we propose a new type of syllable-based unit for recognition and language modeling to improve the recognition rate for Korean phones, syllables, and characters. We propose 'combined' units, which take into account both Korean characters and the syllable units realized in speech. With the proposed units, character, syllable, and phone sequences can be obtained directly from the recognition results. To test the performance of the proposed approach, we perform two types of experiments. First, we build language models for phones, characters, syllables, and the proposed combined units from the same text corpus, and we test the performance of each unit. Second, we perform a vector space model based retrieval experiment using the proposed combined units.
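
As a rough illustration of what a vector space model based retrieval experiment over recognition units can look like, the following minimal sketch ranks documents by TF-IDF cosine similarity. It is not the paper's implementation: the weighting scheme (raw term frequency times a smoothed log IDF), the function names, and the placeholder unit tokens such as "u1" are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
from collections import Counter


def tfidf(tokens, idf):
    """Map a token sequence to a sparse TF-IDF vector (dict: unit -> weight)."""
    tf = Counter(tokens)
    # Units never seen in the collection have no IDF and are ignored.
    return {t: c * idf[t] for t, c in tf.items() if t in idf}


def build_index(docs):
    """Compute IDF weights and document vectors for a list of token lists."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(doc))  # count each unit once per document
    # Assumed smoothing: log(N / df) + 1, so units in every document keep weight 1.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    return idf, [tfidf(doc, idf) for doc in docs]


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank(query, docs):
    """Return document indices sorted by decreasing similarity to the query."""
    idf, vecs = build_index(docs)
    q = tfidf(query, idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vecs)]
    return [i for _, i in sorted(scored, key=lambda x: (-x[0], x[1]))]


# Hypothetical recognizer output: each document is a sequence of unit tokens.
docs = [
    ["u1", "u2", "u3"],
    ["u4", "u5", "u6"],
    ["u1", "u4", "u7"],
]
ranking = rank(["u1", "u2"], docs)
```

In this sketch each document is simply the unit sequence produced by the recognizer, so the same code could in principle be run on phone, syllable, character, or combined-unit transcripts to compare them as indexing terms.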