Document expansion for speech retrieval
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Building searchable collections of enterprise speech data
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion
ACM Transactions on Asian Language Information Processing (TALIP)
Position specific posterior lattices for indexing speech
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Soft indexing of speech content for search in spoken documents
Computer Speech and Language
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Vocabulary independent spoken term detection
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Indexing confusion networks for morph-based spoken document retrieval
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Access to recorded interviews: A research agenda
Journal on Computing and Cultural Heritage (JOCCH)
Web derived pronunciations for spoken term detection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
The effect of language models on phonetic decoding for spoken term detection
SSCS '09 Proceedings of the third workshop on Searching spontaneous conversational speech
IEEE Transactions on Audio, Speech, and Language Processing
ACM Transactions on Speech and Language Processing (TSLP)
Automated speech and audio analysis for semantic access to multimedia
SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
Direct posterior confidence for out-of-vocabulary spoken term detection
ACM Transactions on Information Systems (TOIS)
Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval
Hi-index | 0.00 |
The effects of out-of-vocabulary (OOV) items in spoken document retrieval (SDR) are investigated. Several sets of transcriptions were created for the TREC-8 SDR task using a speech recognition system varying the vocabulary sizes and OOV rates, and the relative retrieval performance measured. The effects of OOV terms on a simple baseline IR system and on more sophisticated retrieval systems are described. The use of a parallel corpus for query and document expansion is found to be especially beneficial, and with this data set, good retrieval performance can be achieved even for fairly high OOV rates.