DL '97 Proceedings of the second ACM international conference on Digital libraries
Evaluating evaluation measure stability
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Phonetic confusion matrix based spoken document retrieval
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
A system for unrestricted topic retrieval from radio news broadcasts
ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Position specific posterior lattices for indexing speech
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Searching the audio notebook: keyword search in recorded conversations
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Soft indexing of speech content for search in spoken documents
Computer Speech and Language
Indexing confusion networks for morph-based spoken document retrieval
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A critical assessment of spoken utterance retrieval through approximate lattice representations
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
General indexation of weighted automata: application to spoken utterance retrieval
SpeechIR '04 Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004
IEEE Transactions on Audio, Speech, and Language Processing
Performance analysis for lattice-based speech indexing approaches using words and subword units
IEEE Transactions on Audio, Speech, and Language Processing
ACM Transactions on Speech and Language Processing (TSLP)
Direct posterior confidence for out-of-vocabulary spoken term detection
ACM Transactions on Information Systems (TOIS)
Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval
Hi-index | 0.00 |
We explore the problem of out of vocabulary (OOV) queries in audio indexing systems by comparing three indexing methods on a broadcast news repository containing 75 hours of audio. Our systems are word-based, phoneme-based and a novel system based on syllable-like units called particles. To better examine the performance of these three approaches we use a query set where the percentage of OOVs has been artificially increased to 50%. We additionally investigate whether the combination of the three indexing techniques can yield improvements in retrieval. We explore several simple combination strategies such as weighted combinations. We find that combining word and sub-word based systems results in improved retrieval performance.