We are interested in retrieving information from speech data such as broadcast news, telephone conversations and roundtable meetings. Today, most systems use large vocabulary continuous speech recognition tools to produce word transcripts; the transcripts are indexed, and query terms are retrieved from the index. However, query terms that are not part of the recognizer's vocabulary cannot be retrieved, which reduces the recall of the search. In addition to the output word transcript, advanced systems also provide phonetic transcripts, against which query terms can be matched phonetically. Such phonetic transcripts are less accurate, however, and cannot replace word transcripts.

We present a vocabulary-independent system that can handle arbitrary queries by exploiting the information provided by both word transcripts and phonetic transcripts. A speech recognizer generates word confusion networks and phonetic lattices. These transcripts are indexed for query processing and ranking.

The value of the proposed method is demonstrated by the relatively high performance of our system, which received the highest overall ranking for US English speech data in the recent NIST Spoken Term Detection evaluation.
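The idea of combining a word index with a phonetic index can be illustrated with a minimal sketch. It is not the system described above: the class `HybridIndex`, its methods, the routing rule (in-vocabulary terms go to the word index, all other terms to the phonetic index) and the exact phone-sequence match are assumptions made for illustration, and a single best phone string stands in for a full phonetic lattice.

```python
from collections import defaultdict


class HybridIndex:
    """Toy index over word hypotheses and a 1-best phone string (illustrative only)."""

    def __init__(self, lexicon):
        # lexicon: hypothetical map from in-vocabulary word to its phone sequence
        self.lexicon = lexicon
        # word -> list of (doc_id, start_time, confidence)
        self.word_postings = defaultdict(list)
        # phone -> set of (doc_id, position in the phone string)
        self.phone_postings = defaultdict(set)

    def add_word(self, doc_id, word, start_time, confidence):
        """Index one word hypothesis, e.g. one arc of a word confusion network."""
        self.word_postings[word].append((doc_id, start_time, confidence))

    def add_phones(self, doc_id, phones):
        """Index a phone string (assumption: 1-best path instead of a lattice)."""
        for position, phone in enumerate(phones):
            self.phone_postings[phone].add((doc_id, position))

    def search(self, term, pronunciation=None):
        """Route in-vocabulary terms to the word index, other terms to the phone index."""
        if term in self.lexicon:
            hits = self.word_postings.get(term, [])
            # Rank word hits by descending recognizer confidence.
            return sorted(hits, key=lambda hit: -hit[2])
        if not pronunciation:
            # Without a pronunciation, an out-of-vocabulary term cannot be matched.
            return []
        return self._phone_search(pronunciation)

    def _phone_search(self, phones):
        """Return (doc_id, position) pairs where the phone sequence occurs exactly."""
        empty = set()
        starts = self.phone_postings.get(phones[0], empty)
        matches = []
        for doc_id, position in sorted(starts):
            if all(
                (doc_id, position + offset) in self.phone_postings.get(phone, empty)
                for offset, phone in enumerate(phones)
            ):
                matches.append((doc_id, position))
        return matches


# Usage: "news" is in the vocabulary; "obama" is treated as out of vocabulary.
index = HybridIndex(lexicon={"news": ["n", "uw", "z"]})
index.add_word("d1", "news", 1.2, 0.90)
index.add_phones("d1", ["n", "uw", "z", "ow", "b", "aa", "m", "ah"])
word_hits = index.search("news")
phone_hits = index.search("obama", ["ow", "b", "aa", "m", "ah"])
```

A practical system would also have to tolerate phone recognition errors, for example through approximate matching over lattice paths, and would need a way to obtain pronunciations for out-of-vocabulary query terms; both are omitted here.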