Vocabulary independent spoken term detection

Authors:
Jonathan Mamou;Bhuvana Ramabhadran;Olivier Siohan
Affiliations:
IBM Haifa Research Labs, Haifa, Israel;IBM T. J. Watson Research Center, Yorktown Heights, NY;IBM T. J. Watson Research Center, Yorktown Heights, NY
Venue:
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2007

Citing 10
Cited 13

Open-vocabulary speech indexing for voice and video mail retrieval

MULTIMEDIA '96 Proceedings of the fourth ACM international conference on Multimedia
Document expansion for speech retrieval

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Effects of out of vocabulary words in spoken document retrieval (poster session)

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Subword-based approaches for spoken document retrieval

Speech Communication
Advances in phonetic word spotting

Proceedings of the tenth international conference on Information and knowledge management
Mutual relevance feedback for multimodal query formulation in video retrieval

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Spoken document retrieval from call-center conversations

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Position specific posterior lattices for indexing speech

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A system for unrestricted topic retrieval from radio news broadcasts

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
General indexation of weighted automata: application to spoken utterance retrieval

SpeechIR '04 Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004

Web derived pronunciations for spoken term detection

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Fast decoding for open vocabulary spoken term detection

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Multimedia search capabilities of Chinese language search engines

Information Processing and Management: an International Journal
Contextual information improves OOV detection in speech

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Performance analysis for lattice-based speech indexing approaches using words and subword units

IEEE Transactions on Audio, Speech, and Language Processing
Query-driven strategy for on-the-fly term spotting in spontaneous speech

EURASIP Journal on Audio, Speech, and Music Processing - Special issue on scalable audio-content analysis
Novel methods for query selection and query combination in query-by-example spoken term detection

Proceedings of the 2010 international workshop on Searching spontaneous conversational speech
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario

ACM Transactions on Speech and Language Processing (TSLP)
Learning sub-word units for open vocabulary speech recognition

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Direct posterior confidence for out-of-vocabulary spoken term detection

ACM Transactions on Information Systems (TOIS)
Comparison of methods for language-dependent and language-independent query-by-example spoken term detection

ACM Transactions on Information Systems (TOIS)
Query by babbling: a research agenda

Proceedings of the first workshop on Information and knowledge management for developing region
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

We are interested in retrieving information from speech data like broadcast news, telephone conversations and roundtable meetings. Today, most systems use large vocabulary continuous speech recognition tools to produce word transcripts; the transcripts are indexed and query terms are retrieved from the index. However, query terms that are not part of the recognizer's vocabulary cannot be retrieved, and the recall of the search is affected. In addition to the output word transcript, advanced systems provide also phonetic transcripts, against which query terms can be matched phonetically. Such phonetic transcripts suffer from lower accuracy and cannot be an alternative to word transcripts.We present a vocabulary independent system that can handle arbitrary queries, exploiting the information provided by having both word transcripts and phonetic transcripts. A speech recognizer generates word confusion networks and phonetic lattices. The transcripts are indexed for query processing and ranking purpose.The value of the proposed method is demonstrated by the relative high performance ofour system, which received the highest overall ranking for US English speech data in the recent NIST Spoken Term Detection evaluation.