Applying Bayesian networks to information retrieval
Communications of the ACM
Retrieving spoken documents by combining multiple index sources
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
DL '97 Proceedings of the second ACM international conference on Digital libraries
New techniques for open-vocabulary spoken document retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
CueVideo (demonstration abstract): automated video/audio indexing and browsing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
On Relevance, Probabilistic Indexing and Information Retrieval
Journal of the ACM (JACM)
Query Expansion for Imperfect Speech: Applications in Distributed Learning
CBAIVL '00 Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'00)
Detecting topical events in digital video
MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Structure and content-based segmentation of speech transcripts
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic discovery of salient segments in imperfect speech transcripts
Proceedings of the tenth international conference on Information and knowledge management
Advances in phonetic word spotting
Proceedings of the tenth international conference on Information and knowledge management
Streaming-Media Knowledge Discovery
Computer
Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
Extracting Keyphrases from Spoken Audio Documents
Information Retrieval Techniques for Speech Applications [this book is based on the workshop “Information Retrieval Techniques for Speech Applications”, held as part of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in New Orleans, USA, in September 2001].
Technologies for constructing intelligent systems
Toward speech as a knowledge resource
IBM Systems Journal
A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents
ACM Transactions on Asian Language Information Processing (TALIP)
A multi-modal system for the retrieval of semantic video events
Computer Vision and Image Understanding - Special issue on event detection in video
Exploring the use of latent topical information for statistical Chinese spoken document retrieval
Pattern Recognition Letters
An approximate multi-word matching algorithm for robust document retrieval
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Search the audio, browse the video: a generic paradigm for video collections
EURASIP Journal on Applied Signal Processing
Word and sub-word indexing approaches for reducing the effects of OOV queries on spoken audio
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Spoken Document Retrieval Based on Approximated Sequence Alignment
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Word Topic Models for Spoken Document Retrieval and Transcription
ACM Transactions on Asian Language Information Processing (TALIP)
Machine learning in a multimedia document retrieval framework
IBM Systems Journal
Automatic generation of conference video proceedings
Journal of Visual Communication and Image Representation
The design of phoneme grouping for coarse phoneme recognition
IEA/AIE'07 Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems
IEEE Transactions on Audio, Speech, and Language Processing
An effective access mechanism to digital interview archives
ECDL'05 Proceedings of the 9th European conference on Research and Advanced Technology for Digital Libraries
Direct posterior confidence for out-of-vocabulary spoken term detection
ACM Transactions on Information Systems (TOIS)
Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval
Hi-index | 0.00 |
Combined word-based index and phonetic indexes have been used to improve the performance of spoken document retrieval systems primarily by addressing the out-of-vocabulary retrieval problem. However, a known problem with phonetic recognition is its limited accuracy in comparison with word level recognition. We propose a novel method for phonetic retrieval in the CueVideo system based on the probabilistic formulation of term weighting using phone confusion data in a Bayesian framework. We evaluate this method of spoken document retrieval against word-based retrieval for the search levels identified in a realistic video-based distributed learning setting. Using our test data, we achieved an average recall of 0.88 with an average precision of 0.69 for retrieval of out-of-vocabulary words on phonetic transcripts with 35% word error rate. For in-vocabulary words, we achieved a 17% improvement in recall over word-based retrieval with a 17% loss in precision for word error rites ranging from 35 to 65%.