We present a probabilistic model for the retrieval of multimodal documents. The model is based on Bayesian decision theory and combines models for text-based search with models for visual search. The textual model follows the language modelling approach to text retrieval, and the visual information is modelled as a mixture of Gaussian densities. Both models have proved successful on various standard retrieval tasks. We evaluate the multimodal model on the search task of TREC's video track. We find that retrieving video material on the basis of visual information alone is still too difficult: even for purely visual information needs, text-based retrieval outperforms visual approaches. The probabilistic model is useful for text, visual, and multimedia retrieval. Unfortunately, the simplifying assumptions that reduce its computational complexity also degrade retrieval effectiveness. As to whether the model can effectively combine information from different modalities, we conclude that whenever both modalities yield reasonable scores, a combined run outperforms the individual runs.
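To make the two components concrete, the following is a minimal sketch of how a smoothed unigram language model score and a Gaussian mixture score could be computed and combined. It is not the implementation evaluated in the paper. The function names, the linear-interpolation smoothing weight `lam`, the diagonal covariances, and the combination by adding log-likelihoods (which assumes the textual and visual evidence are independent given the document) are all assumptions made for illustration.

```python
import math
from collections import Counter


def text_log_likelihood(query_terms, doc_terms, collection_terms, lam=0.5):
    """log P(query | document) under a unigram language model.

    Document and collection estimates are linearly interpolated;
    lam is an assumed smoothing weight, not a value from the paper.
    """
    doc_counts = Counter(doc_terms)
    coll_counts = Counter(collection_terms)
    doc_len, coll_len = len(doc_terms), len(collection_terms)
    score = 0.0
    for term in query_terms:
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_coll = coll_counts[term] / coll_len if coll_len else 0.0
        p = lam * p_doc + (1.0 - lam) * p_coll
        if p <= 0.0:
            # Term unseen in both document and collection.
            return float("-inf")
        score += math.log(p)
    return score


def diag_gaussian_pdf(x, mean, var):
    """Density of a Gaussian with diagonal covariance (an assumption)."""
    log_p = 0.0
    for xi, mi, vi in zip(x, mean, var):
        log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return math.exp(log_p)


def visual_log_likelihood(query_samples, mixture):
    """log P(query samples | document mixture).

    mixture is a list of (weight, mean, var) components describing one
    document; query_samples is a list of feature vectors.
    """
    score = 0.0
    for x in query_samples:
        p = sum(w * diag_gaussian_pdf(x, m, v) for w, m, v in mixture)
        if p <= 0.0:
            return float("-inf")
        score += math.log(p)
    return score


def combined_score(text_ll, visual_ll):
    """Add log-likelihoods, assuming the modalities are independent."""
    return text_ll + visual_ll
```

Under these assumptions, documents would be ranked by `combined_score`. A document whose score in one modality is very poor pulls the sum down, which fits the observation that combining helps when both modalities yield reasonable scores.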