A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Story Segmentation and Detection of Commercials in Broadcast News Video
ADL '98 Proceedings of the Advances in Digital Libraries Conference
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The Journal of Machine Learning Research
The Journal of Machine Learning Research
Accurate methods for the statistics of surprise and coincidence
Computational Linguistics - Special issue on using large corpora: I
Probabilistic author-topic models for information discovery
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Video-based event recognition: activity representation and probabilistic recognition methods
Computer Vision and Image Understanding - Special issue on event detection in video
Maximum entropy model-based baseball highlight detection and classification
Computer Vision and Image Understanding - Special issue on event detection in video
Mining temporal patterns of movement for video content classification
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Semantic indexing and retrieval of video
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Temporal feature induction for baseball highlight classification
Proceedings of the 15th international conference on Multimedia
Situated models of meaning for sports video retrieval
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Shot detection and motion analysis for automatic MPEG-7 annotation of sports videos
ICIAP'05 Proceedings of the 13th international conference on Image Analysis and Processing
Event based indexing of broadcasted sports video by intermodalcollaboration
IEEE Transactions on Multimedia
A unified approach to shot change detection and camera motion characterization
IEEE Transactions on Circuits and Systems for Video Technology
Automated sip detection in naturally-evoked video
ICMI '08 Proceedings of the 10th international conference on Multimodal interfaces
Sports event detection using temporal patterns mining and web-casting text
AREA '08 Proceedings of the 1st ACM workshop on Analysis and retrieval of events/actions and workflows in video streams
ICIAP '09 Proceedings of the 15th International Conference on Image Analysis and Processing
A perceptual hashing algorithm using latent dirichlet allocation
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
A new spatio-temporal method for event detection and personalized retrieval of sports video
Multimedia Tools and Applications
Video fingerprinting using Latent Dirichlet Allocation and facial images
Pattern Recognition
Proceedings of the 20th ACM international conference on Multimedia
Latent topic model for audio retrieval
Pattern Recognition
Journal of Visual Communication and Image Representation
Hi-index | 0.00 |
This paper presents a methodology for automatically indexing a large corpus of broadcast baseball games using an unsupervised content-based approach. The method relies on the learning of a grounded language model which maps query terms to the non-linguistic context to which they refer. Grounded language models are learned from a large, unlabeled corpus of video events. Events are represented using a codebook of automatically discovered temporal patterns of low level features extracted from the raw video. These patterns are associated with words extracted from the closed captioning text using a generalization of Latent Dirichlet Allocation. We evaluate the benefit of the grounded language model by extending a traditional language model based approach to information retrieval. Experimental results indicate that using a grounded language model nearly doubles performance on a held out test set.