SVDPACK: A Fortran-77 Software Library for the Sparse Singular Value Decomposition
A novel method for spoken text feature extraction in semantic video retrieval
PCM'06 Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information Processing
Video retrieval using high level features: exploiting query matching and confidence-based weighting
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Many researchers use semantic information extracted from visual features either to perform semantic video retrieval directly or to supplement retrieval over automated speech recognition (ASR) text. However, bridging the gap between low-level visual features and semantic content remains a challenging task. In this paper, we study how to use Latent Semantic Indexing (LSI) effectively to improve semantic video retrieval through ASR text. The basic LSI method has been shown to be effective in both traditional text retrieval and noisy ASR text retrieval. We further use lexicon-guided semantic clustering to remove the noise introduced by the additional content of news video, and use cluster-based LSI to automatically mine the semantic structure underlying term expressions. Tests on the TRECVID 2005 dataset show that these two enhancements achieve performance improvements of 21.3% and 6.9% over the traditional vector-space model (VSM) and the basic LSI, respectively.
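For readers unfamiliar with the basic LSI baseline mentioned in the abstract, the following is a minimal sketch of it, not the paper's implementation. It assumes raw term counts (no weighting scheme is given in the abstract), uses NumPy's dense `np.linalg.svd` as a stand-in for a sparse SVD package, and folds the query into the latent space with the standard formula q̂ = qᵀ U_k S_k⁻¹. The function names, the toy transcripts, and the choice k = 2 are all hypothetical; the lexicon-guided clustering and cluster-based LSI steps are not shown because the abstract does not specify them.

```python
import numpy as np

def build_term_doc_matrix(docs):
    """Build a raw-count term-by-document matrix (assumption: no tf-idf weighting)."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    A = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for w in d.lower().split():
            A[index[w], j] += 1.0
    return A, index

def lsi_fit(A, k):
    """Truncated SVD: keep the k largest singular triplets of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]

def lsi_query_scores(query, index, U_k, s_k, Vt_k):
    """Fold the query into the latent space and return cosine scores per document."""
    q = np.zeros(U_k.shape[0])
    for w in query.lower().split():
        if w in index:  # out-of-vocabulary terms are ignored
            q[index[w]] += 1.0
    q_hat = (q @ U_k) / s_k          # q^T U_k S_k^{-1}
    doc_vecs = Vt_k.T                # one latent vector per document (rows of V_k)
    q_norm = np.linalg.norm(q_hat)
    d_norms = np.linalg.norm(doc_vecs, axis=1)
    denom = q_norm * d_norms
    scores = np.zeros(doc_vecs.shape[0])
    nonzero = denom > 0              # avoid dividing by zero for empty vectors
    scores[nonzero] = (doc_vecs[nonzero] @ q_hat) / denom[nonzero]
    return scores

# Hypothetical toy "ASR transcripts", for illustration only.
docs = [
    "stock market falls",
    "market shares rise",
    "storm hits coast",
    "coast flood storm warning",
]
A, index = build_term_doc_matrix(docs)
U_k, s_k, Vt_k = lsi_fit(A, k=2)
scores = lsi_query_scores("storm", index, U_k, s_k, Vt_k)
ranking = np.argsort(-scores)        # document indices, best match first
```

In this toy setup the query "storm" should score the two weather transcripts above the two finance transcripts, since the vocabularies of the two groups do not overlap. For collections of realistic size, a sparse truncated SVD routine would likely replace the dense call used here.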